transformers

Rinat a03d13c83d Pvt model (#24720 ) * pull and push updates * add docs * fix modeling * Add and run test * make copies * add task * fix tests and fix small issues * Checks on a Pull Request * fix docs * add desc pvt.md	2023-07-24 15:34:19 +01:00
..
albert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
align	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
altclip	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
audio_spectrogram_transformer	is_batched fix for remaining 2-D numpy arrays (#23309 )	2023-05-23 14:37:35 -04:00
auto	Remote code improvements (#23959 )	2023-06-06 14:31:14 -04:00
autoformer	Compute `dropout_probability` only in training mode (#24486 )	2023-06-26 18:36:47 +02:00
bark	Add bark (#24086 )	2023-07-17 17:53:24 +01:00
bart	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
barthez	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
bartpho	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
beit	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
bert	fix: Text splitting in the BasicTokenizer (#22280 )	2023-07-11 11:07:58 -04:00
bert_generation	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
bert_japanese	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
bertweet	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
big_bird	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
bigbird_pegasus	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
biogpt	Update tiny models and pipeline tests (#23446 )	2023-05-18 17:29:04 +02:00
bit	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
blenderbot	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
blenderbot_small	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
blip	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
blip_2	Add InstructBLIP (#23460 )	2023-06-26 11:23:57 +02:00
bloom	Byebye pytorch 1.9 (#24080 )	2023-06-16 16:38:23 +02:00
bridgetower	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
byt5	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
camembert	Better TF docstring types (#23477 )	2023-05-24 13:52:52 +01:00
canine	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
chinese_clip	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
clap	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
clip	fix: Text splitting in the BasicTokenizer (#22280 )	2023-07-11 11:07:58 -04:00
clipseg	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
codegen	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
conditional_detr	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
convbert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
convnext	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
convnextv2	Update `ConvNextV2ModelIntegrationTest::test_inference_image_classification_head` (#23402 )	2023-05-16 23:35:11 +02:00
cpm	Fix PipelineTests skip conditions (#22320 )	2023-03-22 20:02:24 +01:00
cpmant	Update tiny models and pipeline tests (#23446 )	2023-05-18 17:29:04 +02:00
ctrl	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
cvt	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
data2vec	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
deberta	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
deberta_v2	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
decision_transformer	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
deformable_detr	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
deit	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
deta	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
detr	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
dinat	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
dinov2	Add DINOv2 (#24016 )	2023-07-18 15:34:06 +01:00
distilbert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
dit	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
donut	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
dpr	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
dpt	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
efficientformer	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
efficientnet	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
electra	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
encodec	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
encoder_decoder	Move TF building to an actual build() method (#23760 )	2023-06-06 18:30:51 +01:00
ernie	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
ernie_m	Automatically create/update tiny models (#22275 )	2023-03-23 19:14:17 +01:00
esm	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
falcon	Falcon port (#24523 )	2023-07-11 13:36:31 +01:00
flaubert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
flava	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
fnet	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
focalnet	Update tiny models and pipeline tests (#23446 )	2023-05-18 17:29:04 +02:00
fsmt	update_pip_test_mapping (#22606 )	2023-04-06 17:56:06 +02:00
funnel	Big TF test cleanup (#24282 )	2023-06-16 15:40:49 +01:00
git	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
glpn	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
gpt_bigcode	Add `torch >=1.12` requirement for `Tapas` (#24251 )	2023-06-13 19:19:40 +02:00
gpt_neo	Update tiny models and pipeline tests (#23446 )	2023-05-18 17:29:04 +02:00
gpt_neox	Llama/GPTNeoX: add RoPE scaling (#24653 )	2023-07-13 16:47:30 +01:00
gpt_neox_japanese	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
gpt_sw3	Add gpt-sw3 model to transformers (#20209 )	2022-12-12 13:12:13 -05:00
gpt2	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
gptj	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
gptsan_japanese	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
graphormer	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
groupvit	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
herbert	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
hubert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
ibert	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
imagegpt	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
informer	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
instructblip	[InstructBLIP] Fix bos token of LLaMa checkpoints (#24492 )	2023-07-11 20:43:01 +01:00
jukebox	Update `Jukebox` tests (#21984 )	2023-03-07 04:20:14 +01:00
layoutlm	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
layoutlmv2	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
layoutlmv3	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
layoutxlm	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
led	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
levit	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
lilt	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
llama	[`LlamaConfig`] Nit: pad token should be None by default (#24958 )	2023-07-21 14:32:34 +02:00
longformer	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
longt5	update_pip_test_mapping (#22606 )	2023-04-06 17:56:06 +02:00
luke	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
lxmert	Big TF test cleanup (#24282 )	2023-06-16 15:40:49 +01:00
m2m_100	update_pip_test_mapping (#22606 )	2023-04-06 17:56:06 +02:00
marian	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
markuplm	Update some `MarkupLM` tests' expected values (#22667 )	2023-04-11 10:00:34 +02:00
mask2former	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
maskformer	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
mbart	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
mbart50	Replace `as_target` context managers by direct calls (#18325 )	2022-07-29 08:09:09 -04:00
mega	Fix `MegaModel` CI (#22652 )	2023-04-07 17:13:04 +02:00
megatron_bert	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
megatron_gpt2	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
mgp_str	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
mluke	Black preview (#17217 )	2022-05-12 16:25:55 -04:00
mobilebert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
mobilenet_v1	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
mobilenet_v2	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
mobilevit	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
mobilevitv2	Make more test models smaller (#25005 )	2023-07-24 10:08:47 -04:00
mpnet	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
mra	Add Multi Resolution Analysis (MRA) (New PR) (#24513 )	2023-07-10 10:50:43 +01:00
mt5	Better TF docstring types (#23477 )	2023-05-24 13:52:52 +01:00
musicgen	Skip torchscript tests for `MusicgenForConditionalGeneration` (#24782 )	2023-07-13 15:54:18 +02:00
mvp	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
nat	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
nezha	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
nllb	🚨🚨🚨 `[NLLB Tokenizer]` Fix the prefix tokens 🚨🚨🚨 (#22313 )	2023-04-04 14:53:06 +02:00
nllb_moe	tests: Fix flaky test for NLLB-MoE (#22880 )	2023-04-21 17:09:40 +01:00
nystromformer	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
oneformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
openai	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
opt	Big TF test cleanup (#24282 )	2023-06-16 15:40:49 +01:00
owlvit	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
pegasus	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
pegasus_x	update_pip_test_mapping (#22606 )	2023-04-06 17:56:06 +02:00
perceiver	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
phobert	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
pix2struct	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
plbart	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
poolformer	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
prophetnet	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
pvt	Pvt model (#24720 )	2023-07-24 15:34:19 +01:00
qdqbert	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
rag	Big TF test cleanup (#24282 )	2023-06-16 15:40:49 +01:00
realm	Unpin numba (#23162 )	2023-05-31 14:59:30 +01:00
reformer	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
regnet	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
rembert	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
resnet	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
roberta	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
roberta_prelayernorm	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
roc_bert	Move `is_pipeline_test_to_skip` to specific model test classes (#21999 )	2023-03-14 10:03:02 +01:00
roformer	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
rwkv	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
sam	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
segformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
sew	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
sew_d	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
speech_encoder_decoder	Update quality tooling for formatting (#21480 )	2023-02-06 18:10:56 -05:00
speech_to_text	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
speech_to_text_2	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
speecht5	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
splinter	Make tiny model creation + pipeline testing more robust (#22500 )	2023-04-06 17:45:55 +02:00
squeezebert	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
swiftformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
swin	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
swin2sr	Skip `test_multi_gpu_data_parallel_forward` for some model tests (#21991 )	2023-03-07 14:23:36 +01:00
swinv2	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
switch_transformers	[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )	2023-07-11 15:02:18 +02:00
t5	[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )	2023-07-11 15:02:18 +02:00
table_transformer	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
tapas	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
time_series_transformer	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
timesformer	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
timm_backbone	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
transfo_xl	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
trocr	Generate: skip left-padding tests on old models (#23437 )	2023-05-18 11:04:51 +01:00
tvlt	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
umt5	[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )	2023-07-11 15:02:18 +02:00
unispeech	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
unispeech_sat	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
upernet	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
videomae	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
vilt	Removal of deprecated vision methods and specify deprecation versions (#24570 )	2023-06-29 15:09:51 +01:00
vision_encoder_decoder	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
vision_text_dual_encoder	Fix `VisionTextDualEncoderIntegrationTest` (#24661 )	2023-07-05 13:44:30 +02:00
visual_bert	Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )	2023-06-22 16:11:27 +02:00
vit	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
vit_hybrid	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
vit_mae	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
vit_msn	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
vivit	Check models used for common tests are small (#24824 )	2023-07-14 14:43:19 -04:00
wav2vec2	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
wav2vec2_conformer	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
wav2vec2_phoneme	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00
wav2vec2_with_lm	Fix `test_word_time_stamp_integration` for `Wav2Vec2ProcessorWithLMTest` (#22800 )	2023-04-17 12:41:55 +02:00
wavlm	Fix TypeError: Object of type int64 is not JSON serializable (#24340 )	2023-06-27 12:15:49 +01:00
whisper	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
x_clip	Update some torchscript tests after #24505 (#24566 )	2023-06-29 16:05:24 +02:00
xglm	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
xlm	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
xlm_prophetnet	Update expected values in `XLMProphetNetModelIntegrationTest` (#21957 )	2023-03-06 09:15:44 +01:00
xlm_roberta	Better TF docstring types (#23477 )	2023-05-24 13:52:52 +01:00
xlm_roberta_xl	Use real tokenizers if tiny version(s) creation has issue(s) (#22428 )	2023-03-29 16:16:23 +02:00
xlnet	Speed up TF tests by reducing hidden layer counts (#24595 )	2023-06-30 16:30:33 +01:00
xmod	Use real tokenizers if tiny version(s) creation has issue(s) (#22428 )	2023-03-29 16:16:23 +02:00
yolos	Update old existing feature extractor references (#24552 )	2023-06-29 10:17:36 +01:00
yoso	🔥Rework pipeline testing by removing `PipelineTestCaseMeta` 🚀 (#21516 )	2023-02-28 19:40:57 +01:00
__init__.py	Move test model folders (#17034 )	2022-05-03 14:42:02 +02:00

albert

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

align

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

altclip

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

audio_spectrogram_transformer

is_batched fix for remaining 2-D numpy arrays (#23309 )

2023-05-23 14:37:35 -04:00

auto

Remote code improvements (#23959 )

2023-06-06 14:31:14 -04:00

autoformer

Compute dropout_probability only in training mode (#24486 )

2023-06-26 18:36:47 +02:00

bark

Add bark (#24086 )

2023-07-17 17:53:24 +01:00

bart

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

barthez

Update quality tooling for formatting (#21480 )

2023-02-06 18:10:56 -05:00

bartpho

Update quality tooling for formatting (#21480 )

2023-02-06 18:10:56 -05:00

beit

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

bert

fix: Text splitting in the BasicTokenizer (#22280 )

2023-07-11 11:07:58 -04:00

bert_generation

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

bert_japanese

Update quality tooling for formatting (#21480 )

2023-02-06 18:10:56 -05:00

bertweet

Update quality tooling for formatting (#21480 )

2023-02-06 18:10:56 -05:00

big_bird

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )

2023-06-22 16:11:27 +02:00

bigbird_pegasus

Generate: skip left-padding tests on old models (#23437 )

2023-05-18 11:04:51 +01:00

biogpt

Update tiny models and pipeline tests (#23446 )

2023-05-18 17:29:04 +02:00

bit

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

blenderbot

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

blenderbot_small

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

blip

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

blip_2

Add InstructBLIP (#23460 )

2023-06-26 11:23:57 +02:00

bloom

Byebye pytorch 1.9 (#24080 )

2023-06-16 16:38:23 +02:00

bridgetower

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

byt5

Update quality tooling for formatting (#21480 )

2023-02-06 18:10:56 -05:00

camembert

Better TF docstring types (#23477 )

2023-05-24 13:52:52 +01:00

canine

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

chinese_clip

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

clap

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

clip

fix: Text splitting in the BasicTokenizer (#22280 )

2023-07-11 11:07:58 -04:00

clipseg

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

codegen

Fix TypeError: Object of type int64 is not JSON serializable (#24340 )

2023-06-27 12:15:49 +01:00

conditional_detr

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

convbert

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

convnext

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

convnextv2

Update ConvNextV2ModelIntegrationTest::test_inference_image_classification_head (#23402 )

2023-05-16 23:35:11 +02:00

cpm

Fix PipelineTests skip conditions (#22320 )

2023-03-22 20:02:24 +01:00

cpmant

Update tiny models and pipeline tests (#23446 )

2023-05-18 17:29:04 +02:00

ctrl

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

cvt

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

data2vec

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

deberta

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

deberta_v2

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

decision_transformer

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

deformable_detr

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

deit

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

deta

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

detr

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

dinat

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

dinov2

Add DINOv2 (#24016 )

2023-07-18 15:34:06 +01:00

distilbert

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

dit

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

donut

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

dpr

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

dpt

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

efficientformer

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

efficientnet

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

electra

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

encodec

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

encoder_decoder

Move TF building to an actual build() method (#23760 )

2023-06-06 18:30:51 +01:00

ernie

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

ernie_m

Automatically create/update tiny models (#22275 )

2023-03-23 19:14:17 +01:00

esm

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

falcon

Falcon port (#24523 )

2023-07-11 13:36:31 +01:00

flaubert

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

flava

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

fnet

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )

2023-06-22 16:11:27 +02:00

focalnet

Update tiny models and pipeline tests (#23446 )

2023-05-18 17:29:04 +02:00

fsmt

update_pip_test_mapping (#22606 )

2023-04-06 17:56:06 +02:00

funnel

Big TF test cleanup (#24282 )

2023-06-16 15:40:49 +01:00

git

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

glpn

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

gpt_bigcode

Add torch >=1.12 requirement for Tapas (#24251 )

2023-06-13 19:19:40 +02:00

gpt_neo

Update tiny models and pipeline tests (#23446 )

2023-05-18 17:29:04 +02:00

gpt_neox

Llama/GPTNeoX: add RoPE scaling (#24653 )

2023-07-13 16:47:30 +01:00

gpt_neox_japanese

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

gpt_sw3

Add gpt-sw3 model to transformers (#20209 )

2022-12-12 13:12:13 -05:00

gpt2

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

gptj

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

gptsan_japanese

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

graphormer

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

groupvit

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

herbert

Update quality tooling for formatting (#21480 )

2023-02-06 18:10:56 -05:00

hubert

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

ibert

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

imagegpt

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

informer

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )

2023-06-22 16:11:27 +02:00

instructblip

[InstructBLIP] Fix bos token of LLaMa checkpoints (#24492 )

2023-07-11 20:43:01 +01:00

jukebox

Update Jukebox tests (#21984 )

2023-03-07 04:20:14 +01:00

layoutlm

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

layoutlmv2

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

layoutlmv3

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

layoutxlm

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

led

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

levit

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

lilt

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )

2023-06-22 16:11:27 +02:00

llama

[LlamaConfig] Nit: pad token should be None by default (#24958 )

2023-07-21 14:32:34 +02:00

longformer

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

longt5

update_pip_test_mapping (#22606 )

2023-04-06 17:56:06 +02:00

luke

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )

2023-06-22 16:11:27 +02:00

lxmert

Big TF test cleanup (#24282 )

2023-06-16 15:40:49 +01:00

m2m_100

update_pip_test_mapping (#22606 )

2023-04-06 17:56:06 +02:00

marian

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

markuplm

Update some MarkupLM tests' expected values (#22667 )

2023-04-11 10:00:34 +02:00

mask2former

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

maskformer

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

mbart

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

mbart50

Replace as_target context managers by direct calls (#18325 )

2022-07-29 08:09:09 -04:00

mega

Fix MegaModel CI (#22652 )

2023-04-07 17:13:04 +02:00

megatron_bert

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

megatron_gpt2

Move test model folders (#17034 )

2022-05-03 14:42:02 +02:00

mgp_str

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

mluke

Black preview (#17217 )

2022-05-12 16:25:55 -04:00

mobilebert

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

mobilenet_v1

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

mobilenet_v2

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

mobilevit

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

mobilevitv2

Make more test models smaller (#25005 )

2023-07-24 10:08:47 -04:00

mpnet

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

mra

Add Multi Resolution Analysis (MRA) (New PR) (#24513 )

2023-07-10 10:50:43 +01:00

mt5

Better TF docstring types (#23477 )

2023-05-24 13:52:52 +01:00

musicgen

Skip torchscript tests for MusicgenForConditionalGeneration (#24782 )

2023-07-13 15:54:18 +02:00

mvp

Generate: skip left-padding tests on old models (#23437 )

2023-05-18 11:04:51 +01:00

nat

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

nezha

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

nllb

🚨🚨🚨 [NLLB Tokenizer] Fix the prefix tokens 🚨🚨🚨 (#22313 )

2023-04-04 14:53:06 +02:00

nllb_moe

tests: Fix flaky test for NLLB-MoE (#22880 )

2023-04-21 17:09:40 +01:00

nystromformer

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

oneformer

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

openai

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

opt

Big TF test cleanup (#24282 )

2023-06-16 15:40:49 +01:00

owlvit

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

pegasus

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

pegasus_x

update_pip_test_mapping (#22606 )

2023-04-06 17:56:06 +02:00

perceiver

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

phobert

Update quality tooling for formatting (#21480 )

2023-02-06 18:10:56 -05:00

pix2struct

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

plbart

Generate: skip left-padding tests on old models (#23437 )

2023-05-18 11:04:51 +01:00

poolformer

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

prophetnet

Generate: skip left-padding tests on old models (#23437 )

2023-05-18 11:04:51 +01:00

pvt

Pvt model (#24720 )

2023-07-24 15:34:19 +01:00

qdqbert

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

rag

Big TF test cleanup (#24282 )

2023-06-16 15:40:49 +01:00

realm

Unpin numba (#23162 )

2023-05-31 14:59:30 +01:00

reformer

Generate: skip left-padding tests on old models (#23437 )

2023-05-18 11:04:51 +01:00

regnet

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

rembert

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

resnet

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

roberta

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

roberta_prelayernorm

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

roc_bert

Move is_pipeline_test_to_skip to specific model test classes (#21999 )

2023-03-14 10:03:02 +01:00

roformer

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

rwkv

Fix TypeError: Object of type int64 is not JSON serializable (#24340 )

2023-06-27 12:15:49 +01:00

sam

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )

2023-06-22 16:11:27 +02:00

segformer

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

sew

Fix TypeError: Object of type int64 is not JSON serializable (#24340 )

2023-06-27 12:15:49 +01:00

sew_d

Fix TypeError: Object of type int64 is not JSON serializable (#24340 )

2023-06-27 12:15:49 +01:00

speech_encoder_decoder

Update quality tooling for formatting (#21480 )

2023-02-06 18:10:56 -05:00

speech_to_text

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

speech_to_text_2

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

speecht5

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

splinter

Make tiny model creation + pipeline testing more robust (#22500 )

2023-04-06 17:45:55 +02:00

squeezebert

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

swiftformer

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

swin

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

swin2sr

Skip test_multi_gpu_data_parallel_forward for some model tests (#21991 )

2023-03-07 14:23:36 +01:00

swinv2

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

switch_transformers

[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )

2023-07-11 15:02:18 +02:00

t5

[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )

2023-07-11 15:02:18 +02:00

table_transformer

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

tapas

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

time_series_transformer

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )

2023-06-22 16:11:27 +02:00

timesformer

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

timm_backbone

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

transfo_xl

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

trocr

Generate: skip left-padding tests on old models (#23437 )

2023-05-18 11:04:51 +01:00

tvlt

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

umt5

[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622 )

2023-07-11 15:02:18 +02:00

unispeech

Fix TypeError: Object of type int64 is not JSON serializable (#24340 )

2023-06-27 12:15:49 +01:00

unispeech_sat

Fix TypeError: Object of type int64 is not JSON serializable (#24340 )

2023-06-27 12:15:49 +01:00

upernet

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

videomae

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

vilt

Removal of deprecated vision methods and specify deprecation versions (#24570 )

2023-06-29 15:09:51 +01:00

vision_encoder_decoder

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

vision_text_dual_encoder

Fix VisionTextDualEncoderIntegrationTest (#24661 )

2023-07-05 13:44:30 +02:00

visual_bert

Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420 )

2023-06-22 16:11:27 +02:00

vit

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

vit_hybrid

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

vit_mae

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

vit_msn

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

vivit

Check models used for common tests are small (#24824 )

2023-07-14 14:43:19 -04:00

wav2vec2

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

wav2vec2_conformer

Fix TypeError: Object of type int64 is not JSON serializable (#24340 )

2023-06-27 12:15:49 +01:00

wav2vec2_phoneme

Move test model folders (#17034 )

2022-05-03 14:42:02 +02:00

wav2vec2_with_lm

Fix test_word_time_stamp_integration for Wav2Vec2ProcessorWithLMTest (#22800 )

2023-04-17 12:41:55 +02:00

wavlm

Fix TypeError: Object of type int64 is not JSON serializable (#24340 )

2023-06-27 12:15:49 +01:00

whisper

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

x_clip

Update some torchscript tests after #24505 (#24566 )

2023-06-29 16:05:24 +02:00

xglm

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

xlm

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

xlm_prophetnet

Update expected values in XLMProphetNetModelIntegrationTest (#21957 )

2023-03-06 09:15:44 +01:00

xlm_roberta

Better TF docstring types (#23477 )

2023-05-24 13:52:52 +01:00

xlm_roberta_xl

Use real tokenizers if tiny version(s) creation has issue(s) (#22428 )

2023-03-29 16:16:23 +02:00

xlnet

Speed up TF tests by reducing hidden layer counts (#24595 )

2023-06-30 16:30:33 +01:00

xmod

Use real tokenizers if tiny version(s) creation has issue(s) (#22428 )

2023-03-29 16:16:23 +02:00

yolos

Update old existing feature extractor references (#24552 )

2023-06-29 10:17:36 +01:00

yoso

🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516 )

2023-02-28 19:40:57 +01:00

__init__.py

Move test model folders (#17034 )

2022-05-03 14:42:02 +02:00