albert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
align
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
altclip
Hotfix
2023-08-19 11:15:38 +02:00
audio_spectrogram_transformer
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
auto
Remote code improvements ( #23959 )
2023-06-06 14:31:14 -04:00
autoformer
Compute dropout_probability only in training mode ( #24486 )
2023-06-26 18:36:47 +02:00
bark
Update Bark generation configs and tests ( #25409 )
2023-08-09 18:28:02 +02:00
bart
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
barthez
bartpho
beit
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
bert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
bert_generation
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
bert_japanese
bertweet
big_bird
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
bigbird_pegasus
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
biogpt
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
bit
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
blenderbot
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
blenderbot_small
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
blip
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
blip_2
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
bloom
Fix failing test_batch_generation for bloom ( #25718 )
2023-08-24 11:15:29 +02:00
bridgetower
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
byt5
camembert
Better TF docstring types ( #23477 )
2023-05-24 13:52:52 +01:00
canine
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
chinese_clip
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
clap
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
clip
Add FlaxCLIPTextModelWithProjection ( #25254 )
2023-08-25 10:58:14 +02:00
clipseg
Fix test_model_parallelism ( #25359 )
2023-08-08 10:48:45 +02:00
code_llama
Update-llama-code ( #25826 )
2023-09-01 20:40:40 +02:00
codegen
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
conditional_detr
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
convbert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
convnext
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
convnextv2
Update ConvNextV2ModelIntegrationTest::test_inference_image_classification_head ( #23402 )
2023-05-16 23:35:11 +02:00
cpm
cpmant
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
ctrl
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
cvt
Make more test models smaller ( #25005 )
2023-07-24 10:08:47 -04:00
data2vec
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
deberta
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
deberta_v2
Add TFDebertaV2ForMultipleChoice ( #25932 )
2023-09-05 17:13:06 +01:00
decision_transformer
deformable_detr
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
deit
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
deta
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
detr
Fix Detr CI ( #25972 )
2023-09-05 11:19:56 +02:00
dinat
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
dinov2
[DINOv2] Add backbone class ( #25520 )
2023-08-29 11:05:27 +01:00
distilbert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
dit
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
donut
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
dpr
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
dpt
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
efficientformer
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
efficientnet
🚨 🚨 🚨 Remove softmax for EfficientNetForImageClassification 🚨 🚨 🚨 ( #25501 )
2023-08-14 17:08:47 +01:00
electra
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
encodec
Make more test models smaller ( #25005 )
2023-07-24 10:08:47 -04:00
encoder_decoder
Move TF building to an actual build() method ( #23760 )
2023-06-06 18:30:51 +01:00
ernie
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
ernie_m
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
esm
Fix test_model_parallelism ( #25359 )
2023-08-08 10:48:45 +02:00
falcon
Put Falcon back ( #25960 )
2023-09-04 14:17:09 -04:00
flaubert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
flava
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
fnet
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
focalnet
Update tiny models and pipeline tests ( #23446 )
2023-05-18 17:29:04 +02:00
fsmt
funnel
Big TF test cleanup ( #24282 )
2023-06-16 15:40:49 +01:00
git
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
glpn
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
gpt_bigcode
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
gpt_neo
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
gpt_neox
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
gpt_neox_japanese
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
gpt_sw3
gpt2
Correct attention mask dtype for Flax GPT2 ( #25636 )
2023-08-25 17:36:37 +02:00
gptj
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
gptsan_japanese
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
graphormer
Make more test models smaller ( #25005 )
2023-07-24 10:08:47 -04:00
groupvit
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
herbert
hubert
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
ibert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
idefics
[idefics] idefics-9b test use 4bit quant ( #25734 )
2023-08-24 08:33:14 -07:00
imagegpt
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
informer
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
instructblip
[InstructBlip] FINAL Fix instructblip test ( #25887 )
2023-08-31 17:01:27 +02:00
jukebox
Replaces calls to .cuda with .to(torch_device) in tests ( #25571 )
2023-08-18 12:40:40 +02:00
layoutlm
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
layoutlmv2
[split_special_tokens] Add support for split_special_tokens argument to encode ( #25081 )
2023-08-18 13:26:27 +02:00
layoutlmv3
[split_special_tokens] Add support for split_special_tokens argument to encode ( #25081 )
2023-08-18 13:26:27 +02:00
layoutxlm
[split_special_tokens] Add support for split_special_tokens argument to encode ( #25081 )
2023-08-18 13:26:27 +02:00
led
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
levit
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
lilt
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
llama
[LlamaTokenizer] tokenize nits. ( #25793 )
2023-08-29 15:08:14 +02:00
longformer
Fix more offload edge cases ( #25342 )
2023-08-07 17:45:41 +02:00
longt5
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
luke
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
lxmert
Big TF test cleanup ( #24282 )
2023-06-16 15:40:49 +01:00
m2m_100
marian
Fix MarianTokenizer to remove metaspace character in decode ( #26091 )
2023-09-12 21:53:31 +02:00
markuplm
[split_special_tokens] Add support for split_special_tokens argument to encode ( #25081 )
2023-08-18 13:26:27 +02:00
mask2former
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
maskformer
Fix MaskFormerModelIntegrationTest OOM ( #25544 )
2023-08-16 18:11:24 +02:00
mbart
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
mbart50
mega
Fix Mega chunking error when using decoder-only model ( #25765 )
2023-09-05 21:50:14 +02:00
megatron_bert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
megatron_gpt2
mgp_str
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
mluke
mobilebert
Hotfix
2023-08-19 11:15:38 +02:00
mobilenet_v1
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
mobilenet_v2
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
mobilevit
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
mobilevitv2
Make more test models smaller ( #25005 )
2023-07-24 10:08:47 -04:00
mpnet
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
mpt
Fix test_modeling_mpt typo in model id ( #25606 )
2023-08-21 11:11:21 +02:00
mra
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
mt5
Better TF docstring types ( #23477 )
2023-05-24 13:52:52 +01:00
musicgen
[MusicGen] Fix integration tests ( #25169 )
2023-07-28 18:50:15 +01:00
mvp
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
nat
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
nezha
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
nllb
nllb_moe
[NllbMoe] Update code to properly support loss computation ( #25429 )
2023-08-17 17:21:56 +02:00
nystromformer
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
oneformer
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
openai
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
opt
Replaces calls to .cuda with .to(torch_device) in tests ( #25571 )
2023-08-18 12:40:40 +02:00
owlvit
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
pegasus
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
pegasus_x
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
perceiver
Fix last models for common tests that are too big. ( #25058 )
2023-07-25 07:56:04 -04:00
persimmon
[Persimmon] Add support for persimmon ( #26042 )
2023-09-12 11:33:27 +02:00
phobert
pix2struct
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
plbart
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
poolformer
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
pop2piano
update remaining Pop2Piano checkpoints ( #25827 )
2023-08-29 18:00:40 +01:00
prophetnet
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
pvt
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
qdqbert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
rag
Big TF test cleanup ( #24282 )
2023-06-16 15:40:49 +01:00
realm
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
reformer
Generate: skip left-padding tests on old models ( #23437 )
2023-05-18 11:04:51 +01:00
regnet
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
rembert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
resnet
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
roberta
Fix test_model_parallelism ( #25359 )
2023-08-08 10:48:45 +02:00
roberta_prelayernorm
Fix test_model_parallelism ( #25359 )
2023-08-08 10:48:45 +02:00
roc_bert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
roformer
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
rwkv
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
sam
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
segformer
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
sew
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
sew_d
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
speech_encoder_decoder
speech_to_text
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
speech_to_text_2
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
speecht5
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
splinter
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
squeezebert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
swiftformer
Fix last models for common tests that are too big. ( #25058 )
2023-07-25 07:56:04 -04:00
swin
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
swin2sr
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
swinv2
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
switch_transformers
Switch Transformers: remove overwritten beam sample test ( #25458 )
2023-08-11 13:16:01 +01:00
t5
[LlamaTokenizer] tokenize nits. ( #25793 )
2023-08-29 15:08:14 +02:00
table_transformer
Fix last models for common tests that are too big. ( #25058 )
2023-07-25 07:56:04 -04:00
tapas
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
time_series_transformer
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
timesformer
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
timm_backbone
Fix last models for common tests that are too big. ( #25058 )
2023-07-25 07:56:04 -04:00
transfo_xl
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
trocr
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
tvlt
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
umt5
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
unispeech
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
unispeech_sat
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
upernet
Fix last models for common tests that are too big. ( #25058 )
2023-07-25 07:56:04 -04:00
videomae
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
vilt
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
vision_encoder_decoder
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
vision_text_dual_encoder
Fix VisionTextDualEncoderIntegrationTest ( #24661 )
2023-07-05 13:44:30 +02:00
visual_bert
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
vit
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
vit_hybrid
fix vit hybrid test ( #25543 )
2023-08-16 17:02:57 +02:00
vit_mae
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
vit_msn
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
vitdet
Skip offload tests for ViTDet ( #25913 )
2023-09-04 11:35:39 +02:00
vits
[VITS] Fix nightly tests ( #25986 )
2023-09-07 17:49:14 +01:00
vivit
Input data format ( #25464 )
2023-08-16 17:45:02 +01:00
wav2vec2
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
wav2vec2_conformer
[Wav2Vec2 Conformer] Fix inference float16 ( #25985 )
2023-09-05 18:26:06 +01:00
wav2vec2_phoneme
wav2vec2_with_lm
wavlm
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
whisper
[CITests] skip failing tests until #26054 is merged ( #26063 )
2023-09-09 05:43:26 +02:00
x_clip
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
xglm
Replaces calls to .cuda with .to(torch_device) in tests ( #25571 )
2023-08-18 12:40:40 +02:00
xlm
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
xlm_prophetnet
xlm_roberta
Better TF docstring types ( #23477 )
2023-05-24 13:52:52 +01:00
xlm_roberta_xl
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
xlnet
Skip test_contrastive_generate for TFXLNet ( #25574 )
2023-08-17 18:56:34 +02:00
xmod
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
yolos
Refactor image processor testers ( #25450 )
2023-08-11 11:30:18 +01:00
yoso
CI with num_hidden_layers=2 🚀 🚀 🚀 ( #25266 )
2023-08-02 20:22:36 +02:00
__init__.py