..
albert
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
align
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
altclip
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
audio_spectrogram_transformer
is_batched fix for remaining 2-D numpy arrays ( #23309 )
2023-05-23 14:37:35 -04:00
auto
Remote code improvements ( #23959 )
2023-06-06 14:31:14 -04:00
autoformer
Compute dropout_probability
only in training mode ( #24486 )
2023-06-26 18:36:47 +02:00
bark
Add bark ( #24086 )
2023-07-17 17:53:24 +01:00
bart
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
barthez
bartpho
beit
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
bert
fix: Text splitting in the BasicTokenizer ( #22280 )
2023-07-11 11:07:58 -04:00
bert_generation
bert_japanese
bertweet
big_bird
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
bigbird_pegasus
Generate: skip left-padding tests on old models ( #23437 )
2023-05-18 11:04:51 +01:00
biogpt
Update tiny models and pipeline tests ( #23446 )
2023-05-18 17:29:04 +02:00
bit
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
blenderbot
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
blenderbot_small
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
blip
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
blip_2
Add InstructBLIP ( #23460 )
2023-06-26 11:23:57 +02:00
bloom
Byebye pytorch 1.9 ( #24080 )
2023-06-16 16:38:23 +02:00
bridgetower
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
byt5
camembert
Better TF docstring types ( #23477 )
2023-05-24 13:52:52 +01:00
canine
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
chinese_clip
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
clap
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
clip
fix: Text splitting in the BasicTokenizer ( #22280 )
2023-07-11 11:07:58 -04:00
clipseg
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
codegen
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
2023-06-27 12:15:49 +01:00
conditional_detr
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
convbert
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
convnext
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
convnextv2
Update ConvNextV2ModelIntegrationTest::test_inference_image_classification_head
( #23402 )
2023-05-16 23:35:11 +02:00
cpm
cpmant
Update tiny models and pipeline tests ( #23446 )
2023-05-18 17:29:04 +02:00
ctrl
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
cvt
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
data2vec
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
deberta
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
deberta_v2
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
decision_transformer
deformable_detr
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
deit
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
deta
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
detr
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
dinat
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
dinov2
Add DINOv2 ( #24016 )
2023-07-18 15:34:06 +01:00
distilbert
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
dit
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
donut
dpr
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
dpt
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
efficientformer
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
efficientnet
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
electra
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
encodec
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
encoder_decoder
Move TF building to an actual build() method ( #23760 )
2023-06-06 18:30:51 +01:00
ernie
ernie_m
esm
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
falcon
Falcon port ( #24523 )
2023-07-11 13:36:31 +01:00
flaubert
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
flava
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
fnet
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
focalnet
Update tiny models and pipeline tests ( #23446 )
2023-05-18 17:29:04 +02:00
fsmt
funnel
Big TF test cleanup ( #24282 )
2023-06-16 15:40:49 +01:00
git
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
glpn
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
gpt_bigcode
Add torch >=1.12
requirement for Tapas
( #24251 )
2023-06-13 19:19:40 +02:00
gpt_neo
Update tiny models and pipeline tests ( #23446 )
2023-05-18 17:29:04 +02:00
gpt_neox
Llama/GPTNeoX: add RoPE scaling ( #24653 )
2023-07-13 16:47:30 +01:00
gpt_neox_japanese
gpt_sw3
gpt2
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
gptj
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
gptsan_japanese
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
graphormer
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
groupvit
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
herbert
hubert
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
ibert
imagegpt
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
informer
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
instructblip
[InstructBLIP] Fix bos token of LLaMa checkpoints ( #24492 )
2023-07-11 20:43:01 +01:00
jukebox
layoutlm
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
layoutlmv2
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
layoutlmv3
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
layoutxlm
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
led
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
levit
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
lilt
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
llama
[Llama2
] Add support for Llama 2 ( #24891 )
2023-07-18 15:18:31 -04:00
longformer
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
longt5
luke
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
lxmert
Big TF test cleanup ( #24282 )
2023-06-16 15:40:49 +01:00
m2m_100
marian
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
markuplm
mask2former
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
maskformer
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
mbart
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
mbart50
mega
megatron_bert
megatron_gpt2
mgp_str
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
mluke
mobilebert
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
mobilenet_v1
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
mobilenet_v2
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
mobilevit
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
mobilevitv2
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
mpnet
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
mra
Add Multi Resolution Analysis (MRA) (New PR) ( #24513 )
2023-07-10 10:50:43 +01:00
mt5
Better TF docstring types ( #23477 )
2023-05-24 13:52:52 +01:00
musicgen
Skip torchscript tests for MusicgenForConditionalGeneration
( #24782 )
2023-07-13 15:54:18 +02:00
mvp
Generate: skip left-padding tests on old models ( #23437 )
2023-05-18 11:04:51 +01:00
nat
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
nezha
nllb
nllb_moe
nystromformer
oneformer
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
openai
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
opt
Big TF test cleanup ( #24282 )
2023-06-16 15:40:49 +01:00
owlvit
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
pegasus
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
pegasus_x
perceiver
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
phobert
pix2struct
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
plbart
Generate: skip left-padding tests on old models ( #23437 )
2023-05-18 11:04:51 +01:00
poolformer
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
prophetnet
Generate: skip left-padding tests on old models ( #23437 )
2023-05-18 11:04:51 +01:00
qdqbert
rag
Big TF test cleanup ( #24282 )
2023-06-16 15:40:49 +01:00
realm
Unpin numba ( #23162 )
2023-05-31 14:59:30 +01:00
reformer
Generate: skip left-padding tests on old models ( #23437 )
2023-05-18 11:04:51 +01:00
regnet
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
rembert
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
resnet
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
roberta
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
roberta_prelayernorm
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
roc_bert
roformer
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
rwkv
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
2023-06-27 12:15:49 +01:00
sam
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
segformer
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
sew
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
2023-06-27 12:15:49 +01:00
sew_d
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
2023-06-27 12:15:49 +01:00
speech_encoder_decoder
speech_to_text
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
speech_to_text_2
speecht5
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
splinter
squeezebert
swiftformer
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
swin
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
swin2sr
swinv2
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
switch_transformers
[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words ( #24622 )
2023-07-11 15:02:18 +02:00
t5
[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words ( #24622 )
2023-07-11 15:02:18 +02:00
table_transformer
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
tapas
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
time_series_transformer
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
timesformer
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
timm_backbone
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
transfo_xl
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
trocr
Generate: skip left-padding tests on old models ( #23437 )
2023-05-18 11:04:51 +01:00
tvlt
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
umt5
[Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words ( #24622 )
2023-07-11 15:02:18 +02:00
unispeech
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
2023-06-27 12:15:49 +01:00
unispeech_sat
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
2023-06-27 12:15:49 +01:00
upernet
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
videomae
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
vilt
Removal of deprecated vision methods and specify deprecation versions ( #24570 )
2023-06-29 15:09:51 +01:00
vision_encoder_decoder
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
vision_text_dual_encoder
Fix VisionTextDualEncoderIntegrationTest
( #24661 )
2023-07-05 13:44:30 +02:00
visual_bert
Revert "Fix gradient checkpointing + fp16 autocast for most models" ( #24420 )
2023-06-22 16:11:27 +02:00
vit
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
vit_hybrid
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
vit_mae
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
vit_msn
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
vivit
Check models used for common tests are small ( #24824 )
2023-07-14 14:43:19 -04:00
wav2vec2
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
wav2vec2_conformer
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
2023-06-27 12:15:49 +01:00
wav2vec2_phoneme
wav2vec2_with_lm
wavlm
Fix TypeError: Object of type int64 is not JSON serializable ( #24340 )
2023-06-27 12:15:49 +01:00
whisper
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
x_clip
Update some torchscript tests after #24505 ( #24566 )
2023-06-29 16:05:24 +02:00
xglm
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
xlm
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
xlm_prophetnet
xlm_roberta
Better TF docstring types ( #23477 )
2023-05-24 13:52:52 +01:00
xlm_roberta_xl
xlnet
Speed up TF tests by reducing hidden layer counts ( #24595 )
2023-06-30 16:30:33 +01:00
xmod
yolos
Update old existing feature extractor references ( #24552 )
2023-06-29 10:17:36 +01:00
yoso
__init__.py