transformers/tests/models
Tom Aarsen 79444f370f
Deprecate unused OpenLlama architecture (#24922)
* Resolve typo in check_repo.py

* Specify encoding when opening modeling files

* Deprecate the OpenLlama architecture

* Add disclaimer pointing to Llama

I'm open to different wordings here

* Match the capitalisation of LLaMA
2023-07-20 07:03:24 -04:00
..
albert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
align Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
altclip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
audio_spectrogram_transformer is_batched fix for remaining 2-D numpy arrays (#23309) 2023-05-23 14:37:35 -04:00
auto Remote code improvements (#23959) 2023-06-06 14:31:14 -04:00
autoformer Compute dropout_probability only in training mode (#24486) 2023-06-26 18:36:47 +02:00
bark Add bark (#24086) 2023-07-17 17:53:24 +01:00
bart Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
barthez
bartpho
beit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
bert fix: Text splitting in the BasicTokenizer (#22280) 2023-07-11 11:07:58 -04:00
bert_generation
bert_japanese
bertweet
big_bird Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
bigbird_pegasus Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
biogpt Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
bit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
blenderbot Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blenderbot_small Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blip Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blip_2 Add InstructBLIP (#23460) 2023-06-26 11:23:57 +02:00
bloom Byebye pytorch 1.9 (#24080) 2023-06-16 16:38:23 +02:00
bridgetower Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
byt5
camembert Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
canine Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
chinese_clip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
clap Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
clip fix: Text splitting in the BasicTokenizer (#22280) 2023-07-11 11:07:58 -04:00
clipseg Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
codegen Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
conditional_detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
convbert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
convnext Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
convnextv2 Update ConvNextV2ModelIntegrationTest::test_inference_image_classification_head (#23402) 2023-05-16 23:35:11 +02:00
cpm
cpmant Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
ctrl Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
cvt Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
data2vec Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deberta Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deberta_v2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
decision_transformer
deformable_detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
deit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deta Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
dinat Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
dinov2 Add DINOv2 (#24016) 2023-07-18 15:34:06 +01:00
distilbert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
dit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
donut
dpr Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
dpt Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
efficientformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
efficientnet Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
electra Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
encodec Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
encoder_decoder Move TF building to an actual build() method (#23760) 2023-06-06 18:30:51 +01:00
ernie
ernie_m
esm Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
falcon Falcon port (#24523) 2023-07-11 13:36:31 +01:00
flaubert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
flava Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
fnet Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
focalnet Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
fsmt
funnel Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
git Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
glpn Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
gpt_bigcode Add torch >=1.12 requirement for Tapas (#24251) 2023-06-13 19:19:40 +02:00
gpt_neo Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
gpt_neox Llama/GPTNeoX: add RoPE scaling (#24653) 2023-07-13 16:47:30 +01:00
gpt_neox_japanese
gpt_sw3
gpt2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
gptj Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
gptsan_japanese Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
graphormer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
groupvit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
herbert
hubert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
ibert
imagegpt Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
informer Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
instructblip [InstructBLIP] Fix bos token of LLaMa checkpoints (#24492) 2023-07-11 20:43:01 +01:00
jukebox
layoutlm Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
layoutlmv2 Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
layoutlmv3 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
layoutxlm Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
led Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
levit Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
lilt Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
llama [Llama2] Add support for Llama 2 (#24891) 2023-07-18 15:18:31 -04:00
longformer Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
longt5
luke Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
lxmert Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
m2m_100
marian Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
markuplm
mask2former Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
maskformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
mbart Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mbart50
mega
megatron_bert
megatron_gpt2
mgp_str Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mluke
mobilebert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mobilenet_v1 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mobilenet_v2 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mobilevit Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
mobilevitv2 Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
mpnet Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mra Add Multi Resolution Analysis (MRA) (New PR) (#24513) 2023-07-10 10:50:43 +01:00
mt5 Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
musicgen Skip torchscript tests for MusicgenForConditionalGeneration (#24782) 2023-07-13 15:54:18 +02:00
mvp Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
nat Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
nezha
nllb
nllb_moe
nystromformer
oneformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
openai Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
opt Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
owlvit Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
pegasus Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
pegasus_x
perceiver Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
phobert
pix2struct Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
plbart Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
poolformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
prophetnet Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
qdqbert
rag Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
realm Unpin numba (#23162) 2023-05-31 14:59:30 +01:00
reformer Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
regnet Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
rembert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
resnet Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
roberta Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
roberta_prelayernorm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
roc_bert
roformer Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
rwkv Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
sam Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
segformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
sew Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
sew_d Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
speech_encoder_decoder
speech_to_text Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
speech_to_text_2
speecht5 Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
splinter
squeezebert
swiftformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
swin Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
swin2sr
swinv2 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
switch_transformers [Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622) 2023-07-11 15:02:18 +02:00
t5 [Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622) 2023-07-11 15:02:18 +02:00
table_transformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
tapas Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
time_series_transformer Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
timesformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
timm_backbone Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
transfo_xl Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
trocr Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
tvlt Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
umt5 [Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622) 2023-07-11 15:02:18 +02:00
unispeech Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
unispeech_sat Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
upernet Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
videomae Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
vilt Removal of deprecated vision methods and specify deprecation versions (#24570) 2023-06-29 15:09:51 +01:00
vision_encoder_decoder Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vision_text_dual_encoder Fix VisionTextDualEncoderIntegrationTest (#24661) 2023-07-05 13:44:30 +02:00
visual_bert Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
vit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
vit_hybrid Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vit_mae Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
vit_msn Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vivit Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
wav2vec2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
wav2vec2_conformer Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
wav2vec2_phoneme
wav2vec2_with_lm
wavlm Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
whisper Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
x_clip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
xglm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xlm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xlm_prophetnet
xlm_roberta Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
xlm_roberta_xl
xlnet Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xmod
yolos Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
yoso
__init__.py