transformers/tests/models
Rinat a03d13c83d
Pvt model (#24720)
* pull and push updates
* add docs
* fix modeling
* Add and run test
* make copies
* add task
* fix tests and fix small issues
* Checks on a Pull Request
* fix docs
* add desc pvt.md
2023-07-24 15:34:19 +01:00
albert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
align Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
altclip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
audio_spectrogram_transformer is_batched fix for remaining 2-D numpy arrays (#23309) 2023-05-23 14:37:35 -04:00
auto Remote code improvements (#23959) 2023-06-06 14:31:14 -04:00
autoformer Compute dropout_probability only in training mode (#24486) 2023-06-26 18:36:47 +02:00
bark Add bark (#24086) 2023-07-17 17:53:24 +01:00
bart Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
barthez Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
bartpho Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
beit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
bert fix: Text splitting in the BasicTokenizer (#22280) 2023-07-11 11:07:58 -04:00
bert_generation 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
bert_japanese Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
bertweet Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
big_bird Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
bigbird_pegasus Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
biogpt Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
bit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
blenderbot Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blenderbot_small Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blip Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blip_2 Add InstructBLIP (#23460) 2023-06-26 11:23:57 +02:00
bloom Byebye pytorch 1.9 (#24080) 2023-06-16 16:38:23 +02:00
bridgetower Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
byt5 Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
camembert Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
canine Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
chinese_clip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
clap Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
clip fix: Text splitting in the BasicTokenizer (#22280) 2023-07-11 11:07:58 -04:00
clipseg Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
codegen Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
conditional_detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
convbert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
convnext Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
convnextv2 Update ConvNextV2ModelIntegrationTest::test_inference_image_classification_head (#23402) 2023-05-16 23:35:11 +02:00
cpm Fix PipelineTests skip conditions (#22320) 2023-03-22 20:02:24 +01:00
cpmant Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
ctrl Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
cvt Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
data2vec Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deberta Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deberta_v2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
decision_transformer 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
deformable_detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
deit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deta Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
dinat Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
dinov2 Add DINOv2 (#24016) 2023-07-18 15:34:06 +01:00
distilbert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
dit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
donut 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
dpr Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
dpt Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
efficientformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
efficientnet Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
electra Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
encodec Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
encoder_decoder Move TF building to an actual build() method (#23760) 2023-06-06 18:30:51 +01:00
ernie 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
ernie_m Automatically create/update tiny models (#22275) 2023-03-23 19:14:17 +01:00
esm Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
falcon Falcon port (#24523) 2023-07-11 13:36:31 +01:00
flaubert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
flava Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
fnet Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
focalnet Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
fsmt update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
funnel Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
git Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
glpn Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
gpt_bigcode Add torch >=1.12 requirement for Tapas (#24251) 2023-06-13 19:19:40 +02:00
gpt_neo Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
gpt_neox Llama/GPTNeoX: add RoPE scaling (#24653) 2023-07-13 16:47:30 +01:00
gpt_neox_japanese 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
gpt_sw3 Add gpt-sw3 model to transformers (#20209) 2022-12-12 13:12:13 -05:00
gpt2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
gptj Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
gptsan_japanese Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
graphormer Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
groupvit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
herbert Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
hubert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
ibert 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
imagegpt Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
informer Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
instructblip [InstructBLIP] Fix bos token of LLaMa checkpoints (#24492) 2023-07-11 20:43:01 +01:00
jukebox Update Jukebox tests (#21984) 2023-03-07 04:20:14 +01:00
layoutlm Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
layoutlmv2 Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
layoutlmv3 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
layoutxlm Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
led Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
levit Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
lilt Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
llama [LlamaConfig] Nit: pad token should be None by default (#24958) 2023-07-21 14:32:34 +02:00
longformer Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
longt5 update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
luke Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
lxmert Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
m2m_100 update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
marian Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
markuplm Update some MarkupLM tests' expected values (#22667) 2023-04-11 10:00:34 +02:00
mask2former Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
maskformer Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
mbart Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mbart50 Replace as_target context managers by direct calls (#18325) 2022-07-29 08:09:09 -04:00
mega Fix MegaModel CI (#22652) 2023-04-07 17:13:04 +02:00
megatron_bert 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
megatron_gpt2 Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
mgp_str Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mluke Black preview (#17217) 2022-05-12 16:25:55 -04:00
mobilebert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mobilenet_v1 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mobilenet_v2 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mobilevit Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
mobilevitv2 Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
mpnet Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mra Add Multi Resolution Analysis (MRA) (New PR) (#24513) 2023-07-10 10:50:43 +01:00
mt5 Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
musicgen Skip torchscript tests for MusicgenForConditionalGeneration (#24782) 2023-07-13 15:54:18 +02:00
mvp Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
nat Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
nezha 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
nllb 🚨🚨🚨 [NLLB Tokenizer] Fix the prefix tokens 🚨🚨🚨 (#22313) 2023-04-04 14:53:06 +02:00
nllb_moe tests: Fix flaky test for NLLB-MoE (#22880) 2023-04-21 17:09:40 +01:00
nystromformer 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
oneformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
openai Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
opt Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
owlvit Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
pegasus Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
pegasus_x update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
perceiver Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
phobert Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
pix2struct Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
plbart Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
poolformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
prophetnet Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
pvt Pvt model (#24720) 2023-07-24 15:34:19 +01:00
qdqbert 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
rag Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
realm Unpin numba (#23162) 2023-05-31 14:59:30 +01:00
reformer Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
regnet Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
rembert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
resnet Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
roberta Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
roberta_prelayernorm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
roc_bert Move is_pipeline_test_to_skip to specific model test classes (#21999) 2023-03-14 10:03:02 +01:00
roformer Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
rwkv Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
sam Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
segformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
sew Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
sew_d Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
speech_encoder_decoder Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
speech_to_text Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
speech_to_text_2 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
speecht5 Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
splinter Make tiny model creation + pipeline testing more robust (#22500) 2023-04-06 17:45:55 +02:00
squeezebert 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
swiftformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
swin Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
swin2sr Skip test_multi_gpu_data_parallel_forward for some model tests (#21991) 2023-03-07 14:23:36 +01:00
swinv2 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
switch_transformers [Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622) 2023-07-11 15:02:18 +02:00
t5 [Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622) 2023-07-11 15:02:18 +02:00
table_transformer Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
tapas Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
time_series_transformer Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
timesformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
timm_backbone Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
transfo_xl Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
trocr Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
tvlt Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
umt5 [Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valide for beginning of words (#24622) 2023-07-11 15:02:18 +02:00
unispeech Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
unispeech_sat Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
upernet Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
videomae Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
vilt Removal of deprecated vision methods and specify deprecation versions (#24570) 2023-06-29 15:09:51 +01:00
vision_encoder_decoder Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vision_text_dual_encoder Fix VisionTextDualEncoderIntegrationTest (#24661) 2023-07-05 13:44:30 +02:00
visual_bert Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
vit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
vit_hybrid Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vit_mae Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
vit_msn Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vivit Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
wav2vec2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
wav2vec2_conformer Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
wav2vec2_phoneme Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
wav2vec2_with_lm Fix test_word_time_stamp_integration for Wav2Vec2ProcessorWithLMTest (#22800) 2023-04-17 12:41:55 +02:00
wavlm Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
whisper Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
x_clip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
xglm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xlm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xlm_prophetnet Update expected values in XLMProphetNetModelIntegrationTest (#21957) 2023-03-06 09:15:44 +01:00
xlm_roberta Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
xlm_roberta_xl Use real tokenizers if tiny version(s) creation has issue(s) (#22428) 2023-03-29 16:16:23 +02:00
xlnet Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xmod Use real tokenizers if tiny version(s) creation has issue(s) (#22428) 2023-03-29 16:16:23 +02:00
yolos Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
yoso 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
__init__.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00