transformers/tests/models
Sanchit Gandhi e93103632b
Add bloom flax (#25094)
* First commit

* step 1 working

* add alibi

* placeholder for `scan`

* add matrix mult alibi

* beta scaling factor for bmm

* working v1 - simple forward pass

* move layer_number from attribute to arg in call

* partial functioning scan

* hacky working scan

* add more modifs

* add test

* update scan for new kwarg order

* fix position_ids problem

* fix bug in attention layer

* small fix

- do the alibi broadcasting only once

* prelim refactor

* finish refactor

* alibi shifting

* incorporate dropout_add to attention module

* make style

* make padding work again

* update

* remove bogus file

* up

* get generation to work

* clean code a bit

* added small tests

* adding alibi test

* make CI tests pass:

- change init weight
- add correct tuple for output attention
- add scan test
- make CI tests work

* fix few nits

* fix nit onnx

* fix onnx nit

* add missing dtype args to nn.Modules

* remove debugging statements

* fix scan generate

* Update modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* Update test_modeling_flax_bloom.py

* fix small test issue + make style

* clean up

* Update tests/models/bloom/test_modeling_flax_bloom.py

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* fix function name

* small fix test

* forward contrib credits from PR17761

* Fix failing test

* fix small typo documentation

* fix non passing test

- remove device from build alibi

* refactor call

- refactor `FlaxBloomBlockCollection` module

* make style

* upcast to fp32

* cleaner way to upcast

* remove unused args

* remove layer number

* fix scan test

* make style

* fix i4 casting

* fix slow test

* Update src/transformers/models/bloom/modeling_flax_bloom.py

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove `layer_past`

* refactor a bit

* fix `scan` slow test

* remove useless import

* major changes

- remove unused code
- refactor a bit
- revert import `torch`

* major refactoring

- change build alibi

* remove scan

* fix tests

* make style

* clean-up alibi

* add integration tests

* up

* fix batch norm conversion

* style

* style

* update pt-fx cross tests

* update copyright

* Update src/transformers/modeling_flax_pytorch_utils.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* per-weight check

* style

* line formats

---------

Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: haileyschoelkopf <haileyschoelkopf@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-07-27 18:24:56 +01:00
albert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
align Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
altclip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
audio_spectrogram_transformer is_batched fix for remaining 2-D numpy arrays (#23309) 2023-05-23 14:37:35 -04:00
auto Remote code improvements (#23959) 2023-06-06 14:31:14 -04:00
autoformer Compute dropout_probability only in training mode (#24486) 2023-06-26 18:36:47 +02:00
bark Add offload support to Bark (#25037) 2023-07-27 15:35:17 +01:00
bart Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
barthez Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
bartpho Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
beit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
bert fix: Text splitting in the BasicTokenizer (#22280) 2023-07-11 11:07:58 -04:00
bert_generation 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
bert_japanese Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
bertweet Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
big_bird Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
bigbird_pegasus Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
biogpt Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
bit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
blenderbot Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blenderbot_small Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blip Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
blip_2 Add InstructBLIP (#23460) 2023-06-26 11:23:57 +02:00
bloom Add bloom flax (#25094) 2023-07-27 18:24:56 +01:00
bridgetower Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
byt5 Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
camembert Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
canine Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
chinese_clip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
clap Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
clip fix: Text splitting in the BasicTokenizer (#22280) 2023-07-11 11:07:58 -04:00
clipseg Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
codegen Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
conditional_detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
convbert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
convnext Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
convnextv2 Update ConvNextV2ModelIntegrationTest::test_inference_image_classification_head (#23402) 2023-05-16 23:35:11 +02:00
cpm Fix PipelineTests skip conditions (#22320) 2023-03-22 20:02:24 +01:00
cpmant Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
ctrl Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
cvt Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
data2vec Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deberta Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deberta_v2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
decision_transformer 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
deformable_detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
deit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
deta Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
detr Check models used for common tests are small (#24824) 2023-07-14 14:43:19 -04:00
dinat Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
dinov2 Add DINOv2 (#24016) 2023-07-18 15:34:06 +01:00
distilbert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
dit Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
donut 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
dpr Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
dpt Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
efficientformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
efficientnet Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
electra Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
encodec Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
encoder_decoder Move TF building to an actual build() method (#23760) 2023-06-06 18:30:51 +01:00
ernie 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
ernie_m Automatically create/update tiny models (#22275) 2023-03-23 19:14:17 +01:00
esm Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
falcon Falcon port (#24523) 2023-07-11 13:36:31 +01:00
flaubert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
flava Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
fnet Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
focalnet Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
fsmt update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
funnel Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
git Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
glpn Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
gpt_bigcode Add torch >=1.12 requirement for Tapas (#24251) 2023-06-13 19:19:40 +02:00
gpt_neo Update tiny models and pipeline tests (#23446) 2023-05-18 17:29:04 +02:00
gpt_neox Llama/GPTNeoX: add RoPE scaling (#24653) 2023-07-13 16:47:30 +01:00
gpt_neox_japanese 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
gpt_sw3 Add gpt-sw3 model to transformers (#20209) 2022-12-12 13:12:13 -05:00
gpt2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
gptj Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
gptsan_japanese Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
graphormer Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
groupvit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
herbert Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
hubert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
ibert 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
imagegpt Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
informer Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
instructblip [InstructBLIP] Fix bos token of LLaMa checkpoints (#24492) 2023-07-11 20:43:01 +01:00
jukebox Set TF32 flag for PyTorch cuDNN backend (#25075) 2023-07-25 08:04:48 -04:00
layoutlm Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
layoutlmv2 Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
layoutlmv3 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
layoutxlm Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
led Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
levit Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
lilt Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
llama [LlamaConfig] Nit: pad token should be None by default (#24958) 2023-07-21 14:32:34 +02:00
longformer Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
longt5 update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
luke Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
lxmert Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
m2m_100 update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
marian Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
markuplm Update some MarkupLM tests' expected values (#22667) 2023-04-11 10:00:34 +02:00
mask2former Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
maskformer MaskFormer - enable return_dict in order to compile (#25052) 2023-07-26 16:23:30 +01:00
mbart Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mbart50 Replace as_target context managers by direct calls (#18325) 2022-07-29 08:09:09 -04:00
mega Fix MegaModel CI (#22652) 2023-04-07 17:13:04 +02:00
megatron_bert 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
megatron_gpt2 Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
mgp_str Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mluke Black preview (#17217) 2022-05-12 16:25:55 -04:00
mobilebert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mobilenet_v1 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mobilenet_v2 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
mobilevit Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
mobilevitv2 Make more test models smaller (#25005) 2023-07-24 10:08:47 -04:00
mpnet Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
mpt [MptConfig] support from pretrained args (#25116) 2023-07-27 16:24:52 +02:00
mra Add Multi Resolution Analysis (MRA) (New PR) (#24513) 2023-07-10 10:50:43 +01:00
mt5 Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
musicgen Skip torchscript tests for MusicgenForConditionalGeneration (#24782) 2023-07-13 15:54:18 +02:00
mvp Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
nat Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
nezha 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
nllb 🚨🚨🚨 [NLLB Tokenizer] Fix the prefix tokens 🚨🚨🚨 (#22313) 2023-04-04 14:53:06 +02:00
nllb_moe tests: Fix flaky test for NLLB-MoE (#22880) 2023-04-21 17:09:40 +01:00
nystromformer 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
oneformer Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
openai Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
opt Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
owlvit Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
pegasus Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
pegasus_x update_pip_test_mapping (#22606) 2023-04-06 17:56:06 +02:00
perceiver Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
phobert Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
pix2struct Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
plbart Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
poolformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
prophetnet Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
pvt Fix PvtModelIntegrationTest::test_inference_fp16 (#25106) 2023-07-26 14:57:44 +02:00
qdqbert 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
rag Big TF test cleanup (#24282) 2023-06-16 15:40:49 +01:00
realm Unpin numba (#23162) 2023-05-31 14:59:30 +01:00
reformer Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
regnet Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
rembert Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
resnet Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
roberta Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
roberta_prelayernorm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
roc_bert Move is_pipeline_test_to_skip to specific model test classes (#21999) 2023-03-14 10:03:02 +01:00
roformer Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
rwkv Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
sam Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
segformer Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
sew Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
sew_d Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
speech_encoder_decoder Update quality tooling for formatting (#21480) 2023-02-06 18:10:56 -05:00
speech_to_text Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
speech_to_text_2 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
speecht5 Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
splinter Make tiny model creation + pipeline testing more robust (#22500) 2023-04-06 17:45:55 +02:00
squeezebert 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
swiftformer Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
swin Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
swin2sr Skip test_multi_gpu_data_parallel_forward for some model tests (#21991) 2023-03-07 14:23:36 +01:00
swinv2 Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
switch_transformers [Patch-t5-tokenizer] Patches the changes on T5 to make sure previous behaviour is still valid for beginning of words (#24622) 2023-07-11 15:02:18 +02:00
t5 [T5, MT5, UMT5] Add [T5, MT5, UMT5]ForSequenceClassification (#24726) 2023-07-25 21:02:49 +02:00
table_transformer Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
tapas Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
time_series_transformer Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
timesformer Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
timm_backbone Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
transfo_xl Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
trocr Generate: skip left-padding tests on old models (#23437) 2023-05-18 11:04:51 +01:00
tvlt Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
umt5 [T5, MT5, UMT5] Add [T5, MT5, UMT5]ForSequenceClassification (#24726) 2023-07-25 21:02:49 +02:00
unispeech Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
unispeech_sat Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
upernet Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
videomae Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
vilt Removal of deprecated vision methods and specify deprecation versions (#24570) 2023-06-29 15:09:51 +01:00
vision_encoder_decoder Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vision_text_dual_encoder Fix VisionTextDualEncoderIntegrationTest (#24661) 2023-07-05 13:44:30 +02:00
visual_bert Revert "Fix gradient checkpointing + fp16 autocast for most models" (#24420) 2023-06-22 16:11:27 +02:00
vit Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
vit_hybrid Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vit_mae Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
vit_msn Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
vivit Fix last models for common tests that are too big. (#25058) 2023-07-25 07:56:04 -04:00
wav2vec2 Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
wav2vec2_conformer Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
wav2vec2_phoneme Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
wav2vec2_with_lm Fix test_word_time_stamp_integration for Wav2Vec2ProcessorWithLMTest (#22800) 2023-04-17 12:41:55 +02:00
wavlm Fix TypeError: Object of type int64 is not JSON serializable (#24340) 2023-06-27 12:15:49 +01:00
whisper Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
x_clip Update some torchscript tests after #24505 (#24566) 2023-06-29 16:05:24 +02:00
xglm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xlm Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xlm_prophetnet Update expected values in XLMProphetNetModelIntegrationTest (#21957) 2023-03-06 09:15:44 +01:00
xlm_roberta Better TF docstring types (#23477) 2023-05-24 13:52:52 +01:00
xlm_roberta_xl Use real tokenizers if tiny version(s) creation has issue(s) (#22428) 2023-03-29 16:16:23 +02:00
xlnet Speed up TF tests by reducing hidden layer counts (#24595) 2023-06-30 16:30:33 +01:00
xmod Use real tokenizers if tiny version(s) creation has issue(s) (#22428) 2023-03-29 16:16:23 +02:00
yolos Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
yoso 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
__init__.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00