transformers/tests/pipelines
Younes Belkada 163ac3d3ee
Add Switch transformers (#19323)
* first commit

* add more comments

* add router v1

* clean up

- remove `tf` modeling files

* clean up

- remove `tf` modeling files

* clean up

* v0 routers

* added more router

- Implemented `ExpertsChooseMaskedRouter`

- added tests
- 2 more routers to implement

* last router

* improved docstring

- completed the docstring in `router.py`
- added more args in the config

* v0 sparse mlp

* replace wrong naming

* forward pass run

* update MOE layer

* small router update

* fixup

* consistency

* remove scatter router

* remove abstract layer

* update test and model for integration testing

* v1 conversion

* update

* hardcode hack

* all keys match

* add gin conversion, without additional libraries

* update conversion sctipy

* delete router file

* update tests wrt router deletion

* fix router issues

* update expert code

* update, logits match, code needsREFACTORING

* Refactor code

Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* add generate tests

Co-authored-by: younesbelkada <younesbelkada@gmail.com>

* add support for router loss

Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* fix forward error

* refactor a bit

* remove `FlaxSwitchTransformers` modules

* more tests pass

* Update code

Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>

* fixup

* fix tests

* fix doc

* fix doc + tokenization

* fix tokenizer test

* fix test

* fix loss output

* update code for backward pass

* add loss support

* update documentation

* fix documentation, clean tokenizer

* more doc fix, cleanup example_switch

* fix failing test

* fix test

* fix test

* fix loss issue

* move layer

* update doc and fix router capacity usage

* fixup

* add sparse mlp index for documentation on hub

* fixup

* test sparse mix architecture

* Apply suggestions from code review

* Update docs/source/en/model_doc/switch_transformers.mdx

* fixup on update

* fix tests

* fix another test

* attempt fix

* Update src/transformers/models/switch_transformers/configuration_switch_transformers.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/switch_transformers/convert_switch_transformers_original_flax_checkpoint_to_pytorch.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* try

* all tests pass

* fix jitter noise

* Apply suggestions from code review

* doc tests pass

* Update src/transformers/models/switch_transformers/modeling_switch_transformers.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/switch_transformers/modeling_switch_transformers.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove assert

* change config order

* fix readme japanese

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove parallelizable tests + add one liners

* remove ONNX config

* fix nits

- add `T5Tokenizer` in auto mapping
- remove `Switch Transformers` from ONNX supported models

* remove `_get_router`

* remove asserts

* add check in test for `router_dtype`

* add `SwitchTransformersConfig` in `run_pipeline_test`

* Update tests/pipelines/test_pipelines_summarization.py

* add huge model conversion script

* fix slow tests

- add better casting for `Linear8bitLt`
- remove `torchscript` tests

* add make dir

* style on new script

* fix nits

- doctest
- remove `_keys_to_ignore_on_load_unexpected`

* Update src/transformers/models/switch_transformers/configuration_switch_transformers.py

* add google as authors

* fix year

* remove last `assert` statements

* standardize vertical spaces

* fix failing import

* fix another failing test

* Remove strange àuthorized_keys`

* removing todo and padding that is never used

Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: ybelkada <younes@huggingface.co>
Co-authored-by: Younes Belkada <younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur Zucker <arthur@huggingface.co>
2022-11-15 13:06:45 +01:00
..
__init__.py [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
test_pipelines_audio_classification.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_automatic_speech_recognition.py Adding chunking for whisper (all seq2seq actually). Very crude matching algorithm. (#20104) 2022-11-14 22:32:50 +01:00
test_pipelines_common.py Fix tapas scatter (#20149) 2022-11-14 01:04:26 -05:00
test_pipelines_conversational.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_depth_estimation.py Add depth estimation pipeline (#18618) 2022-10-12 08:54:20 -04:00
test_pipelines_document_question_answering.py A few CI fixes for DocumentQuestionAnsweringPipeline (#19584) 2022-10-17 15:35:27 +02:00
test_pipelines_feature_extraction.py add return_tensors parameter for feature_extraction 2 (#19707) 2022-10-18 16:29:00 +02:00
test_pipelines_fill_mask.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_image_classification.py Specify TF framework explicitly in more pipeline tests (#19748) 2022-10-19 16:24:03 +02:00
test_pipelines_image_segmentation.py Make ImageSegmentationPipelineTests less flaky (#20147) 2022-11-15 09:14:55 +01:00
test_pipelines_image_to_text.py fix image2test args forwarding (#19648) 2022-10-24 09:49:24 -04:00
test_pipelines_object_detection.py Adding support for LayoutLMvX variants for object-detection. (#20143) 2022-11-10 11:33:38 +01:00
test_pipelines_question_answering.py Clean up deprecation warnings (#19654) 2022-10-18 13:34:47 -04:00
test_pipelines_summarization.py Add Switch transformers (#19323) 2022-11-15 13:06:45 +01:00
test_pipelines_table_question_answering.py Fix tapas scatter (#20149) 2022-11-14 01:04:26 -05:00
test_pipelines_text_classification.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_text_generation.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_text2text_generation.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_token_classification.py TF port of ESM (#19587) 2022-10-17 14:16:16 +01:00
test_pipelines_translation.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_visual_question_answering.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_zero_shot_image_classification.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00
test_pipelines_zero_shot_object_detection.py Remove ref to is_pipeline_test 2022-10-07 21:38:07 -04:00
test_pipelines_zero_shot.py Rework pipeline tests (#19366) 2022-10-07 18:01:58 -04:00