transformers/docs/source
Amit Garg e3775539c8
PhiMoE (#33363)
* onboard phimoe model

* removed debug code

* added unit tests

* updated docs

* formatted

* fixed unit tests

* fixed test case

* fixed format

* refactored code

* fixed expected outputs in the integration tests

* Added a warning msg

* Addressed comments

* Addressed comments

* fixed test cases

* added paper link

* Addressed comments

* Refactored PhimoeForCausalLM forward fn

* Refactored PhimoeRotaryEmbedding class

* fixed test cases

* fixed testcase

* fixed test case

* Addressed comments

* fixed test cases

* fixed testcases

* Used cache position instead to get the seq len
2024-10-04 21:39:45 +02:00
..
ar [i18n-ar] Add File : docs/source/ar/_toctree.yml (#32696) 2024-09-16 10:02:03 -07:00
de Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00
en PhiMoE (#33363) 2024-10-04 21:39:45 +02:00
es Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
fr Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
hi More fixes for doctest (#30265) 2024-04-16 11:58:55 +02:00
it Updated the custom_models.md changed cross_entropy code (#33118) 2024-08-26 13:15:43 +02:00
ja Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
ko Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
ms Remove old TF port docs (#30426) 2024-04-23 16:06:20 +01:00
pt Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
te Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
tr Translate index.md to Turkish (#27093) 2023-11-08 08:35:20 -05:00
zh Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
_config.py [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00