transformers/docs/source
Bo Zheng 1c39974a4c
Add Qwen2MoE (#29377)
* add support for qwen2 MoE models

* update docs

* add support for qwen2 MoE models

* update docs

* update model name & test

* update readme

* update class names & readme & model_doc of Qwen2MoE.

* update architecture name

* fix qwen2_moe tests

* use Qwen2Tokenizer instead of Qwen2MoeTokenizer

* update modeling_qwen2_moe.py

* fix model architecture

* fix qwen2_moe tests

* use Qwen2Tokenizer instead of Qwen2MoeTokenizer

* update modeling_qwen2_moe.py

* fix model architecture

* fix style

* fix test when there are sparse and non sparse layers

* fixup

* Update README.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

* fixup

* add archive back

* add support for qwen2 MoE models

* update docs

* update model name & test

* update readme

* update class names & readme & model_doc of Qwen2MoE.

* update architecture name

* fix qwen2_moe tests

* use Qwen2Tokenizer instead of Qwen2MoeTokenizer

* update modeling_qwen2_moe.py

* fix model architecture

* fixup

* fix qwen2_moe tests

* use Qwen2Tokenizer instead of Qwen2MoeTokenizer

* fix style

* fix test when there are sparse and non sparse layers

* fixup

* add archive back

* fix integration test

* fixup

---------

Co-authored-by: bozheng-hit <dsoul0621@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-03-27 02:11:55 +01:00
..
de Make torch xla available on GPU (#29334) 2024-03-11 14:07:16 +00:00
en Add Qwen2MoE (#29377) 2024-03-27 02:11:55 +01:00
es [docs] Spanish translation of attention.md (#29681) 2024-03-15 11:55:35 -07:00
fr Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
hi Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
it Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
ja [docs] Remove broken ChatML format link from chat_templating.md (#29643) 2024-03-13 13:04:51 -07:00
ko Make torch xla available on GPU (#29334) 2024-03-11 14:07:16 +00:00
ms [Docs] Add missing language options and fix broken links (#28852) 2024-02-06 12:01:01 -08:00
pt Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
te Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
tr Translate index.md to Turkish (#27093) 2023-11-08 08:35:20 -05:00
zh [docs] Remove broken ChatML format link from chat_templating.md (#29643) 2024-03-13 13:04:51 -07:00
_config.py [Styling] stylify using ruff (#27144) 2023-11-16 17:43:19 +01:00