transformers/docs/source
Bo Zheng 6acd5aecb3
Adding Qwen3 and Qwen3MoE (#36878)
* Initial commit for Qwen3

* fix and add tests for qwen3 & qwen3_moe

* rename models for tests.

* fix

* fix

* fix and add docs.

* fix model name in docs.

* simplify modular and fix configuration issues

* Fix the red CI: ruff was updated

* revert ruff, version was wrong

* fix qwen3moe.

* fix

* make sure MOE can load

* fix copies

---------

Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
2025-03-31 09:50:49 +02:00
..
ar Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
de Fix typos (#36910) 2025-03-24 14:08:29 +00:00
en Adding Qwen3 and Qwen3MoE (#36878) 2025-03-31 09:50:49 +02:00
es Fix typos (#36910) 2025-03-24 14:08:29 +00:00
fr Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
hi [i18n-HI] Translated TFLite page to Hindi (#34572) 2024-11-04 09:40:30 -08:00
it Fix typos (#36910) 2025-03-24 14:08:29 +00:00
ja Just import torch AdamW instead (#36177) 2025-03-19 18:29:40 +00:00
ko 🌐 [i18n-KO] Translated qwen2_vl.md to Korean (#36750) 2025-03-30 15:00:27 -07:00
ms Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
pt Fix typos (#36910) 2025-03-24 14:08:29 +00:00
te Fix typos in translated quicktour docs (#35302) 2024-12-17 09:32:00 -08:00
tr Translate index.md to Turkish (#27093) 2023-11-08 08:35:20 -05:00
zh Just import torch AdamW instead (#36177) 2025-03-19 18:29:40 +00:00
_config.py [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00