mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-03 12:50:06 +06:00

* Initial commit for Qwen3
* fix and add tests for qwen3 & qwen3_moe
* rename models for tests.
* fix
* fix
* fix and add docs.
* fix model name in docs.
* simplify modular and fix configuration issues
* Fix the red CI: ruff was updated
* revert ruff, version was wrong
* fix qwen3moe.
* fix
* make sure MOE can load
* fix copies

---------

Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Qwen3MoE
Overview
To be released with the official model launch.
Model Details
To be released with the official model launch.
Usage tips
To be released with the official model launch.
Qwen3MoeConfig
autodoc Qwen3MoeConfig
Qwen3MoeModel
autodoc Qwen3MoeModel - forward
Qwen3MoeForCausalLM
autodoc Qwen3MoeForCausalLM - forward
Qwen3MoeForSequenceClassification
autodoc Qwen3MoeForSequenceClassification - forward
Qwen3MoeForTokenClassification
autodoc Qwen3MoeForTokenClassification - forward
Qwen3MoeForQuestionAnswering
autodoc Qwen3MoeForQuestionAnswering - forward