transformers/docs/source/en/model_doc/qwen3_moe.md
Bo Zheng 6acd5aecb3
Adding Qwen3 and Qwen3MoE (#36878)
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
2025-03-31 09:50:49 +02:00

Qwen3MoE

Overview

To be released with the official model launch.

Model Details

To be released with the official model launch.

Usage tips

To be released with the official model launch.

Qwen3MoeConfig

autodoc Qwen3MoeConfig

Qwen3MoeModel

autodoc Qwen3MoeModel - forward

Qwen3MoeForCausalLM

autodoc Qwen3MoeForCausalLM - forward

Qwen3MoeForSequenceClassification

autodoc Qwen3MoeForSequenceClassification - forward

Qwen3MoeForTokenClassification

autodoc Qwen3MoeForTokenClassification - forward

Qwen3MoeForQuestionAnswering

autodoc Qwen3MoeForQuestionAnswering - forward