mirror of
https://github.com/huggingface/transformers.git
synced 2025-08-02 19:21:31 +06:00
![]() This commit adds gate jitter to MixtralSparseMoeBlock's input data before passing it through the MoE layer, if turned on. |
||
---|---|---|
.. | ||
__init__.py | ||
test_modeling_mixtral.py |