transformers/tests/models/mixtral
Lorenzo Verardo a25037beb9
MixtralSparseMoeBlock: add gate jitter (#29865)
This commit adds gate jitter to MixtralSparseMoeBlock's input data
before passing it through the MoE layer, if turned on.
2024-03-27 16:14:26 +01:00
..
__init__.py [Add Mixtral] Adds support for the Mixtral MoE (#27942) 2023-12-11 12:50:27 +01:00
test_modeling_mixtral.py MixtralSparseMoeBlock: add gate jitter (#29865) 2024-03-27 16:14:26 +01:00