transformers/docs/source
Niklas Muennighoff ecd61c6286
Add OLMoE (#32406)
* Add OLMoE

* Add OLMoE

* Updates

* Make norm optional; add keys

* Add output

* Add

* Fix dtype

* Fix eos config

* Update

* Add OLMoE

* Fix OLMoE path

* Format

* Format

* Rmv copy statement

* Rmv copy statement

* Format

* Add copies

* Cp rotary

* Fix aming

* Fix naming

* Update RoPE integration; num_logits_to_keep; Add copy statements

* Add eps to config

* Format

* Add aux loss

* Adapt router_aux_loss_coef

* Update md

* Adapt

* adapt tests
2024-09-03 18:43:12 +02:00
..
de Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00
en Add OLMoE (#32406) 2024-09-03 18:43:12 +02:00
es Updated the custom_models.md changed cross_entropy code (#33118) 2024-08-26 13:15:43 +02:00
fr Add French version of run scripts tutorial (#31483) 2024-06-28 18:02:30 +02:00
hi More fixes for doctest (#30265) 2024-04-16 11:58:55 +02:00
it Updated the custom_models.md changed cross_entropy code (#33118) 2024-08-26 13:15:43 +02:00
ja Updated the custom_models.md changed cross_entropy code (#33118) 2024-08-26 13:15:43 +02:00
ko 🌐 [i18n-KO] Translated llm_optims.md to Korean (#32325) 2024-08-30 09:52:41 -07:00
ms Remove old TF port docs (#30426) 2024-04-23 16:06:20 +01:00
pt Updated the custom_models.md changed cross_entropy code (#33118) 2024-08-26 13:15:43 +02:00
te docs: fix broken link (#31370) 2024-06-12 11:33:00 +01:00
tr Translate index.md to Turkish (#27093) 2023-11-08 08:35:20 -05:00
zh Updated the custom_models.md changed cross_entropy code (#33118) 2024-08-26 13:15:43 +02:00
_config.py [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00