transformers/docs/source
Sourab Mangrulkar a761d6e9a0
Refactoring Trainer, adds save_only_model arg and simplifying FSDP integration (#27652)
* add code changes

1. Refactor FSDP
2. Add `--save_only_model` option: when checkpointing, controls whether to save only the model or also the optimizer, scheduler, and RNG state.
3. Bump up the minimum `accelerate` version to `0.21.0`

* quality

* fix quality?

* Revert "fix quality?"

This reverts commit 149330a6ab.

* fix fsdp doc strings

* fix quality

* Update src/transformers/training_args.py

Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* please fix the quality issue 😅

* Apply suggestions from code review

Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>

* address comment

* simplify conditional check as per the comment

* update documentation

---------

Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
2023-11-24 11:40:52 +05:30
de Broken links fixed related to datasets docs (#27569) 2023-11-17 13:44:09 -08:00
en Refactoring Trainer, adds save_only_model arg and simplifying FSDP integration (#27652) 2023-11-24 11:40:52 +05:30
es Broken links fixed related to datasets docs (#27569) 2023-11-17 13:44:09 -08:00
fr Fix typos (#25936) 2023-09-04 11:15:12 +01:00
hi Hindi translation of pipeline_tutorial.md (#26837) 2023-10-25 11:21:49 -07:00
it Broken links fixed related to datasets docs (#27569) 2023-11-17 13:44:09 -08:00
ja docs: fix 404 link (#27529) 2023-11-20 12:24:38 +00:00
ko Update Korean tutorial for using LLMs, and refactor the nested conditional statements in hf_argparser.py (#27489) 2023-11-20 17:14:23 +00:00
ms TVP model (#25856) 2023-11-21 16:41:55 +00:00
pt Broken links fixed related to datasets docs (#27569) 2023-11-17 13:44:09 -08:00
te Added Telugu [te] translations (#26828) 2023-10-20 15:27:55 -07:00
tr Translate index.md to Turkish (#27093) 2023-11-08 08:35:20 -05:00
zh translate deepspeed.md to chinese (#27495) 2023-11-17 13:49:31 -08:00
_config.py [Styling] stylify using ruff (#27144) 2023-11-16 17:43:19 +01:00