Leandro von Werra
|
b18e31407c
|
add info on TRL docs (#27024)
* add info on TRL docs
* add TRL link
* tweak text
* tweak text
|
2023-10-24 14:56:00 +02:00 |
|
Arup De
|
738ecd17d8
|
Arde/fsdp activation checkpointing (#25771)
* add FSDP config option to enable activation-checkpointing
* update docs
* add checks and remove redundant code
* fix formatting error
|
2023-08-29 12:52:14 +05:30 |
|
mchau
|
6f041fcbb8
|
fix documentation for CustomTrainer (#25635)
fix doc
|
2023-08-21 17:23:17 +02:00 |
|
Sourab Mangrulkar
|
f4eb459ef2
|
fsdp fixes and enhancements (#24980)
* fix fsdp prepare to remove the warnings and fix excess memory usage
* Update training_args.py
* parity for FSDP+XLA
* Update trainer.py
|
2023-07-21 17:52:48 +05:30 |
|
statelesshz
|
8ba26c18cf
|
deprecate sharded_ddp training argument (#24825)
* deprecate fairscale's ShardedDDP
* fix code style
* roll back
* deprecate the `sharded_ddp` training argument
---------
Co-authored-by: jihuazhong <jihuazhong1@huawei.com>
|
2023-07-17 06:57:42 -04:00 |
|
Sylvain Gugger
|
eb849f6604
|
Migrate doc files to Markdown. (#24376)
* Rename index.mdx to index.md
* With saved modifs
* Address review comment
* Treat all files
* .mdx -> .md
* Remove special char
* Update utils/tests_fetcher.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
---------
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
|
2023-06-20 18:07:47 -04:00 |
|