transformers/examples/tests
Sylvain Gugger 9d14be5c20
Add support for ZeRO-2/3 and ZeRO-offload in fairscale (#10354)
* Ass support for ZeRO-2/3 and ZeRO-offload in fairscale

* Quality

* Rework from review comments

* Add doc

* Apply suggestions from code review

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Address review comments

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
2021-02-25 11:07:53 -05:00
..
deepspeed [Trainer/Deepspeed] handle get_last_lr() before first step() (#10362) 2021-02-23 17:42:25 -08:00
trainer Add support for ZeRO-2/3 and ZeRO-offload in fairscale (#10354) 2021-02-25 11:07:53 -05:00