transformers/examples/tests/deepspeed
Stas Bekman 3437d12134
[Trainer/Deepspeed] handle get_last_lr() before first step() (#10362)
* handle get_last_lr() before first step()

* abstract away the lr getting logic

* cleanup

* add test

* move to utils
2021-02-23 17:42:25 -08:00
..
ds_config.json [Trainer] implement gradient_accumulation_steps support in DeepSpeed integration (#10310) 2021-02-22 11:15:59 -08:00
test_deepspeed.py [Trainer/Deepspeed] handle get_last_lr() before first step() (#10362) 2021-02-23 17:42:25 -08:00