Stas Bekman
|
0e82f0cbc2
|
typo
|
2021-06-08 12:55:17 -07:00 |
|
Stas Bekman
|
32290d87f6
|
[Deepspeed] various fixes (#12058)
* replace deprecated config
* sub_group_size was too big
* complete deprecation removal
|
2021-06-08 08:36:15 -07:00 |
|
Stas Bekman
|
2c73b93099
|
[Deepspeed] Assert on mismatches between ds and hf args (#12021)
* wip
* add mismatch validation + test
* renames
* Update docs/source/main_classes/deepspeed.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* renames
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2021-06-04 08:58:23 -07:00 |
|
Stas Bekman
|
640318befa
|
[deepspeed] Move code and doc into standalone files (#11984)
* move code and docs
* style
* moved
* restore
|
2021-06-02 09:56:00 -07:00 |
|
Stas Bekman
|
7ec596ecda
|
[DeepSpeed] decouple DeepSpeedConfigHF from Trainer (#11966)
* decouple DeepSpeedConfigHF from Trainer
* add LoggingLevel ctx manager; add new test
* cleanup
* add docs
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* implemented suggested renames
* formatter workaround
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
|
2021-06-01 13:24:52 -07:00 |
|