transformers/.github/ISSUE_TEMPLATE
Sourab Mangrulkar a73b1d59a3
accelerate deepspeed and gradient accumulation integrate (#23236)
* mixed precision support via accelerate

* fix issues

* fix for the sharded ddp case

* fix flax and tf failing tests

* refactor the place to create `Accelerator` object

* move ddp prep to accelerate

* fix 😅

* resolving comments

* move fsdp handling to accelerate

* fixes

* fix saving

* shift torch dynamo handling to accelerate

* shift deepspeed integration and save & load utils to accelerate

* fix accelerate launcher support

* oops

* fix 🐛

* save ckpt fix

* Trigger CI

* nasty 🐛 😅

* as deepspeed needs grad_acc fixes, transfer grad_acc to accelerate

* make tests happy

* quality 

* loss tracked needs to account for grad_acc

* fixing the deepspeed tests

* quality 

* 😅😅😅

* tests 😡

* quality 

* Trigger CI

* resolve comments and fix the issue with the previous merge from branch

* Trigger CI

* accelerate took over deepspeed integration

---------

Co-authored-by: Stas Bekman <stas@stason.org>
2023-05-31 15:16:22 +05:30
bug-report.yml accelerate deepspeed and gradient accumulation integrate (#23236) 2023-05-31 15:16:22 +05:30
config.yml Add HF.co for PRs / Issues regarding specific model checkpoints (#17485) 2022-05-31 15:58:39 +02:00
feature-request.yml Refactor issues with yaml (#16772) 2022-04-18 16:43:21 -04:00
i18n.md feat: add i18n issue template (#20199) 2022-11-14 12:36:58 -05:00
migration.yml Refactor issues with yaml (#16772) 2022-04-18 16:43:21 -04:00
new-model-addition.yml Refactor issues with yaml (#16772) 2022-04-18 16:43:21 -04:00