transformers/tests/deepspeed
Ilyas Moutawwakil 89f6956015
HPU support (#36424)
* test

* fix

* fix

* skip some and run some first

* test fsdp

* fix

* patches for generate

* test distributed

* copy

* don't test distributed loss for hpu

* require fp16 and run first

* changes from marc's PR fixing zero3

* better alternative

* return True when fp16 support on gaudi without creating bridge

* fix

* fix tested dtype in deepspeed inference test

* test

* fix

* test

* fix

* skip

* require fp16

* run first fsdp

* Apply suggestions from code review

* address comments

* address comments and refactor test

* reduce precison

* avoid doing gaudi1 specific stuff in the genreation loop

* document test_gradient_accumulation_loss_alignment_with_model_loss test a bit more
2025-03-12 09:08:12 +01:00
..
ds_config_zero2.json [Deepspeed] add support for bf16 mode (#14569) 2022-03-11 17:53:53 -08:00
ds_config_zero3.json Update ds_config_zero3.json (#30829) 2024-05-15 10:02:31 -04:00
test_deepspeed.py HPU support (#36424) 2025-03-12 09:08:12 +01:00
test_model_zoo.py Pass datasets trust_remote_code (#31406) 2024-06-17 17:29:13 +01:00
vit_feature_extractor.json missing file (#17164) 2022-05-10 10:19:50 -07:00