Yih-Dar
|
6e3063422c
|
Uninstall kernels for AMD docker images (#38354)
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
Uninstall kernels for AMD docker images
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
|
2025-05-25 19:42:25 +02:00 |
|
ivarflakstad
|
936aeb70ab
|
AMD DeepSpeed image additional HIP dependencies (#36195)
* Add hipsolver and hipblastlt as dependencies
* Upgrade torch libs with rocm6.2.4 index
|
2025-02-17 11:50:49 +01:00 |
|
ivarflakstad
|
96625d85fd
|
Use rocm6.2 for AMD images (#35930)
* Use rocm6.2 as rocm6.3 only has nightly pytorch wheels atm
* Use stable wheel index for torch libs
|
2025-01-28 11:10:28 +01:00 |
|
ivarflakstad
|
a50befa9b9
|
Update deepspeed amd image (#35906)
|
2025-01-27 14:32:36 +01:00 |
|
Sai-Suraj-27
|
3562772969
|
fix: Fixed pydantic required version in dockerfiles to make it compatible with DeepSpeed (#33105)
Fixed pydantic required version in dockerfiles.
|
2024-08-26 17:10:36 +02:00 |
|
Ilyas Moutawwakil
|
07d79520ef
|
Disable AMD memory benchmarks (#29871)
* remove py3nvml to skip amd memory benchmarks
* uninstall pynvml from docker images
|
2024-03-26 14:43:12 +01:00 |
|
Ella Charlaix
|
39acfe84ba
|
Add deepspeed test to amd scheduled CI (#27633)
* add deepspeed scheduled test for amd
* fix image
* add dockerfile
* add comment
* enable tests
* trigger
* remove trigger for this branch
* trigger
* change runner env to trigger the docker build image test
* use new docker image
* remove test suffix from docker image tag
* replace test docker image with original image
* push new image
* Trigger
* add back amd tests
* fix typo
* add amd tests back
* fix
* comment until docker image build scheduled test fix
* remove deprecated deepspeed build option
* upgrade torch
* update docker & make tests pass
* Update docker/transformers-pytorch-deepspeed-amd-gpu/Dockerfile
* fix
* tmp disable test
* precompile deepspeed to avoid timeout during tests
* fix comment
* trigger deepspeed tests with new image
* comment tests
* trigger
* add sklearn dependency to fix slow tests
* enable back other tests
* final update
---------
Co-authored-by: Felix Marty <felix@hf.co>
Co-authored-by: Félix Marty <9808326+fxmarty@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
|
2023-12-11 16:33:36 +01:00 |
|