transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 12:50:06 +06:00

Author	SHA1	Message	Date
Yih-Dar	6e3063422c	Uninstall `kernels` for AMD docker images (#38354 ) Uninstall kernels for AMD docker images Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-25 19:42:25 +02:00
Ákos Hadnagy	9895819514	Update CI Docker base image for AMD tests (#38261 ) use newer Pytorch base image for AMD CI tests	2025-05-22 16:38:40 +02:00
ivarflakstad	f4684a6eb2	Update amd pytorch index to match base image (#36347 ) pip pytorch index should match docker base image	2025-02-24 16:17:20 +01:00
ivarflakstad	847854b023	Add git LFS to AMD docker image (#36016 ) Add git lfs to AMD docker image	2025-02-12 22:27:21 +01:00
ivarflakstad	96625d85fd	Use rocm6.2 for AMD images (#35930 ) * Use rocm6.2 as rocm6.3 only has nightly pytorch wheels atm * Use stable wheel index for torch libs	2025-01-28 11:10:28 +01:00
ivarflakstad	8e4cedd9ca	Update AMD Docker image (#35804 )	2025-01-21 12:11:23 +01:00
ivarflakstad	bc6ae0d55e	Update AMD docker image (rocm 6.1) (#35259 ) * Use rocm 6.3 as base amd image and add nvidia-ml-py to exclude list * Align rocm base image with torch wheels @6.1. Seems like the most stable combo	2024-12-13 15:41:03 +01:00
fxmarty	37bba2a32d	CI: update to ROCm 6.0.2 and test MI300 (#30266 ) * update to ROCm 6.0.2 and test MI300 * add callers for mi300 * update dockerfile * fix trainer tests * remove apex * style * Update tests/trainer/test_trainer_seq2seq.py * Update tests/trainer/test_trainer_seq2seq.py * Update tests/trainer/test_trainer_seq2seq.py * Update tests/trainer/test_trainer_seq2seq.py * update to torch 2.3 * add workflow dispatch target * we may need branches: mi300-ci after all * nit * fix docker build * nit * add check runner * remove docker-gpu * fix issues * fix --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-05-13 18:14:36 +02:00
Ilyas Moutawwakil	07d79520ef	Disable AMD memory benchmarks (#29871 ) * remove py3nvml to skip amd memory benchmarks * uninstall pynvml from docker images	2024-03-26 14:43:12 +01:00
Ella Charlaix	39acfe84ba	Add deepspeed test to amd scheduled CI (#27633 ) * add deepspeed scheduled test for amd * fix image * add dockerfile * add comment * enable tests * trigger * remove trigger for this branch * trigger * change runner env to trigger the docker build image test * use new docker image * remove test suffix from docker image tag * replace test docker image with original image * push new image * Trigger * add back amd tests * fix typo * add amd tests back * fix * comment until docker image build scheduled test fix * remove deprecated deepspeed build option * upgrade torch * update docker & make tests pass * Update docker/transformers-pytorch-deepspeed-amd-gpu/Dockerfile * fix * tmp disable test * precompile deepspeed to avoid timeout during tests * fix comment * trigger deepspeed tests with new image * comment tests * trigger * add sklearn dependency to fix slow tests * enable back other tests * final update --------- Co-authored-by: Felix Marty <felix@hf.co> Co-authored-by: Félix Marty <9808326+fxmarty@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-12-11 16:33:36 +01:00
fxmarty	f93c1e9ece	Add RoCm scheduled CI & upgrade RoCm CI to PyTorch 2.1 (#26940 ) * add scheduled ci on amdgpu * fix likely typo * more tests, avoid parallelism * precise comment * fix report channel * trigger docker build on this branch * fix * fix * run rocm scheduled ci * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-11-21 14:55:13 +01:00
Funtowicz Morgan	2d71307dc0	Integrate AMD GPU in CI/CD environment (#26007 ) * Add a Dockerfile for PyTorch + ROCm based on official AMD released artifact * Add a new artifact single-amdgpu testing on main * Attempt to test the workflow without merging. * Changed BERT to check if things are triggered * Meet the dependencies graph on workflow * Revert BERT changes * Add check_runners_amdgpu to correctly mount and check availability * Rename setup to setup_gpu for CUDA and add setup_amdgpu for AMD * Fix all the needs.setup -> needs.setup_[gpu|amdgpu] dependencies * Fix setup dependency graph to use check_runner_amdgpu * Let's do the runner status check only on AMDGPU target * Update the Dockerfile.amd to put ourselves in / rather than /var/lib * Restore the whole setup for CUDA too. * Let's redisable them * Change BERT to trigger tests * Restore BERT * Add torchaudio with rocm 5.6 to AMD Dockerfile (#26050) fix dockerfile Co-authored-by: Felix Marty <felix@hf.co> * Place AMD GPU tests in a separate workflow (correct branch) (#26105) AMDGPU CI lives in an other workflow * Fix invalid job name is dependencies. * Remove tests multi-amdgpu for now. * Use single-amdgpu * Use --net=host for now. * Remote host networking. * Removed duplicated check_runners_amdgpu step * Let's tag machine-types with mi210 for now. * Machine type should be only mi210 * Remove unnecessary push.branches item * Apply review suggestions moving from `x-amdgpu` to `x-gpu` introducing `amd-gpu` and `miXXX` labels. * Remove amdgpu from step names. * finalize * delete --------- Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com> Co-authored-by: Felix Marty <felix@hf.co> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-09-20 14:48:49 +02:00

12 Commits