transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 17:22:25 +06:00

Jerry Zhang 78d78cdf8a Add TorchAOHfQuantizer (#32306)	2024-08-14 16:14:24 +02:00

* Add TorchAOHfQuantizer
  Summary: Enable loading torchao quantized model in huggingface.
  Test Plan: local test
  Reviewers: Subscribers: Tasks: Tags:
* Fix a few issues
* style
* Added tests and addressed some comments about dtype conversion
* fix torch_dtype warning message
* fix tests
* style
* TorchAOConfig -> TorchAoConfig
* enable offload + fix memory with multi-gpu
* update torchao version requirement to 0.4.0
* better comments
* add torch.compile to torchao README, add perf number link

Co-authored-by: Marc Sun <marc@huggingface.co>
de	Skip tests properly (#31308)	2024-06-26 21:59:08 +01:00
en	Add TorchAOHfQuantizer (#32306)	2024-08-14 16:14:24 +02:00
es	🚨 No more default chat templates (#31733)	2024-07-24 17:36:32 +01:00
fr	Add French version of run scripts tutorial (#31483)	2024-06-28 18:02:30 +02:00
hi	More fixes for doctest (#30265)	2024-04-16 11:58:55 +02:00
it	Docs / Quantization: Replace all occurences of `load_in_8bit` with bnb config (#31136)	2024-05-30 16:47:35 +02:00
ja	Cleanup tool calling documentation and rename doc (#32337)	2024-08-12 16:20:14 +01:00
ko	🌐 [i18n-KO] Translated `awq.md`to Korean (#32324)	2024-08-12 10:12:48 -07:00
ms	Remove old TF port docs (#30426)	2024-04-23 16:06:20 +01:00
pt	Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016)	2024-05-29 11:55:43 +01:00
te	docs: fix broken link (#31370)	2024-06-12 11:33:00 +01:00
tr	Translate index.md to Turkish (#27093)	2023-11-08 08:35:20 -05:00
zh	Fix issue #32518: Update llm_tutorial.md (#32523)	2024-08-08 10:54:02 +01:00
_config.py	[#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888)	2024-04-08 14:21:16 +01:00