transformers/__init__.py at 1dc619e59f4f1103a30a303404a2b0990d45f07c

mirror of https://github.com/huggingface/transformers.git synced 2025-07-05 13:50:13 +06:00

* Add TorchAOHfQuantizer

Summary:
Enable loading torchao quantized model in huggingface.

Test Plan:
local test

Reviewers:

Subscribers:

Tasks:

Tags:

* Fix a few issues

* style

* Added tests and addressed some comments about dtype conversion

* fix torch_dtype warning message

* fix tests

* style

* TorchAOConfig -> TorchAoConfig

* enable offload + fix memory with multi-gpu

* update torchao version requirement to 0.4.0

* better comments

* add torch.compile to torchao README, add perf number link

---------

Co-authored-by: Marc Sun <marc@huggingface.co>

2024-08-14 16:14:24 +02:00

0 lines Python Raw Blame History

0 lines

Python

Raw Blame History