transformers/__init__.py at 19b9d8ae13c99554e5c857c8534f87b7951bf821

mirror of https://github.com/huggingface/transformers.git synced 2025-07-13 17:48:22 +06:00

Improve model loading for compressed tensor models (#36152 )

* Disable warnings for stacked compressors
* Introduce two new hooks in HfQuantizer lifecycle
to allow updates to missing and unexpected keys
* Update missing and unexpected keys
for stacked compressors
* Add tests
* Fix: run_compressed cases
* Fix: uncompressed cases

* Rename compressed_tensor folder to compressed_tensors
Move RunCompressedTest to the same file
Update tests to unittest

2025-02-24 13:47:21 +01:00

0 lines Python Raw Blame History

0 lines

Python

Raw Blame History