transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 17:52:35 +06:00

Wenhua Cheng b3492ff9f7 Add AutoRound quantization support (#37393)	2025-04-22 13:56:54 +02:00

* add auto-round support
* Update src/transformers/quantizers/auto.py
  Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
* fix style issue
  Signed-off-by: wenhuach <wenhuach87@gmail.com>
* tiny change
* tiny change
* refine ut and doc
* revert unnecessary change
* tiny change
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* fix doc issue
* Update tests/quantization/autoround/test_auto_round.py
* fix comments
* Update tests/quantization/autoround/test_auto_round.py
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/autoround/test_auto_round.py
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* update doc
* Update src/transformers/quantizers/quantizer_auto_round.py
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* update
* update
* fix
* try to fix style issue
* Update src/transformers/quantizers/auto.py
  Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
  Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
  Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
  Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* update
* fix style issue
* update doc
* update doc
* Refine the doc
* refine doc
* revert one change
* set sym to True by default
* Enhance the unit test's robustness.
* update
* add torch dtype
* tiny change
* add awq convert test
* fix typo
* update
* fix packing format issue
* use one gpu

---------

Signed-off-by: wenhuach <wenhuach87@gmail.com>
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
Co-authored-by: Shen, Haihao <haihao.shen@intel.com>

..
aqlm_integration	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
autoawq	enable several cases on XPU (#37516)	2025-04-16 11:01:04 +02:00
autoround	Add AutoRound quantization support (#37393)	2025-04-22 13:56:54 +02:00
bitnet_integration	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
bnb	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
compressed_tensors_integration	Fix: Unexpected Keys, Improve `run_compressed`, Rename Test Folder (#37077)	2025-04-04 21:30:11 +02:00
eetq_integration	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
fbgemm_fp8	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
finegrained_fp8	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
ggml	Support loading Gemma3 QAT GGUF models (#37649)	2025-04-22 11:23:17 +02:00
gptq	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
higgs	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
hqq	Fixes hqq by following a new path for bias parameter in pre_quantized models (#37530)	2025-04-16 13:58:14 +02:00
quanto_integration	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
quark_integration	Fix Quark quantization config (#37578)	2025-04-18 07:23:39 +02:00
spqr_integration	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
torchao_integration	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00
vptq_integration	Use Python 3.9 syntax in tests (#37343)	2025-04-08 14:12:08 +02:00