transformers/docs/source/en/quantization
Wenhua Cheng b3492ff9f7
Add AutoRound quantization support (#37393)
* add auto-round support

* Update src/transformers/quantizers/auto.py

Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>

* fix style issue

Signed-off-by: wenhuach <wenhuach87@gmail.com>

* tiny change

* tiny change

* refine ut and doc

* revert unnecessary change

* tiny change

* try to fix style issue

* try to fix style issue

* try to fix style issue

* try to fix style issue

* try to fix style issue

* try to fix style issue

* try to fix style issue

* fix doc issue

* Update tests/quantization/autoround/test_auto_round.py

* fix comments

* Update tests/quantization/autoround/test_auto_round.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update tests/quantization/autoround/test_auto_round.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* update doc

* Update src/transformers/quantizers/quantizer_auto_round.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* update

* update

* fix

* try to fix style issue

* Update src/transformers/quantizers/auto.py

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

* Update docs/source/en/quantization/auto_round.md

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

* Update docs/source/en/quantization/auto_round.md

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

* Update docs/source/en/quantization/auto_round.md

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

* update

* fix style issue

* update doc

* update doc

* Refine the doc

* refine doc

* revert one change

* set sym to True by default

* Enhance the unit test's robustness.

* update

* add torch dtype

* tiny change

* add awq convert test

* fix typo

* update

* fix packing format issue

* use one gpu

---------

Signed-off-by: wenhuach <wenhuach87@gmail.com>
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
Co-authored-by: Shen, Haihao <haihao.shen@intel.com>
2025-04-22 13:56:54 +02:00
aqlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
auto_round.md Add AutoRound quantization support (#37393) 2025-04-22 13:56:54 +02:00
awq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bitnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bitsandbytes.md Add Space to Bitsandbytes doc (#36834) 2025-03-19 18:56:07 +01:00
compressed_tensors.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
concept_guide.md Update quantization docs (#37439) 2025-04-16 15:44:53 +02:00
contribute.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
eetq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fbgemm_fp8.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
finegrained_fp8.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gptq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
higgs.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
hqq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
optimum.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
overview.md Add AutoRound quantization support (#37393) 2025-04-22 13:56:54 +02:00
quanto.md fix typos in the docs directory (#36639) 2025-03-11 09:41:41 -07:00
quark.md Support loading Quark quantized models in Transformers (#36372) 2025-03-20 15:40:51 +01:00
selecting.md Update quantization docs (#37439) 2025-04-16 15:44:53 +02:00
spqr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
torchao.md [qwen-omni] fix training (#37517) 2025-04-22 12:36:07 +02:00
vptq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00