transformers/docs/source/en/quantization
fxmarty-amd 1a374799ce
Support loading Quark quantized models in Transformers (#36372)
* add quark quantizer

* add quark doc

* clean up doc

* fix tests

* make style

* more style fixes

* cleanup imports

* cleaning

* precise install

* Update docs/source/en/quantization/quark.md

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update tests/quantization/quark_integration/test_quark.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/utils/quantization_config.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* remove import guard as suggested

* update copyright headers

* add quark to transformers-quantization-latest-gpu Dockerfile

* make tests pass on transformers main + quark==0.7

* add missing F8_E4M3 and F8_E5M2 keys from str_to_torch_dtype

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Bowen Bao <bowenbao@amd.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
2025-03-20 15:40:51 +01:00
..
aqlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
awq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bitnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bitsandbytes.md Add Space to Bitsandbytes doc (#36834) 2025-03-19 18:56:07 +01:00
compressed_tensors.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
contribute.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
eetq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fbgemm_fp8.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
finegrained_fp8.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gptq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
higgs.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
hqq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
optimum.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
overview.md Support loading Quark quantized models in Transformers (#36372) 2025-03-20 15:40:51 +01:00
quanto.md fix typos in the docs directory (#36639) 2025-03-11 09:41:41 -07:00
quark.md Support loading Quark quantized models in Transformers (#36372) 2025-03-20 15:40:51 +01:00
spqr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
torchao.md Disable inductor config setter by default (#36608) 2025-03-20 11:23:14 +01:00
vptq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00