transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 21:30:07 +06:00

History

fxmarty-amd 1a374799ce Support loading Quark quantized models in Transformers (#36372 ) * add quark quantizer * add quark doc * clean up doc * fix tests * make style * more style fixes * cleanup imports * cleaning * precise install * Update docs/source/en/quantization/quark.md Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update tests/quantization/quark_integration/test_quark.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update src/transformers/utils/quantization_config.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * remove import guard as suggested * update copyright headers * add quark to transformers-quantization-latest-gpu Dockerfile * make tests pass on transformers main + quark==0.7 * add missing F8_E4M3 and F8_E5M2 keys from str_to_torch_dtype --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Bowen Bao <bowenbao@amd.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>		2025-03-20 15:40:51 +01:00
..
aqlm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
awq.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bitnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bitsandbytes.md	Add Space to Bitsandbytes doc (#36834 )	2025-03-19 18:56:07 +01:00
compressed_tensors.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
contribute.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
eetq.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
fbgemm_fp8.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
finegrained_fp8.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gptq.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
higgs.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
hqq.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
optimum.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
overview.md	Support loading Quark quantized models in Transformers (#36372 )	2025-03-20 15:40:51 +01:00
quanto.md	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
quark.md	Support loading Quark quantized models in Transformers (#36372 )	2025-03-20 15:40:51 +01:00
spqr.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
torchao.md	Disable inductor config setter by default (#36608 )	2025-03-20 11:23:14 +01:00
vptq.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00