transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 21:00:08 +06:00

jiqing-feng b916efcb3c Enables CPU AWQ model with IPEX version. (#33460)	2024-10-04 16:25:10 +02:00

* enable cpu awq ipex linear
* add doc for cpu awq with ipex kernel
* add tests for cpu awq
* fix code style
* fix doc and tests
* Update docs/source/en/quantization/awq.md
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/autoawq/test_awq.py
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* fix comments
* fix log
* fix log
* fix style

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
aqlm.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
awq.md	Enables CPU AWQ model with IPEX version. (#33460)	2024-10-04 16:25:10 +02:00
bitsandbytes.md	Enable BNB multi-backend support (#31098)	2024-09-24 03:40:56 -06:00
compressed_tensors.md	HFQuantizer implementation for compressed-tensors library (#31704)	2024-09-25 14:31:38 +02:00
contribute.md	Docs / Quantization: refactor quantization documentation (#30942)	2024-05-23 14:31:52 +02:00
eetq.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
fbgemm_fp8.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
gptq.md	Docs / Quantization: refactor quantization documentation (#30942)	2024-05-23 14:31:52 +02:00
hqq.md	Hqq serialization (#33141)	2024-09-30 14:47:18 +02:00
optimum.md	Docs / Quantization: refactor quantization documentation (#30942)	2024-05-23 14:31:52 +02:00
overview.md	HFQuantizer implementation for compressed-tensors library (#31704)	2024-09-25 14:31:38 +02:00
quanto.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
torchao.md	Enable non-safetensor ser/deser for TorchAoConfig quantized model 🔴 (#33456)	2024-09-30 11:30:29 +02:00