transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 21:30:07 +06:00

mobicham f5247aca01 Hqq serialization (#33141)	2024-09-30 14:47:18 +02:00

* HQQ model serialization attempt
* fix hqq dispatch and unexpected keys
* style
* remove check_old_param
* revert to check HQQLinear in quantizer_hqq.py
* revert to check HQQLinear in quantizer_hqq.py
* update HqqConfig default params
* make ci happy
* make ci happy
* revert to HQQLinear check in quantizer_hqq.py
* check hqq_min version 0.2.0
* set axis=1 as default in quantization_config.py
* validate_env with hqq>=0.2.0 version message
* deprecated hqq kwargs message
* make ci happy
* remove run_expected_keys_check hack + bump to 0.2.1 min hqq version
* fix unexpected_keys hqq update
* add pre_quantized check
* add update_expected_keys to base quantizerr
* ci base.py fix?
* ci base.py fix?
* fix "quantization typo" src/transformers/utils/quantization_config.py
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix post merge

---------

Co-authored-by: Marc Sun <marc@huggingface.co>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
aqlm.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
awq.md	docs: fix broken link (#31370)	2024-06-12 11:33:00 +01:00
bitsandbytes.md	Enable BNB multi-backend support (#31098)	2024-09-24 03:40:56 -06:00
compressed_tensors.md	HFQuantizer implementation for compressed-tensors library (#31704)	2024-09-25 14:31:38 +02:00
contribute.md	Docs / Quantization: refactor quantization documentation (#30942)	2024-05-23 14:31:52 +02:00
eetq.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
fbgemm_fp8.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
gptq.md	Docs / Quantization: refactor quantization documentation (#30942)	2024-05-23 14:31:52 +02:00
hqq.md	Hqq serialization (#33141)	2024-09-30 14:47:18 +02:00
optimum.md	Docs / Quantization: refactor quantization documentation (#30942)	2024-05-23 14:31:52 +02:00
overview.md	HFQuantizer implementation for compressed-tensors library (#31704)	2024-09-25 14:31:38 +02:00
quanto.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
torchao.md	Enable non-safetensor ser/deser for TorchAoConfig quantized model 🔴 (#33456)	2024-09-30 11:30:29 +02:00