transformers/tests/quantization
2024-04-05 13:11:28 +02:00
..
aqlm_integration Cleaner Cache dtype and device extraction for CUDA graph generation for quantizers compatibility (#29079) 2024-02-27 09:32:39 +01:00
autoawq Exllama kernels support for AWQ models (#28634) 2024-03-05 03:22:48 +01:00
bnb [bnb] Fix offload test (#30039) 2024-04-05 13:11:28 +02:00
gptq [GPTQ] Fix test (#28018) 2024-01-15 11:22:54 -05:00
quanto_integration [Quantization] Quanto quantizer (#29023) 2024-03-15 11:51:29 -04:00