transformers/tests/quantization
Andrei Panferov e3fc90ae68
Cleaner Cache dtype and device extraction for CUDA graph generation for quantizers compatibility (#29079)
* input_layernorm as the beacon of hope

* cleaner dtype extraction

* AQLM + CUDA graph test

* is available check

* shorter text test
2024-02-27 09:32:39 +01:00
..
aqlm_integration Cleaner Cache dtype and device extraction for CUDA graph generation for quantizers compatibility (#29079) 2024-02-27 09:32:39 +01:00
autoawq HfQuantizer class for quantization-related stuff in modeling_utils.py (#26610) 2024-01-30 02:48:25 +01:00
bnb FIX [bnb / tests] Propagate the changes from #29092 to 4-bit tests (#29122) 2024-02-20 11:11:15 +01:00
gptq [GPTQ] Fix test (#28018) 2024-01-15 11:22:54 -05:00