transformers/tests/quantization
Isotr0py e4628434d8
Add Qwen2 GGUF loading support (#31175)
* add qwen2 gguf support

* Update docs

* fix qwen2 tokenizer

* add qwen2 gguf test

* fix typo in qwen2 gguf test

* format code

* Remove mistral, clarify the error message

* format code

* add typing and update docstring
2024-06-03 14:55:10 +01:00
aqlm_integration    Cache: Static cache as a standalone object (#30476)               2024-04-30 16:37:19 +01:00
autoawq             FIX / TST: Fix expected results on Mistral AWQ test (#30971)      2024-05-24 14:06:31 +02:00
bnb                 FIX / Quantization: Add extra validation for bnb config (#31135)  2024-05-30 11:45:03 +02:00
eetq_integration    [FEAT]: EETQ quantizer support (#30262)                           2024-04-22 20:38:58 +01:00
ggml                Add Qwen2 GGUF loading support (#31175)                           2024-06-03 14:55:10 +01:00
gptq                [GPTQ] Fix test (#28018)                                          2024-01-15 11:22:54 -05:00
hqq                 Quantization / HQQ: Fix HQQ tests on our runner (#30668)          2024-05-06 11:33:52 +02:00
quanto_integration  Fix quantized cache output (#31143)                               2024-05-31 12:08:55 +02:00