transformers/tests/quantization
Penut Chen ac946aac25
Fix the incorrect permutation of gguf (#31788)
* Fix the incorrect permutation of gguf

* rename num_kv_heads

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* add typing to num_kv_heads

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* rename variables

* refactor permute function name

* update the expected text of the llama3 q4 test

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2024-07-16 08:20:34 +02:00
..
aqlm_integration Cache: Static cache as a standalone object (#30476) 2024-04-30 16:37:19 +01:00
autoawq Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00
bnb Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00
eetq_integration [FEAT]: EETQ quantizer support (#30262) 2024-04-22 20:38:58 +01:00
ggml Fix the incorrect permutation of gguf (#31788) 2024-07-16 08:20:34 +02:00
gptq 🚨 Remove dataset with restrictive license (#31452) 2024-06-17 17:56:51 +01:00
hqq Quantization / HQQ: Fix HQQ tests on our runner (#30668) 2024-05-06 11:33:52 +02:00
quanto_integration Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00