transformers/tests/quantization/ggml
Penut Chen ac946aac25
Fix the incorrect permutation of gguf (#31788)
* Fix the incorrect permutation of gguf

* rename num_kv_heads

* add typing to num_kv_heads

* rename variables

* refactor permute function name

* update the expected text of the llama3 q4 test

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
2024-07-16 08:20:34 +02:00
__init__.py    Loading GGUF files support (#30391)              2024-05-15 14:28:20 +02:00
test_ggml.py   Fix the incorrect permutation of gguf (#31788)   2024-07-16 08:20:34 +02:00