transformers/tests/quantization/ggml
Younes Belkada 6d4306160a
GGUF: Fix llama 3 GGUF (#31358)
* Create push-important-models.yml

* llama3 support for GGUF

* fixup

* Update src/transformers/integrations/ggml.py

* fix pre-tokenizer

* fix

* fix

* fix

* fix

* fix

* fix

* address final comment

* handle special tokens + add tests
2024-06-20 14:29:58 +02:00
..
__init__.py Loading GGUF files support (#30391) 2024-05-15 14:28:20 +02:00
test_ggml.py GGUF: Fix llama 3 GGUF (#31358) 2024-06-20 14:29:58 +02:00