transformers/tests/quantization/ggml
Penut Chen 1c122a46dc
Support dequantizing GGUF FP16 format (#31783)
* support gguf fp16

* support gguf bf16 with pytorch

* add gguf f16 test

* remove bf16
2024-07-24 17:59:59 +02:00
__init__.py     Loading GGUF files support (#30391)               2024-05-15 14:28:20 +02:00
test_ggml.py    Support dequantizing GGUF FP16 format (#31783)    2024-07-24 17:59:59 +02:00
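
The latest commit adds FP16 dequantization for GGUF tensors. As a rough illustration of what that step involves, the sketch below assumes F16 tensor data is stored as raw little-endian IEEE 754 half-precision values, so "dequantizing" reduces to reinterpreting the bytes and widening them. The function name `dequantize_f16` is hypothetical and is not taken from the transformers code base; only the standard library is used.

```python
import struct


def dequantize_f16(raw: bytes) -> list:
    """Decode little-endian half-precision values into Python floats.

    Assumption: the buffer holds plain 2-byte IEEE 754 halves with no
    block structure or scale factors (unlike quantized GGUF types).
    """
    if len(raw) % 2 != 0:
        raise ValueError("F16 data must contain a whole number of 2-byte values")
    count = len(raw) // 2
    # 'e' is the struct format character for a 16-bit float.
    return list(struct.unpack(f"<{count}e", raw))


# Build a small buffer of halves and decode it again.
raw = struct.pack("<3e", 1.0, -2.5, 0.5)
print(dequantize_f16(raw))  # [1.0, -2.5, 0.5]
```

A real loader would typically produce an array or tensor rather than a Python list; the list here only keeps the sketch dependency-free.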