transformers/docs/source/en/quantization
Marc Sun 96a074fa7e
Add new quant method (#32047)
* Add new quant method

* update

* fix multi-device

* add test

* add offload

* style

* style

* add simple example

* initial doc

* docstring

* style again

* works ?

* better docs

* switch to non persistant

* remove print

* fix init

* code review
2024-07-22 20:21:59 +02:00
..
aqlm.md Docs / Quantization: refactor quantization documentation (#30942) 2024-05-23 14:31:52 +02:00
awq.md docs: fix broken link (#31370) 2024-06-12 11:33:00 +01:00
bitsandbytes.md Docs / Quantization: refactor quantization documentation (#30942) 2024-05-23 14:31:52 +02:00
contribute.md Docs / Quantization: refactor quantization documentation (#30942) 2024-05-23 14:31:52 +02:00
eetq.md Docs / Quantization: refactor quantization documentation (#30942) 2024-05-23 14:31:52 +02:00
fbgemm_fp8.md Add new quant method (#32047) 2024-07-22 20:21:59 +02:00
gptq.md Docs / Quantization: refactor quantization documentation (#30942) 2024-05-23 14:31:52 +02:00
hqq.md Docs / Quantization: refactor quantization documentation (#30942) 2024-05-23 14:31:52 +02:00
optimum.md Docs / Quantization: refactor quantization documentation (#30942) 2024-05-23 14:31:52 +02:00
overview.md Add new quant method (#32047) 2024-07-22 20:21:59 +02:00
quanto.md Docs / Quantization: refactor quantization documentation (#30942) 2024-05-23 14:31:52 +02:00