transformers/tests/quantization/fbgemm_fp8
Marc Sun 96a074fa7e
Add new quant method (#32047)
* Add new quant method

* update

* fix multi-device

* add test

* add offload

* style

* style

* add simple example

* initial doc

* docstring

* style again

* works ?

* better docs

* switch to non persistant

* remove print

* fix init

* code review
2024-07-22 20:21:59 +02:00
..
__init__.py Add new quant method (#32047) 2024-07-22 20:21:59 +02:00
test_fbgemm_fp8.py Add new quant method (#32047) 2024-07-22 20:21:59 +02:00