transformers/docs/source
Marc Sun 55db70c63d
GPTQ integration (#25062)
* GTPQ integration

* Add tests for gptq

* support for more quantization model

* fix style

* typo

* fix method

* Update src/transformers/modeling_utils.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add dataclass and fix quantization_method

* fix doc

* Update tests/quantization/gptq/test_gptq.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* modify dataclass

* add gtpqconfig import

* fix typo

* fix tests

* remove dataset as req arg

* remove tokenizer import

* add offload cpu quantization test

* fix check dataset

* modify dockerfile

* protect trainer

* style

* test for config

* add more log

* overwrite torch_dtype

* draft doc

* modify quantization_config docstring

* fix class name in docstring

* Apply suggestions from code review

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* more warning

* fix 8bit kwargs tests

* peft compatibility

* remove var

* fix is_gptq_quantized

* remove is_gptq_quantized

* fix wrap

* Update src/transformers/modeling_utils.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* add exllama

* skip test

* overwrite float16

* style

* fix skip test

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix docsting formatting

* add doc

* better test

---------

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
2023-08-10 16:06:29 -04:00
..
de Add bloom flax (#25094) 2023-07-27 18:24:56 +01:00
en GPTQ integration (#25062) 2023-08-10 16:06:29 -04:00
es Update old existing feature extractor references (#24552) 2023-06-29 10:17:36 +01:00
fr Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
it Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
ja Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
ko 🌐 [i18n-KO] Translated philosophy.md to Korean (#25010) 2023-08-10 09:50:51 +02:00
ms Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
pt Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
zh Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
_config.py Adding evaluate to the list of libraries required in generated notebooks (#20850) 2022-12-21 14:04:08 +01:00