transformers/docs/source/en/main_classes
Younes Belkada ae093eef01
[core / Quantization ] AWQ integration (#27045)
* working v1

* oops

* Update src/transformers/modeling_utils.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* fixup

* oops

* push

* more changes

* add docs

* some fixes

* fix copies

* add v1 doc

* added installation guide

* relax constraints

* revert

* attempt llm-awq

* oops

* oops

* fixup

* raise error when incorrect cuda compute capability

* nit

* add instructions for llm-awq

* fixup

* fix copies

* fixup and docs

* change

* few changes + add demo

* add v1 tests

* add autoawq in dockerfile

* finalize

* Update tests/quantization/autoawq/test_awq.py

* fix test

* fix

* fix issue

* Update src/transformers/integrations/awq.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update docs/source/en/main_classes/quantization.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update docs/source/en/main_classes/quantization.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/integrations/awq.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/integrations/awq.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* add link to example script

* Update docs/source/en/main_classes/quantization.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* add more content

* add more details

* add link to quantization docs

* camel case + change backend class name

* change to string

* fixup

* raise errors if libs not installed

* change to `bits` and `group_size`

* nit

* nit

* Apply suggestions from code review

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* disable training

* address some comments and fix nits

* fix

* final nits and fix tests

* adapt to our new runners

* make fix-copies

* Update src/transformers/utils/quantization_config.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/utils/quantization_config.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/integrations/awq.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/integrations/awq.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* move to top

* add conversion test

* final nit

* add more elaborated test

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-11-01 09:06:31 +01:00
..
agent.md [doc] Always call it Agents for consistency (#25958) 2023-09-05 12:27:20 +01:00
callback.md Update docs to explain disabling callbacks using report_to (#26155) 2023-10-11 07:50:23 -04:00
configuration.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
data_collator.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
deepspeed.md Fix Typo: table in deepspeed.md (#26705) 2023-10-10 11:50:10 +02:00
feature_extractor.md Fixed typos (#26810) 2023-10-16 09:52:29 +02:00
image_processor.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
keras_callbacks.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
logging.md Warnings controlled by logger level (#26527) 2023-10-12 10:48:38 +02:00
model.md Fix typo 'submosules' (#24809) 2023-07-13 16:56:53 +01:00
onnx.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
optimizer_schedules.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
output.md Translating en/main_classes folder docs to Japanese 🇯🇵 (#26894) 2023-10-30 09:39:14 -07:00
pipelines.md [docs] Add MaskGenerationPipeline in docs (#27063) 2023-10-25 19:31:36 +02:00
processors.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
quantization.md [core / Quantization ] AWQ integration (#27045) 2023-11-01 09:06:31 +01:00
text_generation.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
tokenizer.md Tweaks to Chat Templates docs (#26168) 2023-09-15 12:50:57 +01:00
trainer.md [FEAT] Add Neftune into transformers Trainer (#27141) 2023-10-31 16:03:59 +01:00