transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-06 06:10:04 +06:00


Wenhua Cheng b3492ff9f7 Add AutoRound quantization support (#37393)	2025-04-22 13:56:54 +02:00

* add auto-round support
* Update src/transformers/quantizers/auto.py
  Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
* fix style issue
  Signed-off-by: wenhuach <wenhuach87@gmail.com>
* tiny change
* tiny change
* refine ut and doc
* revert unnecessary change
* tiny change
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* try to fix style issue
* fix doc issue
* Update tests/quantization/autoround/test_auto_round.py
* fix comments
* Update tests/quantization/autoround/test_auto_round.py
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/autoround/test_auto_round.py
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* update doc
* Update src/transformers/quantizers/quantizer_auto_round.py
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* update
* update
* fix
* try to fix style issue
* Update src/transformers/quantizers/auto.py
  Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
  Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
  Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* Update docs/source/en/quantization/auto_round.md
  Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
* update
* fix style issue
* update doc
* update doc
* Refine the doc
* refine doc
* revert one change
* set sym to True by default
* Enhance the unit test's robustness.
* update
* add torch dtype
* tiny change
* add awq convert test
* fix typo
* update
* fix packing format issue
* use one gpu

---------

Signed-off-by: wenhuach <wenhuach87@gmail.com>
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
Co-authored-by: Shen, Haihao <haihao.shen@intel.com>
..
backbones.md	doc: fix broken BEiT and DiNAT model links on Backbone page (#32029)	2024-07-17 20:24:10 +01:00
callback.md	Integrate SwanLab for offline/online experiment tracking and local visualization (#36433)	2025-03-06 17:35:30 +01:00
configuration.md	Migrate doc files to Markdown. (#24376)	2023-06-20 18:07:47 -04:00
data_collator.md	Move `DataCollatorForMultipleChoice` from the docs to the package (#34763)	2025-02-13 12:01:28 +01:00
deepspeed.md	DeepSpeed github repo move sync (#36021)	2025-02-05 08:19:31 -08:00
executorch.md	Fix flax failures (#33912)	2024-10-11 14:38:35 +02:00
feature_extractor.md	Fixed typos (#26810)	2023-10-16 09:52:29 +02:00
image_processor.md	Fall back to slow image processor in ImageProcessingAuto when no fast processor available (#34785)	2024-12-15 14:00:36 -05:00
keras_callbacks.md	Migrate doc files to Markdown. (#24376)	2023-06-20 18:07:47 -04:00
logging.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
model.md	Speedup model init on CPU (by 10x+ for llama-3-8B as one example) (#31771)	2024-07-16 09:32:01 -04:00
onnx.md	Migrate doc files to Markdown. (#24376)	2023-06-20 18:07:47 -04:00
optimizer_schedules.md	Just import torch AdamW instead (#36177)	2025-03-19 18:29:40 +00:00
output.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00
peft.md	[docs] Redesign (#31757)	2025-03-03 10:33:46 -08:00
pipelines.md	Add image text to text pipeline (#34170)	2024-10-31 15:48:11 -04:00
processors.md	[docs] fixed links with 404 (#27327)	2023-11-06 19:45:03 +00:00
quantization.md	Add AutoRound quantization support (#37393)	2025-04-22 13:56:54 +02:00
text_generation.md	Add SynthID (watermerking by Google DeepMind) (#34350)	2024-10-23 21:18:52 +01:00
tokenizer.md	VLM: special multimodal Tokenizer (#34461)	2024-11-04 16:37:51 +01:00
trainer.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350)	2024-09-09 10:47:24 +02:00