mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-04 05:10:06 +06:00
![]() * Restructure torchao quantization examples Summary: Mainly structured the examples by hardwares and then listed the recommended quantization methods for each hardware H100 GPU, A100 GPU and CPU Also added example for push_to_hub Test Plan: not required Reviewers: Subscribers: Tasks: Tags: * update * drop float8 cpu * address comments and simplify * small update * link update * minor update |
||
---|---|---|
.. | ||
aqlm.md | ||
awq.md | ||
bitnet.md | ||
bitsandbytes.md | ||
compressed_tensors.md | ||
concept_guide.md | ||
contribute.md | ||
eetq.md | ||
fbgemm_fp8.md | ||
finegrained_fp8.md | ||
gptq.md | ||
higgs.md | ||
hqq.md | ||
optimum.md | ||
overview.md | ||
quanto.md | ||
quark.md | ||
selecting.md | ||
spqr.md | ||
torchao.md | ||
vptq.md |