transformers/docs/source/en/quantization
Jerry Zhang 7eb1107cc2
Restructure torchao quantization examples (#37592)
* Restructure torchao quantization examples

Summary:
Mainly structured the examples by hardwares and then listed
the recommended quantization methods for each hardware H100 GPU, A100 GPU and CPU

Also added example for push_to_hub

Test Plan:
not required

Reviewers:

Subscribers:

Tasks:

Tags:

* update

* drop float8 cpu

* address comments and simplify

* small update

* link update

* minor update
2025-04-22 11:20:34 +02:00
..
aqlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
awq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bitnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bitsandbytes.md Add Space to Bitsandbytes doc (#36834) 2025-03-19 18:56:07 +01:00
compressed_tensors.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
concept_guide.md Update quantization docs (#37439) 2025-04-16 15:44:53 +02:00
contribute.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
eetq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fbgemm_fp8.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
finegrained_fp8.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gptq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
higgs.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
hqq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
optimum.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
overview.md [doc] Fix link for Quark quantization page (#37179) 2025-04-01 20:57:38 +02:00
quanto.md fix typos in the docs directory (#36639) 2025-03-11 09:41:41 -07:00
quark.md Support loading Quark quantized models in Transformers (#36372) 2025-03-20 15:40:51 +01:00
selecting.md Update quantization docs (#37439) 2025-04-16 15:44:53 +02:00
spqr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
torchao.md Restructure torchao quantization examples (#37592) 2025-04-22 11:20:34 +02:00
vptq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00