transformers/docs/source/en/quantization
Jesse Cai e1812864ab
[docs] Add int4wo + 2:4 sparsity example to TorchAO README (#38592)
* update quantization readme

* update

---------

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
2025-06-12 12:17:07 +00:00
aqlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
auto_round.md Fix auto-round hfoption (#37759) 2025-04-24 18:19:38 +02:00
awq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bitnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bitsandbytes.md Refactor bitsandbytes doc (#37668) 2025-04-22 16:13:25 +02:00
compressed_tensors.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
concept_guide.md Update quantization docs (#37439) 2025-04-16 15:44:53 +02:00
contribute.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
eetq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fbgemm_fp8.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
finegrained_fp8.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gptq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
higgs.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
hqq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
optimum.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
overview.md Docs: update bitsandbytes torch.compile compatibility (#38651) 2025-06-09 14:51:57 -04:00
quanto.md fix typos in the docs directory (#36639) 2025-03-11 09:41:41 -07:00
quark.md Support loading Quark quantized models in Transformers (#36372) 2025-03-20 15:40:51 +01:00
selecting.md Update quantization docs (#37439) 2025-04-16 15:44:53 +02:00
spqr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
torchao.md [docs] Add int4wo + 2:4 sparsity example to TorchAO README (#38592) 2025-06-12 12:17:07 +00:00
vptq.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00