transformers/docs/source
Raushan Turganbay d583f1317b
Quantized KV Cache (#30483)
* clean-up

* Update src/transformers/cache_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/cache_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/cache_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

* Update tests/quantization/quanto_integration/test_quanto.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/generation/configuration_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* more suggestions

* mapping if torch available

* run tests & add 'support_quantized' flag

* fix jamba test

* revert, will be fixed by another PR

* codestyle

* HQQ and versatile cache classes

* final update

* typo

* make tests happy

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
2024-05-23 17:25:20 +05:00
de Fix broken link to Transformers notebooks (#30512) 2024-04-29 10:57:51 +01:00
en Quantized KV Cache (#30483) 2024-05-23 17:25:20 +05:00
es [docs] Spanish translation of model_memory_anatomy.md (#30885) 2024-05-20 16:48:52 -07:00
fr [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00
hi More fixes for doctest (#30265) 2024-04-16 11:58:55 +02:00
it Fix canonical model --model_type in examples (#30480) 2024-05-01 15:47:05 +01:00
ja Update CodeLlama references (#30218) 2024-05-09 22:57:52 +02:00
ko Docs: update example with assisted generation + sample (#30853) 2024-05-16 14:32:21 +01:00
ms Remove old TF port docs (#30426) 2024-04-23 16:06:20 +01:00
pt Fix canonical model --model_type in examples (#30480) 2024-05-01 15:47:05 +01:00
te [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00
tr Translate index.md to Turkish (#27093) 2023-11-08 08:35:20 -05:00
zh Reboot Agents (#30387) 2024-05-07 12:59:49 +02:00
_config.py [#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0.20.1 Fix (#29888) 2024-04-08 14:21:16 +01:00