transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

History

Raushan Turganbay d583f1317b Quantized KV Cache (#30483 ) * clean-up * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/cache_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup * Update tests/quantization/quanto_integration/test_quanto.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/generation/configuration_utils.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * more suggestions * mapping if torch available * run tests & add 'support_quantized' flag * fix jamba test * revert, will be fixed by another PR * codestyle * HQQ and versatile cache classes * final update * typo * make tests happy --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>		2024-05-23 17:25:20 +05:00
..
agents	Reboot Agents (#30387 )	2024-05-07 12:59:49 +02:00
benchmark	[Test refactor 1/5] Per-folder tests reorganization (#15725 )	2022-02-23 15:46:28 -05:00
bettertransformer	Fixed malapropism error (#26660 )	2023-10-09 11:04:57 +02:00
deepspeed	Update ds_config_zero3.json (#30829 )	2024-05-15 10:02:31 -04:00
extended	CI: update to ROCm 6.0.2 and test MI300 (#30266 )	2024-05-13 18:14:36 +02:00
fixtures	Implementation of SuperPoint and AutoModelForKeypointDetection (#28966 )	2024-03-19 14:43:02 +00:00
fsdp	Add FSDP config for CPU RAM efficient loading through accelerate (#30002 )	2024-04-22 13:15:28 +01:00
generation	Quantized KV Cache (#30483 )	2024-05-23 17:25:20 +05:00
models	Paligemma causal attention mask (#30967 )	2024-05-22 19:37:15 +02:00
optimization	Add WSD scheduler (#30231 )	2024-04-25 12:07:21 +01:00
peft_integration	FIX [`CI`]: Fix failing tests for peft integration (#29330 )	2024-02-29 03:56:16 +01:00
pipelines	Using assistant in AutomaticSpeechRecognitionPipeline with different encoder size (#30637 )	2024-05-23 09:59:38 +01:00
quantization	Quantized KV Cache (#30483 )	2024-05-23 17:25:20 +05:00
repo_utils	Allow `# Ignore copy` (#27328 )	2023-12-07 10:00:08 +01:00
sagemaker	update ruff version (#30932 )	2024-05-22 06:40:15 +02:00
tokenization	update ruff version (#30932 )	2024-05-22 06:40:15 +02:00
trainer	Enforce saving at end of training if saving option chosen (#30160 )	2024-05-21 07:50:11 -04:00
utils	🚨 out_indices always a list (#30941 )	2024-05-22 15:23:04 +01:00
__init__.py	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
test_backbone_common.py	Align backbone stage selection with out_indices & out_features (#27606 )	2023-12-20 18:33:17 +00:00
test_cache_utils.py	Generate: add tests for caches with `pad_to_multiple_of` (#29462 )	2024-03-06 10:57:04 +00:00
test_configuration_common.py	[ `PretrainedConfig`] Improve messaging (#27438 )	2023-11-15 14:10:39 +01:00
test_configuration_utils.py	[tests] remove deprecated tests for model loading (#29450 )	2024-03-15 14:18:41 +00:00
test_feature_extraction_common.py	Split common test from core tests (#24284 )	2023-06-15 07:30:24 -04:00
test_feature_extraction_utils.py	[tests] remove deprecated tests for model loading (#29450 )	2024-03-15 14:18:41 +00:00
test_image_processing_common.py	Raise unused kwargs image processor (#29063 )	2024-02-20 16:20:20 +01:00
test_image_processing_utils.py	[tests] remove deprecated tests for model loading (#29450 )	2024-03-15 14:18:41 +00:00
test_image_transforms.py	fix: center_crop occasionally outputs off-by-one dimension matrix (#30934 )	2024-05-21 13:56:52 +01:00
test_modeling_common.py	Fix low cpu mem usage tests (#30808 )	2024-05-22 14:09:01 +02:00
test_modeling_flax_common.py	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
test_modeling_flax_utils.py	Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599 )	2024-01-23 10:28:23 +01:00
test_modeling_tf_common.py	Port IDEFICS to tensorflow (#26870 )	2024-05-13 15:59:46 +01:00
test_modeling_tf_utils.py	Cast bfloat16 to float32 for Numpy conversions (#29755 )	2024-03-21 14:04:11 +00:00
test_modeling_utils.py	Llama: fix custom 4D masks, v2 (#30348 )	2024-05-13 13:46:06 +02:00
test_pipeline_mixin.py	Image Feature Extraction pipeline (#28216 )	2024-02-05 14:50:07 +00:00
test_processing_common.py	Don't save `processor_config.json` if a processor has no extra attribute (#28584 )	2024-01-19 09:59:14 +00:00
test_sequence_feature_extraction_common.py	Fix typo (#25966 )	2023-09-05 10:12:25 +02:00
test_tokenization_common.py	update ruff version (#30932 )	2024-05-22 06:40:15 +02:00
test_tokenization_utils.py	[tests] remove deprecated tests for model loading (#29450 )	2024-03-15 14:18:41 +00:00