mirror of
https://github.com/huggingface/transformers.git
synced 2025-08-02 11:11:05 +06:00
![]() * Initial implementation of OffloadedCache * enable usage via cache_implementation * Address feedback, add tests, remove legacy methods. * Remove flash-attn, discover synchronization bugs, fix bugs * Prevent usage in CPU only mode * Add a section about offloaded KV cache to the docs * Fix typos in docs * Clarifications and better explanation of streams |
||
---|---|---|
.. | ||
__init__.py | ||
test_activations_tf.py | ||
test_activations.py | ||
test_add_new_model_like.py | ||
test_audio_utils.py | ||
test_backbone_utils.py | ||
test_cache_utils.py | ||
test_chat_template_utils.py | ||
test_cli.py | ||
test_configuration_utils.py | ||
test_convert_slow_tokenizer.py | ||
test_deprecation.py | ||
test_doc_samples.py | ||
test_dynamic_module_utils.py | ||
test_feature_extraction_utils.py | ||
test_file_utils.py | ||
test_generic.py | ||
test_hf_argparser.py | ||
test_hub_utils.py | ||
test_image_processing_utils.py | ||
test_image_utils.py | ||
test_logging.py | ||
test_model_card.py | ||
test_model_output.py | ||
test_modeling_flax_utils.py | ||
test_modeling_rope_utils.py | ||
test_modeling_tf_core.py | ||
test_modeling_tf_utils.py | ||
test_modeling_utils.py | ||
test_offline.py | ||
test_skip_decorators.py | ||
test_tokenization_utils.py | ||
test_versions_utils.py | ||
tiny_model_summary.json |