Mirror of https://github.com/huggingface/transformers.git
Synced 2025-07-03 12:50:06 +06:00
* initial GPT2 changes
* causal_mask support
* return_legacy_cache
* cleanup
* fix1
* outputs shape fixes
* gpt2 return fix
* pkv, attn fixes
* fix dual_head
* is_causal arg fix
* decision transformer updated
* style fix
* batch_size from inputs_embeds
* DecisionTransformerModel fixes
* cross-attn support + cache warning
* x-attn @decision
* EDCache proper init
* simplified logic in `if use_cache:` for GPT2Model
* @deprecate_kwarg for DecisionTr attn fwd
* @deprecate_kwarg in gpt2
* deprecation version updated to 4.51
* kwargs in gradient_checkpointing_fn
* rename next_cache to past_key_values
* attention_mask prep
* +cache_position in GPT2DoubleHeadsModel
* undo kwargs in gradient checkpointing
* moved up `if self.gradient_checkpointing`
* consistency in decision_transformer
* pastkv, cache_pos in grad_checkpt args
* rm _reorder_cache
* output_attentions streamlined
* decision_transformer consistency
* return_legacy_cache improved
* ClvpForCausalLM used for legacy cache test now
* is_causal fixed
* attn_output cleanup
* consistency @ decision_transformer
* Updated deprecation notice version to 4.52
* upd deprecation
* consistent legacy cache code in decision transformers
* next_cache -> past_kv in decision_tr
* cache support flags in decision_transf
* rm legacy cache warning
* consistency in cache init for decision transf
* no Static Cache for Decision Transformer

Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>

Name | ||
---|---|---|
import_structures | ||
__init__.py | ||
test_activations_tf.py | ||
test_activations.py | ||
test_add_new_model_like.py | ||
test_audio_utils.py | ||
test_backbone_utils.py | ||
test_cache_utils.py | ||
test_chat_template_utils.py | ||
test_cli.py | ||
test_configuration_utils.py | ||
test_convert_slow_tokenizer.py | ||
test_deprecation.py | ||
test_doc_samples.py | ||
test_dynamic_module_utils.py | ||
test_expectations.py | ||
test_feature_extraction_utils.py | ||
test_file_utils.py | ||
test_generic.py | ||
test_hf_argparser.py | ||
test_hub_utils.py | ||
test_image_processing_utils.py | ||
test_image_utils.py | ||
test_import_structure.py | ||
test_import_utils.py | ||
test_logging.py | ||
test_model_card.py | ||
test_model_debugging_utils.py | ||
test_model_output.py | ||
test_modeling_flax_utils.py | ||
test_modeling_rope_utils.py | ||
test_modeling_tf_core.py | ||
test_modeling_tf_utils.py | ||
test_modeling_utils.py | ||
test_offline.py | ||
test_processing_utils.py | ||
test_skip_decorators.py | ||
test_tokenization_utils.py | ||
test_versions_utils.py | ||
tiny_model_summary.json |