transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-01 18:51:14 +06:00

Sandeep Yadav 18143c76bf Sandeepyadav1478/2025 06 19 deberta v2 model card update (#38895 )		2025-06-27 10:35:30 -07:00
* [docs]: update deberta-v2.md model card
* chore: req updates
* chore: address code review feedback and update docs
* chore: review feedback and updates
* chore: model selection updates
* chores: quantizations review updates
internal	Remove all traces of `low_cpu_mem_usage` (#38792 )	2025-06-12 16:39:33 +02:00
main_classes	Use HF papers (#38184 )	2025-06-13 11:07:09 +00:00
model_doc	Sandeepyadav1478/2025 06 19 deberta v2 model card update (#38895 )	2025-06-27 10:35:30 -07:00
quantization	Use HF papers (#38184 )	2025-06-13 11:07:09 +00:00
reference	Enhance Model Loading By Providing Parallelism, Uses Optional Env Flag (#36835 )	2025-05-23 16:39:47 +00:00
tasks	No more Tuple, List, Dict (#38797 )	2025-06-17 19:37:18 +01:00
_config.py
_redirects.yml
_toctree.yml	✨ Add EoMT Model || 🚨 Fix Mask2Former loss calculation (#37610 )	2025-06-27 14:18:18 +02:00
accelerate.md	change fsdp_strategy to fsdp in TrainingArguments in accelerate doc (#38807 )	2025-06-13 15:32:40 +00:00
accelerator_selection.md	[docs] add xpu environment variable for gpu selection (#38194 )	2025-05-30 16:05:07 +00:00
add_new_model.md	[docs] Model contribution (#38995 )	2025-06-26 12:25:14 -07:00
add_new_pipeline.md
agents.md
attention_interface.md	No more Tuple, List, Dict (#38797 )	2025-06-17 19:37:18 +01:00
auto_docstring.md	[docs] @auto_docstring (#39011 )	2025-06-26 14:21:54 -07:00
backbones.md
cache_explanation.md	[docs] Format fix (#38414 )	2025-06-03 09:53:23 -07:00
chat_extras.md
chat_templating_multimodal.md
chat_templating_writing.md
chat_templating.md
community.md
contributing.md
conversations.md	[`chat`] generate parameterization powered by `GenerationConfig` and UX-related changes (#38047 )	2025-05-12 14:04:41 +01:00
custom_models.md	No more Tuple, List, Dict (#38797 )	2025-06-17 19:37:18 +01:00
debugging.md
deepspeed.md
executorch.md
fast_tokenizers.md
feature_extractors.md
fsdp.md
generation_features.md
generation_strategies.md	Fix custom generate from local directory (#38916 )	2025-06-20 17:36:57 +01:00
gguf.md
glossary.md	Use HF papers (#38184 )	2025-06-13 11:07:09 +00:00
how_to_hack_models.md	[doc] fix bugs in `how_to_hack_models.md` (#38198 )	2025-05-19 10:37:54 -07:00
hpo_train.md	[Nit] Add Note on SigOpt being in Public Archive Mode (#38610 )	2025-06-05 14:07:23 -07:00
image_processors.md	🔴 Video processors as a separate class (#35206 )	2025-05-12 11:55:51 +02:00
index.md	[docs] Update docs moved to the course (#38800 )	2025-06-13 12:02:27 -07:00
installation.md
kv_cache.md	[docs] update cache docs with new info (#38775 )	2025-06-13 07:10:56 +00:00
llm_optims.md
llm_tutorial_optimization.md	Use HF papers (#38184 )	2025-06-13 11:07:09 +00:00
llm_tutorial.md	No more Tuple, List, Dict (#38797 )	2025-06-17 19:37:18 +01:00
model_memory_anatomy.md	Use HF papers (#38184 )	2025-06-13 11:07:09 +00:00
model_sharing.md
models.md	Fix grammatical error in models documentation (#39019 )	2025-06-25 14:55:22 +00:00
modular_transformers.md	[docs] @auto_docstring (#39011 )	2025-06-26 14:21:54 -07:00
notebooks.md
optimizers.md
pad_truncation.md
peft.md
perf_hardware.md
perf_infer_cpu.md	remove ipex_optimize_model usage (#38632 )	2025-06-06 20:04:44 +02:00
perf_infer_gpu_multi.md	[docs] Tensor parallelism (#38241 )	2025-06-26 14:40:45 -07:00
perf_infer_gpu_one.md	Small typo lines 47 and 199 perf_infer_gpu_one.md (#37938 )	2025-05-06 14:32:55 +01:00
perf_torch_compile.md
perf_train_cpu_many.md	remove ipex_optimize_model usage (#38632 )	2025-06-06 20:04:44 +02:00
perf_train_cpu.md	remove ipex_optimize_model usage (#38632 )	2025-06-06 20:04:44 +02:00
perf_train_gaudi.md
perf_train_gpu_many.md	[docs] Tensor parallelism (#38241 )	2025-06-26 14:40:45 -07:00
perf_train_gpu_one.md	[docs] Typos - Single GPU efficient training features (#38964 )	2025-06-23 12:33:10 -07:00
perf_train_special.md
perf_train_tpu_tf.md
perplexity.md
philosophy.md
pipeline_gradio.md
pipeline_tutorial.md
pipeline_webserver.md
pr_checks.md
processors.md	[docs] add Audio import (#38195 )	2025-05-19 13:16:35 +00:00
quicktour.md	Add Hugging Face authentication procedure for IDEs (PyCharm, VS Code,… (#38954 )	2025-06-24 11:48:15 -07:00
run_scripts.md
serialization.md
serving.md
testing.md	[tests] remove TF tests (uses of `require_tf`) (#38944 )	2025-06-25 17:29:10 +00:00
tf_xla.md
tflite.md
tokenizer_summary.md	Use HF papers (#38184 )	2025-06-13 11:07:09 +00:00
tools.md
torchscript.md	Fix wording in `torchscript.md` (#38004 )	2025-05-08 16:47:45 +01:00
trainer.md	feat: add flexible Liger Kernel configuration to TrainingArguments (#38911 )	2025-06-19 15:54:08 +00:00
training.md
troubleshooting.md
video_processors.md	🔴 Video processors as a separate class (#35206 )	2025-05-12 11:55:51 +02:00