transformers/docs/source/en
Sandeep Yadav 18143c76bf
Sandeepyadav1478/2025 06 19 deberta v2 model card update (#38895)
* [docs]: update deberta-v2.md model card

* chore: req updates

* chore: address code review feedback and update docs

* chore: review feedback and updates

* chore: model selection updates

* chores: quantizations review updates
2025-06-27 10:35:30 -07:00
internal Remove all traces of low_cpu_mem_usage (#38792) 2025-06-12 16:39:33 +02:00
main_classes Use HF papers (#38184) 2025-06-13 11:07:09 +00:00
model_doc Sandeepyadav1478/2025 06 19 deberta v2 model card update (#38895) 2025-06-27 10:35:30 -07:00
quantization Use HF papers (#38184) 2025-06-13 11:07:09 +00:00
reference Enhance Model Loading By Providing Parallelism, Uses Optional Env Flag (#36835) 2025-05-23 16:39:47 +00:00
tasks No more Tuple, List, Dict (#38797) 2025-06-17 19:37:18 +01:00
_config.py
_redirects.yml
_toctree.yml Add EoMT Model || 🚨 Fix Mask2Former loss calculation (#37610) 2025-06-27 14:18:18 +02:00
accelerate.md change fsdp_strategy to fsdp in TrainingArguments in accelerate doc (#38807) 2025-06-13 15:32:40 +00:00
accelerator_selection.md [docs] add xpu environment variable for gpu selection (#38194) 2025-05-30 16:05:07 +00:00
add_new_model.md [docs] Model contribution (#38995) 2025-06-26 12:25:14 -07:00
add_new_pipeline.md
agents.md
attention_interface.md No more Tuple, List, Dict (#38797) 2025-06-17 19:37:18 +01:00
auto_docstring.md [docs] @auto_docstring (#39011) 2025-06-26 14:21:54 -07:00
backbones.md
cache_explanation.md [docs] Format fix (#38414) 2025-06-03 09:53:23 -07:00
chat_extras.md
chat_templating_multimodal.md
chat_templating_writing.md
chat_templating.md
community.md
contributing.md
conversations.md [chat] generate parameterization powered by GenerationConfig and UX-related changes (#38047) 2025-05-12 14:04:41 +01:00
custom_models.md No more Tuple, List, Dict (#38797) 2025-06-17 19:37:18 +01:00
debugging.md
deepspeed.md
executorch.md
fast_tokenizers.md
feature_extractors.md
fsdp.md
generation_features.md
generation_strategies.md Fix custom generate from local directory (#38916) 2025-06-20 17:36:57 +01:00
gguf.md
glossary.md Use HF papers (#38184) 2025-06-13 11:07:09 +00:00
how_to_hack_models.md [doc] fix bugs in how_to_hack_models.md (#38198) 2025-05-19 10:37:54 -07:00
hpo_train.md [Nit] Add Note on SigOpt being in Public Archive Mode (#38610) 2025-06-05 14:07:23 -07:00
image_processors.md 🔴 Video processors as a separate class (#35206) 2025-05-12 11:55:51 +02:00
index.md [docs] Update docs moved to the course (#38800) 2025-06-13 12:02:27 -07:00
installation.md
kv_cache.md [docs] update cache docs with new info (#38775) 2025-06-13 07:10:56 +00:00
llm_optims.md
llm_tutorial_optimization.md Use HF papers (#38184) 2025-06-13 11:07:09 +00:00
llm_tutorial.md No more Tuple, List, Dict (#38797) 2025-06-17 19:37:18 +01:00
model_memory_anatomy.md Use HF papers (#38184) 2025-06-13 11:07:09 +00:00
model_sharing.md
models.md Fix grammatical error in models documentation (#39019) 2025-06-25 14:55:22 +00:00
modular_transformers.md [docs] @auto_docstring (#39011) 2025-06-26 14:21:54 -07:00
notebooks.md
optimizers.md
pad_truncation.md
peft.md
perf_hardware.md
perf_infer_cpu.md remove ipex_optimize_model usage (#38632) 2025-06-06 20:04:44 +02:00
perf_infer_gpu_multi.md [docs] Tensor parallelism (#38241) 2025-06-26 14:40:45 -07:00
perf_infer_gpu_one.md Small typo lines 47 and 199 perf_infer_gpu_one.md (#37938) 2025-05-06 14:32:55 +01:00
perf_torch_compile.md
perf_train_cpu_many.md remove ipex_optimize_model usage (#38632) 2025-06-06 20:04:44 +02:00
perf_train_cpu.md remove ipex_optimize_model usage (#38632) 2025-06-06 20:04:44 +02:00
perf_train_gaudi.md
perf_train_gpu_many.md [docs] Tensor parallelism (#38241) 2025-06-26 14:40:45 -07:00
perf_train_gpu_one.md [docs] Typos - Single GPU efficient training features (#38964) 2025-06-23 12:33:10 -07:00
perf_train_special.md
perf_train_tpu_tf.md
perplexity.md
philosophy.md
pipeline_gradio.md
pipeline_tutorial.md
pipeline_webserver.md
pr_checks.md
processors.md [docs] add Audio import (#38195) 2025-05-19 13:16:35 +00:00
quicktour.md Add Hugging Face authentication procedure for IDEs (PyCharm, VS Code,… (#38954) 2025-06-24 11:48:15 -07:00
run_scripts.md
serialization.md
serving.md
testing.md [tests] remove TF tests (uses of require_tf) (#38944) 2025-06-25 17:29:10 +00:00
tf_xla.md
tflite.md
tokenizer_summary.md Use HF papers (#38184) 2025-06-13 11:07:09 +00:00
tools.md
torchscript.md Fix wording in torchscript.md (#38004) 2025-05-08 16:47:45 +01:00
trainer.md feat: add flexible Liger Kernel configuration to TrainingArguments (#38911) 2025-06-19 15:54:08 +00:00
training.md
troubleshooting.md
video_processors.md 🔴 Video processors as a separate class (#35206) 2025-05-12 11:55:51 +02:00