transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-13 09:40:06 +06:00

Vladislav Bronzov 4abeb50f6e Add D-FINE Model into Transformers (#36261 )	2025-04-29 12:17:55 +01:00

* copy the last changes from broken PR
* small format
* some fixes and refactoring after review
* format
* add config attr for loss
* some fixes and refactoring
* fix copies
* fix style
* add test for d-fine resnet
* fix decoder layer prop
* fix dummies
* format init
* remove extra print
* refactor modeling, move resnet into separate folder
* fix resnet config
* change resnet on hgnet_v2, add clamp into decoder
* fix init
* fix config doc
* fix init
* fix dummies
* fix config docs
* fix hgnet_v2 config typo
* format modular
* add image classification for hgnet, some refactoring
* format tests
* fix dummies
* fix init
* fix style
* fix init for hgnet v2
* fix index.md, add init rnage for hgnet
* fix conversion
* add missing attr to encoder
* add loss for d-fine, add additional output for rt-detr decoder
* tests and docs fixes
* fix rt_detr v2 conversion
* some fixes for loos and decoder output
* some fixes for loss
* small fix for converted modeling
* add n model config, some todo comments for modular
* convert script adjustments and fixes, small refact
* remove extra output for rt_detr
* make some outputs optionsl, fix conversion
* some posr merge fixes
* small fix
* last field fix
* fix not split for hgnet_v2
* disable parallelism test for hgnet_v2 image classification
* skip multi gpu for d-fine
* adjust after merge init
* remove extra comment
* fix repo name references
* small fixes for tests
* Fix checkpoint path
* Fix consistency
* Fixing docs

---------

Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>

internal	Introduce GradientCheckpointingLayer (#37223 )	2025-04-22 11:33:31 +01:00
main_classes	Add Bitnet model (#37742 )	2025-04-28 15:08:46 +02:00
model_doc	Add D-FINE Model into Transformers (#36261 )	2025-04-29 12:17:55 +01:00
quantization	Fix auto-round hfoption (#37759 )	2025-04-24 18:19:38 +02:00
tasks	Process inputs directly in apply_chat_template in image-text-to-text pipeline (#35616 )	2025-04-23 13:31:33 -04:00
_config.py	Add optimized `PixtralImageProcessorFast` (#34836 )	2024-11-28 16:04:05 +01:00
_redirects.yml	Docs / Quantization: Redirect deleted page (#31063 )	2024-05-28 18:29:22 +02:00
_toctree.yml	Add D-FINE Model into Transformers (#36261 )	2025-04-29 12:17:55 +01:00
accelerate.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
add_new_model.md	Add support for fast image processors in add-new-model-like CLI (#36313 )	2025-03-13 14:16:37 -04:00
add_new_pipeline.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
agents.md	[agents] remove agents 🧹 (#37368 )	2025-04-11 18:42:37 +01:00
attention_interface.md	Fix AttentionInterface following feedback (#37010 )	2025-03-28 18:00:35 +01:00
attention.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
backbones.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
cache_explanation.md	Fix typos (#36910 )	2025-03-24 14:08:29 +00:00
chat_extras.md	Update chat_extras.md with content correction (#36599 )	2025-03-07 13:09:02 +00:00
chat_templating_multimodal.md	[chat-template] Unify tests and clean up 🧼 (#37275 )	2025-04-10 14:42:32 +02:00
chat_templating_writing.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
chat_templating.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
community.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
contributing.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
conversations.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
custom_models.md	Fix typos (#36910 )	2025-03-24 14:08:29 +00:00
debugging.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
deepspeed.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
executorch.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
fast_tokenizers.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
feature_extractors.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
fsdp.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
generation_features.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
generation_strategies.md	Fixing the example in generation strategy doc (#37598 )	2025-04-18 12:50:17 -07:00
gguf.md	Fix gguf docs (#36601 )	2025-03-11 15:29:14 +01:00
glossary.md	Fix typos (#31819 )	2024-07-08 11:52:47 +01:00
gpu_selection.md	Fix typos (#36910 )	2025-03-24 14:08:29 +00:00
how_to_hack_models.md	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
hpo_train.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
image_processors.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
index.md	Adding Qwen3 and Qwen3MoE (#36878 )	2025-03-31 09:50:49 +02:00
installation.md	byebye torch 2.0 (#37277 )	2025-04-07 15:19:47 +02:00
kv_cache.md	fix link in kv_cache.md (#37652 )	2025-04-21 09:01:11 -07:00
llm_optims.md	[CI] green llama tests (#37244 )	2025-04-03 14:15:53 +01:00
llm_tutorial_optimization.md	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
llm_tutorial.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
model_memory_anatomy.md	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
model_sharing.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
model_summary.md	model_summary.md - Restore link to Harvard's Annotated Transformer. (#29702 )	2024-03-23 18:29:39 -07:00
models.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
modular_transformers.md	Support custom dosctrings in modular (#36726 )	2025-03-18 14:00:54 -04:00
notebooks.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
optimizers.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pad_truncation.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
peft.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_hardware.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
perf_infer_cpu.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_infer_gpu_multi.md	enable tp on CPU (#36299 )	2025-03-31 10:55:47 +02:00
perf_infer_gpu_one.md	Add InternVL (2.5 MPO) (#35968 )	2025-04-18 18:57:33 +02:00
perf_torch_compile.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_cpu_many.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_cpu.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_gpu_many.md	docs: fix typo (#37567 )	2025-04-17 14:54:44 +01:00
perf_train_gpu_one.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_special.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_tpu_tf.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perplexity.md	[docs] use device-agnostic API instead of cuda (#34913 )	2024-11-26 09:23:34 -08:00
philosophy.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
pipeline_gradio.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pipeline_tutorial.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
pipeline_webserver.md	fix and enhance pipeline_webserver.md (#36992 )	2025-04-15 08:35:05 -07:00
pr_checks.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
processors.md	[docs] Fix image link (#36869 )	2025-03-25 11:34:21 -07:00
quicktour.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
run_scripts.md	Remove research projects (#36645 )	2025-03-11 13:47:38 +00:00
serialization.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
serving.md	[docs] Serving LLMs (#36522 )	2025-03-10 13:14:19 -07:00
task_summary.md	[doctest] Fixes (#35863 )	2025-01-26 15:26:38 -08:00
tasks_explained.md	fix: Wrong task mentioned in docs (#34757 )	2024-11-18 18:42:28 +00:00
testing.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
tf_xla.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
tflite.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
tokenizer_summary.md	[docs] Spanish translation of tokenizer_summary.md (#31154 )	2024-06-03 16:52:23 -07:00
tools.md	[agents] remove agents 🧹 (#37368 )	2025-04-11 18:42:37 +01:00
torchscript.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
trainer.md	(Part 2) feat: allow for tp_size attr for tplizing the model (#37054 )	2025-04-10 17:44:09 +02:00
training.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
troubleshooting.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00