Mirror of https://github.com/huggingface/transformers.git (synced 2025-07-29 09:12:21 +06:00)
Squashed commit history for adding the V-JEPA 2 model:

* adding model and conversion scripts
* add imports to test vjepa conversion
* fix imports and make conversion work
* fix computation for short side
* replace attention with library attention function
* cleanup more attention classes
* remove config overrides
* add test cases, fix some of the failing ones
* fix the model outputs
* fix outputs of the model per review
* fix too big model test case
* fix styling __init__.py
* fix initialization test
* remove all asserts per review
* update sorting unsorting logic as per feedback
* remove is_video per review
* remove another is_video segment
* remove unwanted stuff
* small fixes
* add docstrings for the model
* revert adding vjepa2 config here
* update styling
* add config docstrings (wip)
* fix dpr issue
* removed test failing issues
* update styles
* merge predictor configs into main config
* remove processing code, add video processor
* remove permute which is not necessary now
* fix styles
* updated vjepa2 to be in video_processing_auto
* update comment for preprocessing
* test integration test and fix the outputs
* update test values, change test to look at repeated frames for a given image
* add a simple video processing test
* refactoring pixel_values_videos and upload ckpts to original
* fix torch_fx test cases
* remove unused config
* add all config docstrings
* add more integration tests
* add basic doc
* revert unwanted styling changes
* working make fixup
* Fix model_type in config
* Add ForVideoClassification model
* update attention implementation to fit new hf standards
* fix the preprocessing logic, ensure it matches the original model
* remove use_rope logic, cleanup
* fix docstrings
* Further cleanup, update doc
* Fix model prefix
* fix get_vision_features
* VJEPA2Embeddings style refactor
* nit, style comment
* change modules default values
* Only `str` activation in config
* GradientCheckpointingLayer
* fixup
* fix conversion script
* Remove return_dict
* remove None return typehint
* Refactor VJEPA2Layer, remove use_SiLU
* Fix fx tests
* dpr -> drop_path_rates
* move *ModelOutput on top
* format docs bit
* update docs
* update docs
* update doc example
* remove prune_heads from model
* remove unused config params
* refactor embed signature
* Add vjepa to docs
* Fix config docstring
* attention head
* update defaults
* Update docs/source/en/model_doc/vjepa2.md (Co-authored-by: Pedro Cuenca <pedro@huggingface.co>)
* Update docs/source/en/model_doc/vjepa2.md (Co-authored-by: Pedro Cuenca <pedro@huggingface.co>)
* Fix import
* Min refactoring
* Update HUB_SOURCE and HUB_REPO in conversion script
* Add missing headers
* VJEPA -> V-JEPA in docs
* Add image to doc
* fix style
* fix init weights
* change checkpoint name in modeling tests
* Initial cls head setup
* remove rop attention from head (not needed)
* remove swigluffn - not needed
* Add siglip layer
* Replace with siglip layer
* Rename Siglip - VJEPA2
* remove unused modules
* remove siglip mlp
* nit
* remove MLP
* Refactor head cross attention
* refactor VJEPA2HeadCrossAttentionLayer
* nit renaming
* fixup
* remove commented code
* Add cls head params to config
* depth from config
* move pooler + classifier to the model
* Update for cls model signature
* move layers, rename a bit
* fix docs
* update weights init
* remove typehint for init
* add to auto-mapping
* enable tests
* Add conversion script
* fixup
* add to docs
* fix docs
* nit
* refactor for mapping
* clean
* Add integration test
* Fixing multi gpu test
* update not-split-modules
* update video cls test tolerance
* Increase test_inference_image tolerance
* Update no-split modules for multi gpu
* Apply suggestions from code review
* fixing multi-gpu
* fix docstring
* Add cls snippet to docs
* Update checkpoint
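The squash above adds the V-JEPA 2 backbone, a video processor registered in `video_processing_auto`, and a `VJEPA2ForVideoClassification` head. As a rough orientation, here is a minimal sketch of how those pieces fit together; the checkpoint ids are assumptions (check the Hub for the real ones), and the processor call follows the generic video-processor signature rather than anything verified against this PR.

```python
# Minimal sketch, assuming the classes added in this PR and hypothetical Hub ids.
import torch
from transformers import AutoModel, AutoVideoProcessor, VJEPA2ForVideoClassification

backbone_id = "facebook/vjepa2-vitl-fpc64-256"  # assumed checkpoint id
processor = AutoVideoProcessor.from_pretrained(backbone_id)
model = AutoModel.from_pretrained(backbone_id)

# A dummy clip: (frames, channels, height, width) with uint8 pixel values.
video = torch.randint(0, 256, (64, 3, 256, 256), dtype=torch.uint8)
inputs = processor(video, return_tensors="pt")  # yields pixel_values_videos

with torch.no_grad():
    features = model(**inputs).last_hidden_state  # patch-level video features

# Video classification with the head added later in the PR.
cls_id = "facebook/vjepa2-vitl-fpc16-256-ssv2"  # assumed checkpoint id
cls_model = VJEPA2ForVideoClassification.from_pretrained(cls_id)
cls_processor = AutoVideoProcessor.from_pretrained(cls_id)
cls_inputs = cls_processor(video[:16], return_tensors="pt")  # fpc16: 16-frame clips assumed
with torch.no_grad():
    logits = cls_model(**cls_inputs).logits
print(cls_model.config.id2label[logits.argmax(-1).item()])
```

The `not-split-modules` updates in the log matter only for multi-GPU sharding via `device_map`; the sketch above runs on a single device.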
Contents of `docs/source/en`:
* internal/
* main_classes/
* model_doc/
* quantization/
* reference/
* tasks/
* _config.py
* _redirects.yml
* _toctree.yml
* accelerate.md
* accelerator_selection.md
* add_new_model.md
* add_new_pipeline.md
* agents.md
* attention_interface.md
* attention.md
* auto_docstring.md
* backbones.md
* cache_explanation.md
* chat_extras.md
* chat_templating_multimodal.md
* chat_templating_writing.md
* chat_templating.md
* community.md
* contributing.md
* conversations.md
* custom_models.md
* debugging.md
* deepspeed.md
* executorch.md
* fast_tokenizers.md
* feature_extractors.md
* fsdp.md
* generation_features.md
* generation_strategies.md
* gguf.md
* glossary.md
* how_to_hack_models.md
* hpo_train.md
* image_processors.md
* index.md
* installation.md
* kv_cache.md
* llm_optims.md
* llm_tutorial_optimization.md
* llm_tutorial.md
* model_memory_anatomy.md
* model_sharing.md
* model_summary.md
* models.md
* modular_transformers.md
* notebooks.md
* optimizers.md
* pad_truncation.md
* peft.md
* perf_hardware.md
* perf_infer_cpu.md
* perf_infer_gpu_multi.md
* perf_infer_gpu_one.md
* perf_torch_compile.md
* perf_train_cpu_many.md
* perf_train_cpu.md
* perf_train_gaudi.md
* perf_train_gpu_many.md
* perf_train_gpu_one.md
* perf_train_special.md
* perf_train_tpu_tf.md
* perplexity.md
* philosophy.md
* pipeline_gradio.md
* pipeline_tutorial.md
* pipeline_webserver.md
* pr_checks.md
* processors.md
* quicktour.md
* run_scripts.md
* serialization.md
* serving.md
* task_summary.md
* tasks_explained.md
* testing.md
* tf_xla.md
* tflite.md
* tokenizer_summary.md
* tools.md
* torchscript.md
* trainer.md
* training.md
* troubleshooting.md
* video_processors.md