transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 12:50:06 +06:00

History

Pavel Iakubovskii 9bec2654ed Add V-JEPA for video classification model (#38788 ) * adding model and conversion scripts * add imports to test vjepa conversion * fix imports and make conversion work * fix computation for short side * replace attention with library attention function * cleanup more attention classes * remove config overrides * add test cases, fix some of the failing ones * fix the model outputs * fix outputs of the model per review * fix too big model test case * fix styling __init__.py * fix initialization test * remove all asserts per review * update sorting unsorting logic as per feedback * remove is_video per review * remove another is_video segment * remove unwanted stuff * small fixes * add docstrings for the model * revert adding vjepa2 config here * update styling * add config docstrings (wip) * fix dpr issue * removed test failing issues * update styles * merge predictor configs into main config * remove processing code, add video processor * remove permute which is not necessary now * fix styles * updated vjepa2 to be in video_processing_auto * update comment for preprocessing * test integration test and fix the outputs * update test values, change test to look at repeated frames for a given image * add a simple video processing test * refactoring pixel_values_videos and upload ckpts to original * fix torch_fx test cases * remove unused config * add all config docstrings * add more integration tests * add basic doc * revert unwanted styling changes * working make fixup * Fix model_type in config * Add ForVideoClassification model * update attention implementation to fit new hf standards * fix the preprocessing logic, ensure it matches the original model * remove use_rope logic, cleanup * fix docstrings * Further cleanup, update doc * Fix model prefix * fix get_vision_features * VJEPA2Embeddings style refactor * nit, style comment * change modules default values * Only `str` activation in config * GradientCheckpointingLayer * fixup * fix conversion script * Remove return_dict * remove None return typehint * Refactor VJEPA2Layer, remove use_SiLU * Fix fx tests * dpr -> drop_path_rates * move ModelOutput on top format docs bit * update docs * update docs * update doc example * remove prune_heads from model * remove unused config params * refactor embed signature * Add vjepa to docs * Fix config docstring * attention head * update defaults * Update docs/source/en/model_doc/vjepa2.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/model_doc/vjepa2.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Fix import * Min refactoring * Update HUB_SOURCE and HUB_REPO in conversion script * Add missing headers * VJEPA -> V-JEPA in docs * Add image to doc * fix style * fix init weights * change checkpoint name in modeling tests * Initial cls head setup * remove rop attention from head (not needed) * remove swigluffn - not needed * Add siglip layer * Replace with siglip layer * Rename Siglip - VJEPA2 * remove unused modules * remove siglip mlp * nit * remove MLP * Refactor head cross attention * refactor VJEPA2HeadCrossAttentionLayer * nit renaming * fixup * remove commented code * Add cls head params to config * depth from config * move pooler + classifier to the model * Update for cls model signature * move layers, rename a bit * fix docs * update weights init * remove typehint for init * add to auto-mapping * enable tests * Add conversion script * fixup * add to docs * fix docs * nit * refactor for mapping * clean * Add integration test * Fixing multi gpu test * update not-split-modules * update video cls test tolerance * Increase test_inference_image tolerance * Update no-split modules for multi gpu * Apply suggestions from code review * fixing multi-gpu * fix docstring * Add cls snippet to docs * Update checkpoint		2025-06-13 17:56:15 +01:00
..
bettertransformer	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
deepspeed	🚨 rm already deprecated pad_to_max_length arg (#37617 )	2025-05-01 15:21:55 +02:00
extended	Add Optional to remaining types (#37808 )	2025-04-28 14:20:45 +01:00
fixtures	Implementation of SuperPoint and AutoModelForKeypointDetection (#28966 )	2024-03-19 14:43:02 +00:00
fsdp	Fix the fsdp config cannot work issue. (#37549 )	2025-04-28 10:44:51 +02:00
generation	Remove all traces of `low_cpu_mem_usage` (#38792 )	2025-06-12 16:39:33 +02:00
models	Add V-JEPA for video classification model (#38788 )	2025-06-13 17:56:15 +01:00
optimization	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
peft_integration	FIX: Faulty PEFT tests (#37757 )	2025-04-28 15:10:46 +02:00
pipelines	Expectation fixes and added AMD expectations (#38729 )	2025-06-13 16:14:58 +02:00
quantization	Expectation fixes and added AMD expectations (#38729 )	2025-06-13 16:14:58 +02:00
repo_utils	Use HF papers (#38184 )	2025-06-13 11:07:09 +00:00
sagemaker	Deprecate TF + JAX (#38758 )	2025-06-11 17:28:06 +01:00
tensor_parallel	[TP] Change command in tests to `python3` (#38555 )	2025-06-03 11:03:33 +00:00
tokenization	Remove `isort` from dependencies (#38616 )	2025-06-05 16:42:49 +00:00
trainer	from 1.11.0, torchao.prototype.low_bit_optim is promoted to torchao.optim (#38689 )	2025-06-11 12:16:25 +00:00
utils	Expectation fixes and added AMD expectations (#38729 )	2025-06-13 16:14:58 +02:00
__init__.py	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
causal_lm_tester.py	Refactor DBRX tests to use CausalLMModelTest base classes (#38475 )	2025-06-13 16:22:12 +01:00
test_backbone_common.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_configuration_common.py	Update composition flag usage (#36263 )	2025-04-09 11:48:49 +02:00
test_feature_extraction_common.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_image_processing_common.py	enable more test cases on xpu (#38572 )	2025-06-06 09:29:51 +02:00
test_image_transforms.py	Fix `pad` image transform for batched inputs (#37544 )	2025-05-08 10:51:15 +01:00
test_modeling_common.py	Expectation fixes and added AMD expectations (#38729 )	2025-06-13 16:14:58 +02:00
test_pipeline_mixin.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_processing_common.py	[video processors] support frame sampling within processors (#38105 )	2025-06-12 09:34:30 +00:00
test_sequence_feature_extraction_common.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_tokenization_common.py	🚨 rm already deprecated pad_to_max_length arg (#37617 )	2025-05-01 15:21:55 +02:00
test_training_args.py	Fix `TrainingArguments.torch_empty_cache_steps` post_init check (#36734 )	2025-03-17 16:09:46 +01:00
test_video_processing_common.py	[video processors] support frame sampling within processors (#38105 )	2025-06-12 09:34:30 +00:00