transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-29 17:22:25 +06:00

History

Yoni Gozlan a245011252 Add InternVL (2.5 MPO) (#35968 ) * initial commit * add convert internvl * add first end-to-end working internvl * nit prompt and image proc * add working chat template * add conversion llama-based models * add tests * pass all tests * fix isort * fix modular after main merge * add video processing for internvl * add support for interlaced images and videos * Remove processing and config from modular, add more tests * add llama model tests * Modify processor for compatibility with refactored got ocr image processor * add comments in processor * Add docs and nits * change video processing to use custom sample_indices_fn * rebase and fix tests * add processor tests * Add changes Raushan review * Use the new attention interface for the vision model * nits * add support for custom video_load_backend * remove mention to InternVLTokenizer * refactor vision model to simplify logic * refactor processor for better readibility * fix copies * fix require av processor test * refactor internVL vision * Update processor and fix processing tests * fix docstring * update convert_weights for internvl3 * change image processor to fast by default * remove do_center_crop=True in convert_weights * force use_cache to True * push_to_hub before reloading * fix internVLVision for larger models * update convert weight for qk norm * fix convert_weights * fix eos_token_id in convert * update docs and integration tests * make modifs after review * fix wrong k_norm and reduce modular * change image_token_index to image_token_id * change checkpoint to OpenGVLab org * last nits * explicitely del self.num_key_value_groups * add extra special tokens		2025-04-18 18:57:33 +02:00
..
bettertransformer	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
deepspeed	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
extended	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
fixtures	Implementation of SuperPoint and AutoModelForKeypointDetection (#28966 )	2024-03-19 14:43:02 +00:00
fsdp	Remove old code for PyTorch, Accelerator and tokenizers (#37234 )	2025-04-10 20:54:21 +02:00
generation	Add InternVL (2.5 MPO) (#35968 )	2025-04-18 18:57:33 +02:00
models	Add InternVL (2.5 MPO) (#35968 )	2025-04-18 18:57:33 +02:00
optimization	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
peft_integration	enable several cases on XPU (#37516 )	2025-04-16 11:01:04 +02:00
pipelines	Simplify soft dependencies and update the dummy-creation process (#36827 )	2025-04-11 11:08:36 +02:00
quantization	Fix Quark quantization config (#37578 )	2025-04-18 07:23:39 +02:00
repo_utils	Simplify soft dependencies and update the dummy-creation process (#36827 )	2025-04-11 11:08:36 +02:00
sagemaker	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
tensor_parallel	enable tp on CPU (#36299 )	2025-03-31 10:55:47 +02:00
tokenization	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
trainer	add FlashAttentionKwargs and seq_idx to flat collator (#36456 )	2025-04-16 15:45:03 +02:00
utils	Model debugger upgrades (#37391 )	2025-04-18 16:45:54 +02:00
__init__.py	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
test_backbone_common.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_configuration_common.py	Update composition flag usage (#36263 )	2025-04-09 11:48:49 +02:00
test_feature_extraction_common.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_image_processing_common.py	Bridgetower fast image processor (#37373 )	2025-04-16 22:39:18 +02:00
test_image_transforms.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_modeling_common.py	Small fix on context manager detection (#37562 )	2025-04-17 15:39:44 +02:00
test_modeling_flax_common.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_modeling_tf_common.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_pipeline_mixin.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_processing_common.py	Add Qwen2.5-Omni (#36752 )	2025-04-14 12:36:41 +02:00
test_sequence_feature_extraction_common.py	Use Python 3.9 syntax in tests (#37343 )	2025-04-08 14:12:08 +02:00
test_tokenization_common.py	🚨 🚨 Allow saving and loading multiple "raw" chat template files (#36588 )	2025-04-11 16:37:23 +01:00
test_training_args.py	Fix `TrainingArguments.torch_empty_cache_steps` post_init check (#36734 )	2025-03-17 16:09:46 +01:00