transformers/tests
StevenBucaille e5a9ce48f7
Add LightGlue model (#31718)
* init

* chore: various changes to LightGlue

* chore: various changes to LightGlue

* chore: various changes to LightGlue

* chore: various changes to LightGlue

* Fixed dynamo bug and image padding tests

* refactor: applied the refactoring changes from SuperGlue's concat, batch and stack functions to the LightGlue file

* tests: removed SDPA support and changed expected values

* chore: added some docs and refactoring

* chore: fixed `Copied from` reference to superpoint.image_processing_superpoint.convert_to_grayscale

* feat: adding batch implementation

* feat: added validation for preprocess and post process method to LightGlueImageProcessor

* chore: changed convert_lightglue_to_hf script to comply with new standard

* chore: changed lightglue test values to match new lightglue config pushed to hub

* chore: simplified convert_lightglue_to_hf conversion map

* feat: adding batching implementation

* chore: make style

* feat: added threshold to post_process_keypoint_matching method
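
For context, a minimal usage sketch of the thresholded post-processing (the checkpoint name matches the one referenced later in this log; the file names and the 0.2 threshold are illustrative, not recommended defaults):

```python
from PIL import Image
from transformers import AutoImageProcessor, AutoModel

# Any image pair works; file names are placeholders.
images = [Image.open("image0.jpg"), Image.open("image1.jpg")]

processor = AutoImageProcessor.from_pretrained("ETH-CVG/lightglue_superpoint")
model = AutoModel.from_pretrained("ETH-CVG/lightglue_superpoint")

inputs = processor(images, return_tensors="pt")
outputs = model(**inputs)

# Matches whose score falls below `threshold` are reported as unmatched.
image_sizes = [[(image.height, image.width) for image in images]]
results = processor.post_process_keypoint_matching(outputs, image_sizes, threshold=0.2)
```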

* fix: added missing instructions that turn keypoints back to absolute coordinates before the matching forward pass

* fix: added typehint and docs

* chore: make style

* [run-slow] lightglue

* fix: only count matches different from -1 as valid matches in post_process_keypoint_matching
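
A sketch of the -1 sentinel filtering this fix refers to (tensor values are illustrative):

```python
import torch

# matches[i] is the index of the matched keypoint in the other image, or -1 if unmatched
matches = torch.tensor([3, -1, 0, -1, 2])
matching_scores = torch.tensor([0.9, 0.1, 0.8, 0.05, 0.7])

valid = matches != -1  # only indices different from -1 count as valid matches
matched_indices = matches[valid]
matched_scores = matching_scores[valid]
```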

* tests: added CUDA-proof tests similar to SuperGlue's

* chore: various changes to modeling_lightglue.py

- Added "Copies from" statements for copied functions from modeling_superglue.py
- Added missing docstrings
- Removed unused functions or classes
- Removed unnecessary statements
- Added missing typehints
- Added comments to the main forward method

* chore: various changes to convert_lightglue_to_hf.py

- Added model saving
- Added model reloading

* chore: fixed imports in lightglue files

* [run-slow] lightglue

* chore: make style

* [run-slow] lightglue

* Apply suggestions from code review

Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>

* [run-slow] lightglue

* chore: Applied some suggestions from review

- Added missing typehints
- Refactor "cuda" to device variable
- Variable renaming
- LightGlue output order changed
- Make style

* fix: added missing grayscale argument in the image processor for when the SuperPoint keypoint detector is used

* fix: changed lightglue HF repo to lightglue_superpoint with grayscale default to True

* refactor: keep keypoints shaped `(batch_size, num_keypoints, keypoint_dim)` throughout forward and unsqueeze only before the attention layer
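
A sketch of the shape convention adopted here (dimension names follow the commit message; the exact placement and purpose of the unsqueeze before attention is illustrative):

```python
import torch

batch_size, num_keypoints, keypoint_dim = 2, 128, 2
# Keypoints stay 3D through the whole forward pass...
keypoints = torch.rand(batch_size, num_keypoints, keypoint_dim)

# ...and only gain an extra dimension right before the attention layer,
# e.g. to broadcast against per-head tensors (illustrative placement).
keypoints_for_attention = keypoints.unsqueeze(1)  # (batch_size, 1, num_keypoints, keypoint_dim)
```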

* refactor: refactor do_layer_keypoint_pruning

* tests: added tests with early stopping disabled and with keypoint pruning

* refactor: various refactoring to modeling_lightglue.py

- Removed unused functions
- Renamed variables for consistency
- Added comments for clarity
- Set methods to private in LightGlueForKeypointMatching
- Replaced tensor initialization with list building followed by concatenation
- Used more Pythonic list comprehensions for repetitive instructions

* refactor: added comments and renamed filter_matches to get_matches_from_scores

* tests: added `Copied from` statements referencing the SuperGlue tests

* docs: added comment to prepare_keypoint_matching_output function in tests

* [run-slow] lightglue

* refactor: reordered _concat_early_stopped_outputs in LightGlue class

* [run-slow] lightglue

* docs: added lightglue.md model doc

* docs: added Optional typehint to LightGlueKeypointMatchingOutput

* chore: removed pad_images function

* chore: set do_grayscale default value to True in LightGlueImageProcessor
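
With the new default, grayscale conversion no longer has to be requested explicitly; shown here anyway for clarity (a sketch, assuming the other constructor keywords are unchanged):

```python
from transformers import LightGlueImageProcessor

# do_grayscale defaults to True because the bundled SuperPoint keypoint
# detector expects grayscale input; passing it explicitly is now redundant.
processor = LightGlueImageProcessor(do_grayscale=True)
```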

* Apply suggestions from code review

Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>

* Apply suggestions from code review

Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>

* docs: added missing LightGlueConfig typehint in nn.Module __init__ methods

* docs: removed unnecessary code in docs

* docs: import SuperPointConfig only from a TYPE_CHECKING context

* chore: use PretrainedConfig arguments `num_hidden_layers` and `num_attention_heads` instead of `num_layers` and `num_heads`

* chore: added organization as arg in convert_lightglue_to_hf.py script

* refactor: set device variable

* chore: added "gelu" in LightGlueConfig as hidden_act parameter

* docs: added comments to the reshape-flip-reshape instructions used to perform cross-attention

* refactor: used batched inference for keypoint detector forward pass

* fix: added fix for SDPA tests

* docs: fixed docstring for LightGlueImageProcessor

* [run-slow] lightglue

* refactor: removed unused line

* refactor: added missing arguments in LightGlueConfig init method

* docs: added missing LightGlueConfig typehint in init methods

* refactor: added the checkpoint URL as a default variable so that model outputs are verified only when converting from the default URL

* fix: moved print message inside if statement

* fix: added `log_assignment` removal in the convert script

* fix: got rid of confidence_thresholds as registered buffers

* refactor: applied suggestions from SuperGlue PR

* docs: changed copyright to 2025

* refactor: modular LightGlue

* fix: removed unnecessary import

* feat: added plot_keypoint_matching method to LightGlueImageProcessor with matplotlib soft dependency
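
The soft-dependency pattern behind this, sketched as a standalone helper (a simplified stand-in for the real `plot_keypoint_matching` method, not its actual implementation; it assumes both images have the same height):

```python
import numpy as np

def plot_matches_sketch(image0, image1, keypoints0, keypoints1, scores):
    """Illustrative stand-in for LightGlueImageProcessor.plot_keypoint_matching."""
    try:
        # Soft dependency: import lazily so transformers itself does not require matplotlib.
        import matplotlib.pyplot as plt
    except ImportError:
        raise ImportError("Please install matplotlib to plot keypoint matches.")
    canvas = np.concatenate([np.asarray(image0), np.asarray(image1)], axis=1)
    offset = np.asarray(image0).shape[1]
    plt.imshow(canvas)
    for (x0, y0), (x1, y1), score in zip(keypoints0, keypoints1, scores):
        # One line per match, fading low-confidence matches.
        plt.plot([x0, x1 + offset], [y0, y1], linewidth=0.5, alpha=float(score))
    plt.axis("off")
    plt.show()
```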

* fix: added missing import error for matplotlib

* Updated convert script to push to the ETH org

* fix: added missing license

* fix: make fix-copies

* refactor: use cohere apply_rotary_pos_emb function
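
For reference, the shared rotary helper is called with the usual `(q, k, cos, sin)` signature (a usage sketch with illustrative shapes; it assumes the helper stays importable from modeling_cohere):

```python
import torch
from transformers.models.cohere.modeling_cohere import apply_rotary_pos_emb

batch, heads, seq_len, head_dim = 1, 4, 128, 64  # illustrative shapes
q = torch.rand(batch, heads, seq_len, head_dim)
k = torch.rand(batch, heads, seq_len, head_dim)
cos = torch.rand(batch, seq_len, head_dim)
sin = torch.rand(batch, seq_len, head_dim)

# Rotates q and k by the position-dependent angles encoded in cos/sin.
q_rot, k_rot = apply_rotary_pos_emb(q, k, cos, sin)
```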

* fix: update model references to use ETH-CVG/lightglue_superpoint

* refactor: add and use intermediate_size attribute in config to inherit CLIPMLP for LightGlueMLP

* refactor: explicit variables instead of slicing

* refactor: use can_return_tuple decorator in LightGlue model
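
The idea behind `can_return_tuple`, sketched as a simplified standalone decorator (the real utility lives in `transformers.utils` and also consults `config.use_return_dict`; this illustration only handles an explicit `return_dict` argument):

```python
import functools

def can_return_tuple_sketch(forward):
    # Run the forward pass, then convert the ModelOutput dataclass to a
    # plain tuple when the caller explicitly asked for one.
    @functools.wraps(forward)
    def wrapper(self, *args, return_dict=None, **kwargs):
        output = forward(self, *args, **kwargs)
        if return_dict is False:
            return output.to_tuple()
        return output
    return wrapper
```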

* fix: make fix-copies

* docs: Update model references in `lightglue.md` to use the correct pretrained model from ETH-CVG

* Refactor LightGlue configuration and processing classes

- Updated type hints for `keypoint_detector_config` in `LightGlueConfig` to use `SuperPointConfig` directly.
- Changed `size` parameter in `LightGlueImageProcessor` to be optional.
- Modified `position_embeddings` in `LightGlueAttention` and `LightGlueAttentionBlock` to be optional tuples.
- Cleaned up import statements across multiple files for better readability and consistency.

* refactor: Update LightGlue configuration to enforce eager attention implementation

- Added `attn_implementation="eager"` to `keypoint_detector_config` in `LightGlueConfig` and `LightGlueAttention` classes.
- Removed unnecessary logging related to attention implementation fallback.
- Cleaned up import statements for better readability.
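
What pinning eager attention looks like at the config level (a sketch mirroring the commit description; passing `attn_implementation` through a config constructor is standard PretrainedConfig behavior):

```python
from transformers import LightGlueConfig, SuperPointConfig

# SuperPoint only implements eager attention, so the detector config pins it
# explicitly instead of relying on a runtime fallback (and its logging).
keypoint_detector_config = SuperPointConfig(attn_implementation="eager")
config = LightGlueConfig(keypoint_detector_config=keypoint_detector_config)
```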

* refactor: renamed `message` to `attention_output`

* fix: ensure device compatibility in LightGlueMatchAssignmentLayer descriptor normalization

- Updated the normalization of `m_descriptors` to use the correct device for the tensor, ensuring compatibility across different hardware setups.
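
A sketch of the device-safe normalization described above (tensor names follow the commit; the shapes and the exact scaling expression are illustrative):

```python
import torch

m_descriptors = torch.rand(2, 128, 256)  # (batch, keypoints, descriptor_dim), illustrative
descriptor_dim = m_descriptors.shape[-1]

# Build the scaling factor on the tensor's own device and dtype instead of
# implicitly on CPU, so the division also works on CUDA/XPU.
scale = torch.tensor(
    descriptor_dim, device=m_descriptors.device, dtype=m_descriptors.dtype
) ** 0.25
m_descriptors = m_descriptors / scale
```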

* refactor: removed Conv layers from init_weights since LightGlue doesn't have any

* refactor: replace add_start_docstrings with auto_docstring in LightGlue models

- Updated LightGlue model classes to utilize the new auto_docstring utility for automatic documentation generation.
- Removed legacy docstring handling to streamline the code and improve maintainability.

* refactor: simplify LightGlue image processing tests by inheriting from SuperGlue

- Refactored `LightGlueImageProcessingTester` and `LightGlueImageProcessingTest` to inherit from their SuperGlue counterparts, reducing code duplication.
- Removed redundant methods and properties, streamlining the test setup and improving maintainability.

* test: forced eager attention implementation to LightGlue model tests

- Updated `LightGlueModelTester` to include `attn_implementation="eager"` in the model configuration.
- This change aligns the test setup with the recent updates in LightGlue configuration for eager attention.

* refactor: update LightGlue model references

* fix: import error

* test: enhance LightGlue image processing tests with setup method

- Added a setup method in `LightGlueImageProcessingTest` to initialize `LightGlueImageProcessingTester`.
- Included a docstring for `LightGlueImageProcessingTester` to clarify its purpose.

* refactor: added LightGlue image processing implementation to modular file

* refactor: moved attention blocks into the transformer layer

* fix: added missing import

* fix: added missing import in __all__ variable

* doc: added comment about enforcing eager attention because of SuperPoint

* refactor: added SuperPoint eager attention comment and moved functions to the closest they are used

---------

Co-authored-by: Steven Bucaille <steven.bucaille@buawei.com>
Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>
2025-06-17 18:10:23 +02:00
| Name | Last commit | Last updated |
|------|-------------|--------------|
| bettertransformer | Use Python 3.9 syntax in tests (#37343) | 2025-04-08 14:12:08 +02:00 |
| deepspeed | 🚨 rm already deprecated pad_to_max_length arg (#37617) | 2025-05-01 15:21:55 +02:00 |
| extended | Add Optional to remaining types (#37808) | 2025-04-28 14:20:45 +01:00 |
| fixtures | Implementation of SuperPoint and AutoModelForKeypointDetection (#28966) | 2024-03-19 14:43:02 +00:00 |
| fsdp | Fix the fsdp config cannot work issue. (#37549) | 2025-04-28 10:44:51 +02:00 |
| generation | Remove all traces of low_cpu_mem_usage (#38792) | 2025-06-12 16:39:33 +02:00 |
| models | Add LightGlue model (#31718) | 2025-06-17 18:10:23 +02:00 |
| optimization | Use Python 3.9 syntax in tests (#37343) | 2025-04-08 14:12:08 +02:00 |
| peft_integration | FIX: Faulty PEFT tests (#37757) | 2025-04-28 15:10:46 +02:00 |
| pipelines | [BugFix] QA pipeline edge case: align_to_words=True in QuestionAnsweringPipeline can lead to duplicate answers (#38761) | 2025-06-16 15:01:22 +00:00 |
| quantization | Expectation fixes and added AMD expectations (#38729) | 2025-06-13 16:14:58 +02:00 |
| repo_utils | Use HF papers (#38184) | 2025-06-13 11:07:09 +00:00 |
| sagemaker | Deprecate TF + JAX (#38758) | 2025-06-11 17:28:06 +01:00 |
| tensor_parallel | [TP] Change command in tests to python3 (#38555) | 2025-06-03 11:03:33 +00:00 |
| tokenization | Remove isort from dependencies (#38616) | 2025-06-05 16:42:49 +00:00 |
| trainer | from 1.11.0, torchao.prototype.low_bit_optim is promoted to torchao.optim (#38689) | 2025-06-11 12:16:25 +00:00 |
| utils | Expectation fixes and added AMD expectations (#38729) | 2025-06-13 16:14:58 +02:00 |
| __init__.py | GPU text generation: mMoved the encoded_prompt to correct device | 2020-01-06 15:11:12 +01:00 |
| causal_lm_tester.py | Refactor DBRX tests to use CausalLMModelTest base classes (#38475) | 2025-06-13 16:22:12 +01:00 |
| test_backbone_common.py | Use Python 3.9 syntax in tests (#37343) | 2025-04-08 14:12:08 +02:00 |
| test_configuration_common.py | Update composition flag usage (#36263) | 2025-04-09 11:48:49 +02:00 |
| test_feature_extraction_common.py | Use Python 3.9 syntax in tests (#37343) | 2025-04-08 14:12:08 +02:00 |
| test_image_processing_common.py | enable more test cases on xpu (#38572) | 2025-06-06 09:29:51 +02:00 |
| test_image_transforms.py | Fix pad image transform for batched inputs (#37544) | 2025-05-08 10:51:15 +01:00 |
| test_modeling_common.py | Expectation fixes and added AMD expectations (#38729) | 2025-06-13 16:14:58 +02:00 |
| test_pipeline_mixin.py | Use Python 3.9 syntax in tests (#37343) | 2025-04-08 14:12:08 +02:00 |
| test_processing_common.py | [video processors] support frame sampling within processors (#38105) | 2025-06-12 09:34:30 +00:00 |
| test_sequence_feature_extraction_common.py | Use Python 3.9 syntax in tests (#37343) | 2025-04-08 14:12:08 +02:00 |
| test_tokenization_common.py | 🚨 rm already deprecated pad_to_max_length arg (#37617) | 2025-05-01 15:21:55 +02:00 |
| test_training_args.py | Fix TrainingArguments.torch_empty_cache_steps post_init check (#36734) | 2025-03-17 16:09:46 +01:00 |
| test_video_processing_common.py | [video processors] support frame sampling within processors (#38105) | 2025-06-12 09:34:30 +00:00 |