transformers/tests
efsotr 3ee72af6b6
Fix graph break in torch.compile when using FA2 with attention_mask=None and batch size > 1 (#37332)
* Fix graph break in torch.compile when using FA2 with attention_mask=None and batch size > 1

* fix code format

* add test; replace position_ids with query_states because position_ids.shape[0] is always 1

* add assert loss is not nan
2025-06-25 07:58:34 +00:00
bettertransformer Use Python 3.9 syntax in tests (#37343) 2025-04-08 14:12:08 +02:00
deepspeed Gaudi3 CI (#38790) 2025-06-23 10:56:51 +02:00
extended Add Optional to remaining types (#37808) 2025-04-28 14:20:45 +01:00
fixtures Implementation of SuperPoint and AutoModelForKeypointDetection (#28966) 2024-03-19 14:43:02 +00:00
fsdp Gaudi3 CI (#38790) 2025-06-23 10:56:51 +02:00
generation enable misc test cases on XPU (#38852) 2025-06-18 09:20:49 +02:00
models Skip sdpa dispatch on flash test due to unsupported head dims (#39010) 2025-06-24 20:16:56 +02:00
optimization Use Python 3.9 syntax in tests (#37343) 2025-04-08 14:12:08 +02:00
peft_integration FIX: Faulty PEFT tests (#37757) 2025-04-28 15:10:46 +02:00
pipelines [Feature] Support is_split_into_words in the TokenClassificationPipeline. (#38818) 2025-06-23 15:31:32 +00:00
quantization Fix HQQ model param device transfer issue (#38466) 2025-06-18 15:09:00 +02:00
repo_utils Use HF papers (#38184) 2025-06-13 11:07:09 +00:00
sagemaker Deprecate TF + JAX (#38758) 2025-06-11 17:28:06 +01:00
tensor_parallel [TP] Change command in tests to python3 (#38555) 2025-06-03 11:03:33 +00:00
tokenization Remove isort from dependencies (#38616) 2025-06-05 16:42:49 +00:00
trainer Gaudi3 CI (#38790) 2025-06-23 10:56:51 +02:00
utils Fix bugs in DynamicCache (#37880) 2025-06-24 19:43:40 +02:00
__init__.py GPU text generation: Moved the encoded_prompt to correct device 2020-01-06 15:11:12 +01:00
causal_lm_tester.py Refactor DBRX tests to use CausalLMModelTest base classes (#38475) 2025-06-13 16:22:12 +01:00
test_backbone_common.py Use Python 3.9 syntax in tests (#37343) 2025-04-08 14:12:08 +02:00
test_configuration_common.py Update composition flag usage (#36263) 2025-04-09 11:48:49 +02:00
test_feature_extraction_common.py Use Python 3.9 syntax in tests (#37343) 2025-04-08 14:12:08 +02:00
test_image_processing_common.py Add Idefics2/3 and SmolVLM Fast image processors + improvements for fast image processors (#38157) 2025-06-23 14:17:25 +00:00
test_image_transforms.py Fix pad image transform for batched inputs (#37544) 2025-05-08 10:51:15 +01:00
test_modeling_common.py Fix graph break in torch.compile when using FA2 with attention_mask=None and batch size > 1 (#37332) 2025-06-25 07:58:34 +00:00
test_pipeline_mixin.py No more Tuple, List, Dict (#38797) 2025-06-17 19:37:18 +01:00
test_processing_common.py [video processors] support frame sampling within processors (#38105) 2025-06-12 09:34:30 +00:00
test_sequence_feature_extraction_common.py No more Tuple, List, Dict (#38797) 2025-06-17 19:37:18 +01:00
test_tokenization_common.py 🚨 rm already deprecated pad_to_max_length arg (#37617) 2025-05-01 15:21:55 +02:00
test_training_args.py Fix TrainingArguments.torch_empty_cache_steps post_init check (#36734) 2025-03-17 16:09:46 +01:00
test_video_processing_common.py [video processors] support frame sampling within processors (#38105) 2025-06-12 09:34:30 +00:00