transformers/tests
Zhen e686fed635
[Feature] Support using FlashAttention2 on Ascend NPU (#36696)
* [Feature] Support using flash-attention on Ascend NPU

* Fix qwen3 and qwen3_moe moduler conversion mismatch
2025-03-31 16:12:58 +02:00
..
agents use torch.testing.assertclose instead to get more details about error in cis (#35659) 2025-01-24 16:55:28 +01:00
bettertransformer Fix typos in tests (#36547) 2025-03-05 15:04:06 -08:00
deepspeed chore: fix typos in the tests directory (#36813) 2025-03-21 10:20:05 +01:00
extended [tests] skip tests for xpu (#33553) 2024-09-19 19:28:04 +01:00
fixtures Implementation of SuperPoint and AutoModelForKeypointDetection (#28966) 2024-03-19 14:43:02 +00:00
fsdp HPU support (#36424) 2025-03-12 09:08:12 +01:00
generation [MLU] Fix FA2 check error, remove deepspeed-mlu deps. (#36159) 2025-03-31 11:02:49 +02:00
models skip (#37141) 2025-03-31 15:38:40 +02:00
optimization Just import torch AdamW instead (#36177) 2025-03-19 18:29:40 +00:00
peft_integration Set weights_only in torch.load (#36991) 2025-03-27 14:55:50 +00:00
pipelines Update ruff to 0.11.2 (#36962) 2025-03-25 16:00:11 +01:00
quantization [tests] remove cuda-only test marker in AwqConfigTest (#37032) 2025-03-31 11:53:02 +02:00
repo_utils Adding Qwen3 and Qwen3MoE (#36878) 2025-03-31 09:50:49 +02:00
sagemaker Change GPUS to GPUs (#36945) 2025-03-25 17:25:39 +01:00
tensor_parallel enable tp on CPU (#36299) 2025-03-31 10:55:47 +02:00
tokenization Use lru_cache for tokenization tests (#36818) 2025-03-28 15:09:35 +01:00
trainer Set weights_only in torch.load (#36991) 2025-03-27 14:55:50 +00:00
utils [Feature] Support using FlashAttention2 on Ascend NPU (#36696) 2025-03-31 16:12:58 +02:00
__init__.py GPU text generation: mMoved the encoded_prompt to correct device 2020-01-06 15:11:12 +01:00
test_backbone_common.py Align backbone stage selection with out_indices & out_features (#27606) 2023-12-20 18:33:17 +00:00
test_configuration_common.py Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
test_feature_extraction_common.py Split common test from core tests (#24284) 2023-06-15 07:30:24 -04:00
test_image_processing_common.py Make the flaky list a little more general (#36704) 2025-03-14 12:15:32 +00:00
test_image_transforms.py Uses Collection in transformers.image_transforms.normalize (#36301) 2025-02-21 18:38:41 +01:00
test_modeling_common.py Support QuestionAnswering Module for ModernBert based models. (#35566) 2025-03-26 21:24:18 +01:00
test_modeling_flax_common.py Fix typos in tests (#36547) 2025-03-05 15:04:06 -08:00
test_modeling_tf_common.py fix typos in the tests directory (#36717) 2025-03-17 17:45:57 +00:00
test_pipeline_mixin.py Add image text to text pipeline (#34170) 2024-10-31 15:48:11 -04:00
test_processing_common.py [chat templates} support loading audio from video (#36955) 2025-03-27 14:46:11 +01:00
test_sequence_feature_extraction_common.py Fix typo (#25966) 2023-09-05 10:12:25 +02:00
test_tokenization_common.py Use lru_cache for tokenization tests (#36818) 2025-03-28 15:09:35 +01:00
test_training_args.py Fix TrainingArguments.torch_empty_cache_steps post_init check (#36734) 2025-03-17 16:09:46 +01:00