transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 12:50:06 +06:00

History

Zhen e686fed635 [Feature] Support using FlashAttention2 on Ascend NPU (#36696 ) * [Feature] Support using flash-attention on Ascend NPU * Fix qwen3 and qwen3_moe moduler conversion mismatch		2025-03-31 16:12:58 +02:00
..
agents	use torch.testing.assertclose instead to get more details about error in cis (#35659 )	2025-01-24 16:55:28 +01:00
bettertransformer	Fix typos in tests (#36547 )	2025-03-05 15:04:06 -08:00
deepspeed	chore: fix typos in the tests directory (#36813 )	2025-03-21 10:20:05 +01:00
extended	[tests] skip tests for xpu (#33553 )	2024-09-19 19:28:04 +01:00
fixtures	Implementation of SuperPoint and AutoModelForKeypointDetection (#28966 )	2024-03-19 14:43:02 +00:00
fsdp	HPU support (#36424 )	2025-03-12 09:08:12 +01:00
generation	[MLU] Fix FA2 check error, remove deepspeed-mlu deps. (#36159 )	2025-03-31 11:02:49 +02:00
models	skip (#37141 )	2025-03-31 15:38:40 +02:00
optimization	Just import torch AdamW instead (#36177 )	2025-03-19 18:29:40 +00:00
peft_integration	Set weights_only in torch.load (#36991 )	2025-03-27 14:55:50 +00:00
pipelines	Update ruff to `0.11.2` (#36962 )	2025-03-25 16:00:11 +01:00
quantization	[tests] remove cuda-only test marker in `AwqConfigTest` (#37032 )	2025-03-31 11:53:02 +02:00
repo_utils	Adding Qwen3 and Qwen3MoE (#36878 )	2025-03-31 09:50:49 +02:00
sagemaker	Change GPUS to GPUs (#36945 )	2025-03-25 17:25:39 +01:00
tensor_parallel	enable tp on CPU (#36299 )	2025-03-31 10:55:47 +02:00
tokenization	Use `lru_cache` for tokenization tests (#36818 )	2025-03-28 15:09:35 +01:00
trainer	Set weights_only in torch.load (#36991 )	2025-03-27 14:55:50 +00:00
utils	[Feature] Support using FlashAttention2 on Ascend NPU (#36696 )	2025-03-31 16:12:58 +02:00
__init__.py	GPU text generation: mMoved the encoded_prompt to correct device	2020-01-06 15:11:12 +01:00
test_backbone_common.py	Align backbone stage selection with out_indices & out_features (#27606 )	2023-12-20 18:33:17 +00:00
test_configuration_common.py	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
test_feature_extraction_common.py	Split common test from core tests (#24284 )	2023-06-15 07:30:24 -04:00
test_image_processing_common.py	Make the flaky list a little more general (#36704 )	2025-03-14 12:15:32 +00:00
test_image_transforms.py	Uses Collection in transformers.image_transforms.normalize (#36301 )	2025-02-21 18:38:41 +01:00
test_modeling_common.py	Support QuestionAnswering Module for ModernBert based models. (#35566 )	2025-03-26 21:24:18 +01:00
test_modeling_flax_common.py	Fix typos in tests (#36547 )	2025-03-05 15:04:06 -08:00
test_modeling_tf_common.py	fix typos in the tests directory (#36717 )	2025-03-17 17:45:57 +00:00
test_pipeline_mixin.py	Add image text to text pipeline (#34170 )	2024-10-31 15:48:11 -04:00
test_processing_common.py	[chat templates} support loading audio from video (#36955 )	2025-03-27 14:46:11 +01:00
test_sequence_feature_extraction_common.py	Fix typo (#25966 )	2023-09-05 10:12:25 +02:00
test_tokenization_common.py	Use `lru_cache` for tokenization tests (#36818 )	2025-03-28 15:09:35 +01:00
test_training_args.py	Fix `TrainingArguments.torch_empty_cache_steps` post_init check (#36734 )	2025-03-17 16:09:46 +01:00