transformers/tests
Dmitry Rogozhkin 31830474bf
Fix test_eager_matches_sdpa_inference for XPU backend (#34889)
* Use torch.nn.attention.sdpa_kernel instead of the deprecated torch.backends.cuda.sdp_kernel (see the sketch below)

Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>
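
A minimal sketch of this migration, not the actual diff; the tensors, shapes and backend choice are illustrative only:

    import torch
    import torch.nn.functional as F
    from torch.nn.attention import SDPBackend, sdpa_kernel

    q = k = v = torch.randn(1, 8, 128, 64)

    # Deprecated (CUDA-only) context manager:
    #   with torch.backends.cuda.sdp_kernel(enable_flash=False, enable_math=True,
    #                                        enable_mem_efficient=False):
    #       out = F.scaled_dot_product_attention(q, k, v)

    # Replacement (PyTorch 2.3+): backend selection is device-agnostic.
    with sdpa_kernel(SDPBackend.MATH):
        out = F.scaled_dot_product_attention(q, k, v)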

* Fix test_eager_matches_sdpa_inference for XPU backend

As of PyTorch 2.5, the XPU backend supports only torch.nn.attention.SDPBackend.MATH,
which is implemented at the PyTorch level with aten operators and is device
agnostic with respect to each operator's implementation. Thus, we can reuse the
CUDA (or CPU) MATH weights for XPU, as sketched below.

Fixes: #34888
Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>
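
A minimal, hypothetical sketch (not the test code) of why the MATH reference carries over to
XPU: with SDPBackend.MATH the fused kernels are bypassed and attention is computed with plain
aten ops, so the result matches an eager reference on any device. Device selection, shapes and
tolerances below are illustrative assumptions:

    import torch
    import torch.nn.functional as F
    from torch.nn.attention import SDPBackend, sdpa_kernel

    device = "xpu" if hasattr(torch, "xpu") and torch.xpu.is_available() else "cpu"
    q, k, v = (torch.randn(1, 8, 32, 64, device=device) for _ in range(3))

    # SDPA restricted to the MATH backend (the only one XPU supports as of PyTorch 2.5).
    with sdpa_kernel(SDPBackend.MATH):
        sdpa_out = F.scaled_dot_product_attention(q, k, v)

    # Eager reference: the same math written out with explicit aten ops.
    scale = q.shape[-1] ** -0.5
    eager_out = torch.softmax((q @ k.transpose(-2, -1)) * scale, dim=-1) @ v

    torch.testing.assert_close(sdpa_out, eager_out, atol=1e-5, rtol=1e-5)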

* Use torch.amp.autocast instead of the deprecated torch.cuda.amp.autocast in nemotron (see the sketch below)

Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>
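
A small sketch of the autocast migration (illustrative only; the nemotron code uses its own
model, device and dtype):

    import torch

    x = torch.randn(4, 4)

    # Deprecated:
    #   with torch.cuda.amp.autocast(dtype=torch.float16):
    #       y = x.cuda() @ x.cuda()

    # Replacement: torch.amp.autocast takes the device type explicitly,
    # so the same code path covers "cuda", "cpu" and "xpu".
    with torch.amp.autocast(device_type="cpu", dtype=torch.bfloat16):
        y = x @ x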

---------

Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>
2024-12-02 16:21:04 +01:00
agents Agents: Small fixes in streaming to gradio + add tests (#34549) 2024-11-11 20:52:09 +01:00
benchmark [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
bettertransformer Fixed malapropism error (#26660) 2023-10-09 11:04:57 +02:00
deepspeed Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
extended [tests] skip tests for xpu (#33553) 2024-09-19 19:28:04 +01:00
fixtures Implementation of SuperPoint and AutoModelForKeypointDetection (#28966) 2024-03-19 14:43:02 +00:00
fsdp FSDP grad accum fix (#34645) 2024-11-15 22:28:06 +01:00
generation Offloaded cache: fix generate (#34921) 2024-11-28 15:05:56 +01:00
models Fix test_eager_matches_sdpa_inference for XPU backend (#34889) 2024-12-02 16:21:04 +01:00
optimization fix: Fixed the 1st argument name in classmethods (#31907) 2024-07-11 12:11:50 +01:00
peft_integration [PEFT] Set eval mode when loading PEFT adapter (#34509) 2024-11-28 13:56:25 +01:00
pipelines allow unused input parameters passthrough when chunking in asr pipelines (#33889) 2024-11-25 11:36:44 +01:00
quantization Skipping aqlm non working inference tests till fix merged (#34865) 2024-11-26 11:09:30 +01:00
repo_utils Refactor CI: more explicit (#30674) 2024-08-30 18:17:25 +02:00
sagemaker Trainer - deprecate tokenizer for processing_class (#32385) 2024-10-02 14:08:46 +01:00
tokenization VLM: special multimodal Tokenizer (#34461) 2024-11-04 16:37:51 +01:00
tp Simplify Tensor Parallel implementation with PyTorch TP (#34184) 2024-11-18 19:51:49 +01:00
trainer Remove FSDP wrapping from sub-models. (#34452) 2024-11-15 23:00:03 +01:00
utils Fix: take into account meta device (#34134) 2024-11-20 11:32:07 +01:00
__init__.py GPU text generation: Moved the encoded_prompt to correct device 2020-01-06 15:11:12 +01:00
test_backbone_common.py Align backbone stage selection with out_indices & out_features (#27606) 2023-12-20 18:33:17 +00:00
test_configuration_common.py Load sub-configs from composite configs (#34410) 2024-11-05 11:34:01 +01:00
test_feature_extraction_common.py Split common test from core tests (#24284) 2023-06-15 07:30:24 -04:00
test_image_processing_common.py Add DetrImageProcessorFast (#34063) 2024-10-21 09:05:05 -04:00
test_image_transforms.py fix: center_crop occasionally outputs off-by-one dimension matrix (#30934) 2024-05-21 13:56:52 +01:00
test_modeling_common.py Fix test_eager_matches_sdpa_inference for XPU backend (#34889) 2024-12-02 16:21:04 +01:00
test_modeling_flax_common.py add sdpa to ViT [follow up of #29325] (#30555) 2024-05-16 10:56:11 +01:00
test_modeling_tf_common.py [TF] Fix Tensorflow XLA Generation on limited seq_len models (#33903) 2024-10-05 16:20:50 +02:00
test_pipeline_mixin.py Add image text to text pipeline (#34170) 2024-10-31 15:48:11 -04:00
test_processing_common.py Separate chat templates into a single file (#33957) 2024-11-26 14:18:04 +00:00
test_sequence_feature_extraction_common.py Fix typo (#25966) 2023-09-05 10:12:25 +02:00
test_tokenization_common.py Separate chat templates into a single file (#33957) 2024-11-26 14:18:04 +00:00