transformers/tests
Younes Belkada 3f93fd0694
Llama et al. / FSDP : Fix breaking change in 4.40 for FSDP (#31161)
* fix llama fsdp

* fixup

* adding FSDP tests for CPU offloading

* fixes

* fix tests

* fix tests

* add it for mixtral

* propagate the changes on other models

* Update src/transformers/models/phi/modeling_phi.py

* Delete utils/testing_scripts/fsdp_cpu_offloading.py

Remove script - FSDP + CPU offloading it tested in the test suite

* Delete utils/testing_scripts/dummy_fsdp_config.yml

* Update + add cache_positions docstring

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-06-26 14:50:08 +01:00
..
agents Agents: Improve python interpreter (#31409) 2024-06-18 11:55:36 +02:00
benchmark [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
bettertransformer Fixed malapropism error (#26660) 2023-10-09 11:04:57 +02:00
deepspeed [tests] rename test_config_object to test_ds_config_object (#31403) 2024-06-19 11:19:15 +02:00
extended CI: update to ROCm 6.0.2 and test MI300 (#30266) 2024-05-13 18:14:36 +02:00
fixtures Implementation of SuperPoint and AutoModelForKeypointDetection (#28966) 2024-03-19 14:43:02 +00:00
fsdp Llama et al. / FSDP : Fix breaking change in 4.40 for FSDP (#31161) 2024-06-26 14:50:08 +01:00
generation RWKV: enable generation tests (#31490) 2024-06-20 14:15:01 +01:00
models [GPT-NeoX] Add SDPA support (#31031) 2024-06-26 13:56:36 +01:00
optimization fix get_scheduler when name is warmup_stable_decay (#31128) 2024-05-30 15:25:43 +01:00
peft_integration FIX [CI]: Fix failing tests for peft integration (#29330) 2024-02-29 03:56:16 +01:00
pipelines fix output data type of image classification (#31444) 2024-06-25 11:14:39 +01:00
quantization GGUF: Fix llama 3 GGUF (#31358) 2024-06-20 14:29:58 +02:00
repo_utils Allow # Ignore copy (#27328) 2023-12-07 10:00:08 +01:00
sagemaker Pass datasets trust_remote_code (#31406) 2024-06-17 17:29:13 +01:00
tokenization update ruff version (#30932) 2024-05-22 06:40:15 +02:00
trainer Pass datasets trust_remote_code (#31406) 2024-06-17 17:29:13 +01:00
utils Add implementation of spectrogram_batch (#27159) 2024-06-24 09:19:12 +02:00
__init__.py GPU text generation: mMoved the encoded_prompt to correct device 2020-01-06 15:11:12 +01:00
test_backbone_common.py Align backbone stage selection with out_indices & out_features (#27606) 2023-12-20 18:33:17 +00:00
test_cache_utils.py Generate: add tests for caches with pad_to_multiple_of (#29462) 2024-03-06 10:57:04 +00:00
test_configuration_common.py [ PretrainedConfig] Improve messaging (#27438) 2023-11-15 14:10:39 +01:00
test_configuration_utils.py Use HF_HUB_OFFLINE + fix has_file in offline mode (#31016) 2024-05-29 11:55:43 +01:00
test_feature_extraction_common.py Split common test from core tests (#24284) 2023-06-15 07:30:24 -04:00
test_feature_extraction_utils.py [tests] remove deprecated tests for model loading (#29450) 2024-03-15 14:18:41 +00:00
test_image_processing_common.py Add ViTImageProcessorFast to tests (#31424) 2024-06-25 13:36:58 +01:00
test_image_processing_utils.py [tests] remove deprecated tests for model loading (#29450) 2024-03-15 14:18:41 +00:00
test_image_transforms.py fix: center_crop occasionally outputs off-by-one dimension matrix (#30934) 2024-05-21 13:56:52 +01:00
test_modeling_common.py Correct @is_flaky test decoration (#31480) 2024-06-24 08:09:21 +01:00
test_modeling_flax_common.py add sdpa to ViT [follow up of #29325] (#30555) 2024-05-16 10:56:11 +01:00
test_modeling_flax_utils.py Enable safetensors conversion from PyTorch to other frameworks without the torch requirement (#27599) 2024-01-23 10:28:23 +01:00
test_modeling_tf_common.py Port IDEFICS to tensorflow (#26870) 2024-05-13 15:59:46 +01:00
test_modeling_tf_utils.py Cast bfloat16 to float32 for Numpy conversions (#29755) 2024-03-21 14:04:11 +00:00
test_modeling_utils.py Use huggingface_hub helper function to split state dict (#31091) 2024-06-12 14:10:32 +02:00
test_pipeline_mixin.py Remove ConversationalPipeline and Conversation object (#31165) 2024-06-07 17:50:18 +01:00
test_processing_common.py add initial design for uniform processors + align model (#31197) 2024-06-13 16:27:16 +02:00
test_sequence_feature_extraction_common.py Fix typo (#25966) 2023-09-05 10:12:25 +02:00
test_tokenization_common.py SPLIT PR: add user defined symbols and control symbols (#31305) 2024-06-21 01:48:10 -07:00
test_tokenization_utils.py Token healing (#30081) 2024-06-03 10:53:15 +02:00