transformers/tests
Younes Belkada 93e538ae2e
Mamba / FalconMamba: Fix mamba left padding (#32677)
* fix mamba left padding

* Apply suggestions from code review

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

* fix copies

* test with `inputs_embeds`

* Update src/transformers/models/falcon_mamba/modeling_falcon_mamba.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* copies

* clairfy

* fix last comments

* remove

---------

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2024-08-19 16:01:35 +02:00
..
agents Agents use grammar (#31735) 2024-08-07 11:42:52 +02:00
benchmark [Test refactor 1/5] Per-folder tests reorganization (#15725) 2022-02-23 15:46:28 -05:00
bettertransformer Fixed malapropism error (#26660) 2023-10-09 11:04:57 +02:00
deepspeed Revert PR 32299, flag users when Zero-3 was missed (#32851) 2024-08-16 12:35:41 -04:00
extended Skip tests properly (#31308) 2024-06-26 21:59:08 +01:00
fixtures Implementation of SuperPoint and AutoModelForKeypointDetection (#28966) 2024-03-19 14:43:02 +00:00
fsdp Llama et al. / FSDP : Fix breaking change in 4.40 for FSDP (#31161) 2024-06-26 14:50:08 +01:00
generation Generate: unify LogitsWarper and LogitsProcessor (#32626) 2024-08-16 11:20:41 +01:00
models Mamba / FalconMamba: Fix mamba left padding (#32677) 2024-08-19 16:01:35 +02:00
optimization fix: Fixed the 1st argument name in classmethods (#31907) 2024-07-11 12:11:50 +01:00
peft_integration FIX [CI]: Fix failing tests for peft integration (#29330) 2024-02-29 03:56:16 +01:00
pipelines fix tensors on different devices in WhisperGenerationMixin (#32316) 2024-08-13 11:29:57 +01:00
quantization Cache: use batch_size instead of max_batch_size (#32657) 2024-08-16 11:48:45 +01:00
repo_utils Allow # Ignore copy (#27328) 2023-12-07 10:00:08 +01:00
sagemaker Fixed log messages that are resulting in TypeError due to too many arguments (#32017) 2024-07-17 10:56:44 +01:00
tokenization #32184 save total_vocab_size (#32240) 2024-08-05 09:22:48 +02:00
trainer Add support for GrokAdamW optimizer (#32521) 2024-08-13 13:20:28 +01:00
utils Reduce the error log when using core models that need their weights renamed, and provide a step forward (#32656) 2024-08-16 13:05:57 -04:00
__init__.py GPU text generation: mMoved the encoded_prompt to correct device 2020-01-06 15:11:12 +01:00
test_backbone_common.py Align backbone stage selection with out_indices & out_features (#27606) 2023-12-20 18:33:17 +00:00
test_configuration_common.py Refactor: Removed un-necessary object base class (#32230) 2024-07-26 10:33:02 +02:00
test_feature_extraction_common.py Split common test from core tests (#24284) 2023-06-15 07:30:24 -04:00
test_image_processing_common.py Update kwargs validation for preprocess with decorator (#32024) 2024-08-06 11:33:05 +01:00
test_image_transforms.py fix: center_crop occasionally outputs off-by-one dimension matrix (#30934) 2024-05-21 13:56:52 +01:00
test_modeling_common.py [tests] make test_sdpa_can_compile_dynamic device-agnostic (#32519) 2024-08-19 12:46:59 +01:00
test_modeling_flax_common.py add sdpa to ViT [follow up of #29325] (#30555) 2024-05-16 10:56:11 +01:00
test_modeling_tf_common.py Port IDEFICS to tensorflow (#26870) 2024-05-13 15:59:46 +01:00
test_pipeline_mixin.py fix: Fixed raising TypeError instead of ValueError for invalid type (#32111) 2024-07-22 17:46:17 +01:00
test_processing_common.py Modify ProcessorTesterMixin for better generalization (#32637) 2024-08-13 11:48:53 -04:00
test_sequence_feature_extraction_common.py Fix typo (#25966) 2023-09-05 10:12:25 +02:00
test_tokenization_common.py Fix conflicting key in init kwargs in PreTrainedTokenizerBase (#31233) 2024-08-01 14:32:13 +02:00