..
benchmark
[Test refactor 1/5] Per-folder tests reorganization ( #15725 )
2022-02-23 15:46:28 -05:00
bettertransformer
Fixed malapropism error ( #26660 )
2023-10-09 11:04:57 +02:00
deepspeed
Add deepspeed test to amd scheduled CI ( #27633 )
2023-12-11 16:33:36 +01:00
extended
Device agnostic trainer testing ( #27131 )
2023-10-30 18:16:40 +00:00
fixtures
[WIP] add SpeechT5 model ( #18922 )
2023-02-03 12:43:46 -05:00
fsdp
fix resuming from ckpt when using FSDP with FULL_STATE_DICT ( #27891 )
2023-12-16 19:41:43 +05:30
generation
Generate: assisted decoding now uses generate
for the assistant ( #28030 )
2023-12-14 13:31:13 +00:00
models
Remove token_type_ids from model_input_names (like #24788 ) ( #28325 )
2024-01-03 19:26:07 +01:00
optimization
Make schedulers picklable by making lr_lambda fns global ( #21768 )
2023-03-02 12:08:43 -05:00
peft_integration
[Peft
] modules_to_save
support for peft integration ( #27466 )
2023-11-14 10:32:57 +01:00
pipelines
[Whisper] Fix word-level timestamps with bs>1 or num_beams>1 ( #28114 )
2023-12-22 12:43:11 +00:00
quantization
[Awq
] Enable the possibility to skip quantization for some target modules ( #27950 )
2023-12-25 11:06:56 +01:00
repo_utils
Allow # Ignore copy
( #27328 )
2023-12-07 10:00:08 +01:00
sagemaker
Broken links fixed related to datasets docs ( #27569 )
2023-11-17 13:44:09 -08:00
tokenization
[Styling
] stylify using ruff ( #27144 )
2023-11-16 17:43:19 +01:00
tools
Add support for for loops in python interpreter ( #24429 )
2023-06-26 09:58:14 -04:00
trainer
move code to Trainer.evaluate to enable use of that function with multiple datasets ( #27844 )
2023-12-20 10:55:56 +01:00
utils
Fix the check of models supporting FA/SDPA not run ( #28202 )
2023-12-22 12:56:11 +01:00
__init__.py
GPU text generation: mMoved the encoded_prompt to correct device
2020-01-06 15:11:12 +01:00
test_backbone_common.py
Align backbone stage selection with out_indices & out_features ( #27606 )
2023-12-20 18:33:17 +00:00
test_cache_utils.py
Generate: SinkCache can handle iterative prompts ( #27907 )
2023-12-08 20:02:20 +00:00
test_configuration_common.py
[ PretrainedConfig
] Improve messaging ( #27438 )
2023-11-15 14:10:39 +01:00
test_configuration_utils.py
F.scaled_dot_product_attention support ( #26572 )
2023-12-09 05:38:14 +09:00
test_feature_extraction_common.py
Split common test from core tests ( #24284 )
2023-06-15 07:30:24 -04:00
test_feature_extraction_utils.py
Remove-auth-token ( #27060 )
2023-11-13 14:20:54 +01:00
test_image_processing_common.py
Fix a couple of typos and add an illustrative test ( #26941 )
2023-12-11 15:51:51 +00:00
test_image_processing_utils.py
Remove-auth-token ( #27060 )
2023-11-13 14:20:54 +01:00
test_image_transforms.py
Normalize floating point cast ( #27249 )
2023-11-10 15:35:27 +00:00
test_modeling_common.py
Fix weights not properly initialized due to shape mismatch ( #28122 )
2023-12-20 14:20:02 +01:00
test_modeling_flax_common.py
Split common test from core tests ( #24284 )
2023-06-15 07:30:24 -04:00
test_modeling_flax_utils.py
Default to msgpack for safetensors ( #27460 )
2023-11-13 15:17:01 +01:00
test_modeling_tf_common.py
Replace build() with build_in_name_scope() for some TF tests ( #28046 )
2023-12-14 17:42:25 +00:00
test_modeling_tf_utils.py
Replace build() with build_in_name_scope() for some TF tests ( #28046 )
2023-12-14 17:42:25 +00:00
test_modeling_utils.py
4D attention_mask
support ( #27539 )
2023-12-17 11:08:04 +01:00
test_pipeline_mixin.py
Shorten the conversation tests for speed + fixing position overflows ( #26960 )
2023-10-31 14:20:04 +00:00
test_sequence_feature_extraction_common.py
Fix typo ( #25966 )
2023-09-05 10:12:25 +02:00
test_tokenization_common.py
[Styling
] stylify using ruff ( #27144 )
2023-11-16 17:43:19 +01:00
test_tokenization_utils.py
Remove-auth-token ( #27060 )
2023-11-13 14:20:54 +01:00