transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

History

eustlb da334bcfa8 [Whisper] 🚨 Fix whisper decoding 🚨 (#34135 ) * do not remove decoder_input_ids for the first segment * do not remove eos token in generate_with_fallback * when removing padding tokens, do not remove eos token * remove eos token in generate (and not in generate_with_fallback!) * reconciliate short-from/ long-form behavior * correct avg_logprobs calculation * handle eos token in segments * handle decoder_input_ids and eos token in _prepare_decoder_input_ids * fix incorrect time precision * always remove eos token * always remove decoder_input_ids * no need to handle decoder_inputs_ids and eos token * no need to remove decoder_input_ids * no need to handle eos token * fix num_beams in _retrieve_logit_processors * remove todo unconsistency * no need to add eos token * last_timestamp_pos should indeed be timestamp token pos * patch generate to enable compatibility with GenerationTesterMixin tests * adapt test_generate_continue_from_past_key_values * adapt test_prompt_lookup_decoding_matches_greedy_search * adapt generic GenerationMixin tests to whisper's generate * fix speculative decoding * fix * [run-slow] whisper * change HF_HUB_TOKEN for require_read_token * [run-slow] whisper * prioritize kwargs over generation_config * remove unnecessary args * [run-slow] whisper * update tests * [run-slow] whisper * add comment * update test * [run-slow] whisper * update test + revert require_read_token * docstring updates * revert tokenizer decode args change * do not use a patch + docstring updates * [run-slow] whisper * make * [run-slow] whisper * add a flag to force unique call to generate * test update * [run-slow] whisper * add force_unique_generate_call arg * do not use a patch * correct the timestamps for the pad tokens * docstring update * docstring update * docstring update * upodate TF tests * add require_read_token * [run-slow] whisper * test reset dynamo * [run-slow] whisper * fix * [run-slow] whisper * avoid iterating twice on current_segments * [run-slow] whisper * [run-slow] whisper --------- Co-authored-by: Eustache Le Bihan <eustlb@users.noreply.huggingface.co> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>		2024-12-18 14:13:21 +01:00
..
agents	Add token cost + runtime monitoring to Agent and HfEngine children (#34548 )	2024-12-03 13:14:52 +01:00
benchmark	[Test refactor 1/5] Per-folder tests reorganization (#15725 )	2022-02-23 15:46:28 -05:00
bettertransformer	Fixed malapropism error (#26660 )	2023-10-09 11:04:57 +02:00
deepspeed	Trainer - deprecate tokenizer for processing_class (#32385 )	2024-10-02 14:08:46 +01:00
extended	[tests] skip tests for xpu (#33553 )	2024-09-19 19:28:04 +01:00
fixtures	Implementation of SuperPoint and AutoModelForKeypointDetection (#28966 )	2024-03-19 14:43:02 +00:00
fsdp	FSDP grad accum fix (#34645 )	2024-11-15 22:28:06 +01:00
generation	skip Fuyu from test_generate (#35246 )	2024-12-13 10:12:49 +01:00
models	[Whisper] 🚨 Fix whisper decoding 🚨 (#34135 )	2024-12-18 14:13:21 +01:00
optimization	fix: Fixed the `1st argument` name in classmethods (#31907 )	2024-07-11 12:11:50 +01:00
peft_integration	[PEFT] Better Trainer error when prompt learning with loading best model at the end (#35087 )	2024-12-11 12:44:39 +01:00
pipelines	Fix seamless TTS generate (#34968 )	2024-12-11 15:38:42 +01:00
quantization	Fix : model used to test ggml conversion of Falcon-7b is incorrect (#35083 )	2024-12-16 13:21:44 +01:00
repo_utils	Refactor CI: more explicit (#30674 )	2024-08-30 18:17:25 +02:00
sagemaker	Trainer - deprecate tokenizer for processing_class (#32385 )	2024-10-02 14:08:46 +01:00
tokenization	VLM: special multimodal Tokenizer (#34461 )	2024-11-04 16:37:51 +01:00
tp	Simplify Tensor Parallel implementation with PyTorch TP (#34184 )	2024-11-18 19:51:49 +01:00
trainer	Fix GA loss bugs and add unit test (#35121 )	2024-12-09 09:57:41 +01:00
utils	Fix loading with only state dict and low_cpu_mem_usage = True (#35217 )	2024-12-18 09:54:32 +01:00
__init__.py
test_backbone_common.py	Align backbone stage selection with out_indices & out_features (#27606 )	2023-12-20 18:33:17 +00:00
test_configuration_common.py	Load sub-configs from composite configs (#34410 )	2024-11-05 11:34:01 +01:00
test_feature_extraction_common.py	Split common test from core tests (#24284 )	2023-06-15 07:30:24 -04:00
test_image_processing_common.py	Fall back to slow image processor in ImageProcessingAuto when no fast processor available (#34785 )	2024-12-15 14:00:36 -05:00
test_image_transforms.py	fix: center_crop occasionally outputs off-by-one dimension matrix (#30934 )	2024-05-21 13:56:52 +01:00
test_modeling_common.py	Support for SDPA for SAM models (#34110 )	2024-12-17 14:46:05 +01:00
test_modeling_flax_common.py	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
test_modeling_tf_common.py	[`TF`] Fix Tensorflow XLA Generation on limited seq_len models (#33903 )	2024-10-05 16:20:50 +02:00
test_pipeline_mixin.py	Add image text to text pipeline (#34170 )	2024-10-31 15:48:11 -04:00
test_processing_common.py	Separate chat templates into a single file (#33957 )	2024-11-26 14:18:04 +00:00
test_sequence_feature_extraction_common.py	Fix typo (#25966 )	2023-09-05 10:12:25 +02:00
test_tokenization_common.py	Separate chat templates into a single file (#33957 )	2024-11-26 14:18:04 +00:00