transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

History

Joshua Lochner 6e2d04e429 Fix slow GemmaTokenizer and improve SPM slow -> fast conversion process (#32191 ) * Remove user-defined tokens which can be obtained through merges * Remove debug line * formatting * Refactor spm slow -> fast converter * revert unnecessary refactor * set comprehension * remove test files * Use `vocab_scores` * Always replace spiece underline with space in decode * we no longer need token filtering * Add save fast load slow unit test * Remove tokenizers version check * Remove duplicate code * Make `<start_of_turn>` and `<end_of_turn>` special tokens * Bias merge priority with length if score is the same * Add unit test for merge priority * CI		2024-07-30 23:36:38 +02:00
..
agents	Updated `ruff` to the latest version (#31926 )	2024-07-23 17:07:31 +02:00
benchmark
bettertransformer	Fixed malapropism error (#26660 )	2023-10-09 11:04:57 +02:00
deepspeed	[tests] fix deepspeed zero3 config for `test_stage3_nvme_offload` (#31881 )	2024-07-16 16:11:37 +02:00
extended	Skip tests properly (#31308 )	2024-06-26 21:59:08 +01:00
fixtures	Implementation of SuperPoint and AutoModelForKeypointDetection (#28966 )	2024-03-19 14:43:02 +00:00
fsdp	Llama et al. / FSDP : Fix breaking change in 4.40 for FSDP (#31161 )	2024-06-26 14:50:08 +01:00
generation	Generate: end-to-end compilation (#30788 )	2024-07-29 10:52:13 +01:00
models	Fix slow GemmaTokenizer and improve SPM slow -> fast conversion process (#32191 )	2024-07-30 23:36:38 +02:00
optimization	fix: Fixed the `1st argument` name in classmethods (#31907 )	2024-07-11 12:11:50 +01:00
peft_integration	FIX [`CI`]: Fix failing tests for peft integration (#29330 )	2024-02-29 03:56:16 +01:00
pipelines	[pipeline] fix padding for 1-d tensors (#31776 )	2024-07-29 21:24:42 +08:00
quantization	Support dequantizing GGUF FP16 format (#31783 )	2024-07-24 17:59:59 +02:00
repo_utils	Allow `# Ignore copy` (#27328 )	2023-12-07 10:00:08 +01:00
sagemaker	Fixed `log messages` that are resulting in TypeError due to too many arguments (#32017 )	2024-07-17 10:56:44 +01:00
tokenization	Skip tests properly (#31308 )	2024-06-26 21:59:08 +01:00
trainer	Enhancing SFT Training Efficiency Using Packing and FlashAttention2 with Position IDs (#31629 )	2024-07-23 15:56:41 +02:00
utils	Make static cache compatible with torch.export (#32168 )	2024-07-29 18:19:15 +01:00
__init__.py
test_backbone_common.py	Align backbone stage selection with out_indices & out_features (#27606 )	2023-12-20 18:33:17 +00:00
test_configuration_common.py	Refactor: Removed un-necessary `object` base class (#32230 )	2024-07-26 10:33:02 +02:00
test_feature_extraction_common.py	Split common test from core tests (#24284 )	2023-06-15 07:30:24 -04:00
test_image_processing_common.py	Skip tests properly (#31308 )	2024-06-26 21:59:08 +01:00
test_image_transforms.py	fix: center_crop occasionally outputs off-by-one dimension matrix (#30934 )	2024-05-21 13:56:52 +01:00
test_modeling_common.py	Flash-Attn: fix generation when no attention mask or no pading (#32241 )	2024-07-26 14:45:55 +05:00
test_modeling_flax_common.py	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
test_modeling_tf_common.py	Port IDEFICS to tensorflow (#26870 )	2024-05-13 15:59:46 +01:00
test_pipeline_mixin.py	fix: Fixed raising `TypeError` instead of `ValueError` for invalid type (#32111 )	2024-07-22 17:46:17 +01:00
test_processing_common.py	add initial design for uniform processors + align model (#31197 )	2024-06-13 16:27:16 +02:00
test_sequence_feature_extraction_common.py	Fix typo (#25966 )	2023-09-05 10:12:25 +02:00
test_tokenization_common.py	Return assistant generated tokens mask in apply_chat_template (#30650 )	2024-07-22 18:24:43 +01:00