transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 12:50:06 +06:00

Author	SHA1	Message	Date
Yih-Dar	7819911b0c	Use T4 single GPU runner with more CPU RAM (#37961 ) larger T4 single GPU Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-05 16:17:45 +02:00
Joao Gante	3b067a15dd	[core] reuse unused reserved cuda memory when loading models (#37920 )	2025-05-05 15:14:05 +01:00
ivarflakstad	afbc293e2b	More fault tolerant notification service (#37924 ) * Let notification service succeed even when artifacts and reported jobs on github have mismatch * Use default trace msg if no trace msg available * Add pop_default helper fn * style	2025-05-05 15:19:48 +02:00
NielsRogge	36ca58bf4f	[D-FINE] Update names (#37957 ) * Update names * Fix modular --------- Co-authored-by: qubvel <qubvel@gmail.com>	2025-05-05 13:05:46 +01:00
Joao Gante	2932f318a2	[docs] logits docstring (#37929 )	2025-05-02 16:38:35 +01:00
Jerry Zhang	fa3c3f9cab	Break weight tying when quantizing input embedding (#37905 ) Summary: Currently when we try to quantize input_embedding for some models, the output embedding (lm_head) will also be quantized the same way, since they are tied, and this may not be what we want. To break the tie, we added the option to allow people to 1. load unquantized weight 2. tie weights 3. quantize so that the tie will be broken Test Plan: ``` from transformers import ( AutoModelForCausalLM, AutoProcessor, AutoTokenizer, TorchAoConfig, ) from torchao.quantization.quant_api import ( IntxWeightOnlyConfig, Int8DynamicActivationIntxWeightConfig, AOPerModuleConfig ) from torchao.quantization.granularity import PerGroup, PerAxis import torch model_id = "microsoft/Phi-4-mini-instruct" embedding_config = IntxWeightOnlyConfig( weight_dtype=torch.int8, granularity=PerAxis(0), ) linear_config = Int8DynamicActivationIntxWeightConfig( weight_dtype=torch.int4, weight_granularity=PerGroup(32), weight_scale_dtype=torch.bfloat16, ) quant_config = AOPerModuleConfig({"_default": linear_config, "model.embed_tokens": embedding_config}) quantization_config = TorchAoConfig(quant_type=quant_config, include_embedding=True, untie_embedding_weights=True) quantized_model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32, device_map="auto", quantization_config=quantization_config) tokenizer = AutoTokenizer.from_pretrained(model_id) print(quantized_model) print("embed_tokens.weight:", quantized_model.model.embed_tokens.weight) print("lm head weight:", quantized_model.lm_head.weight) from transformers.modeling_utils import find_tied_parameters print(find_tied_parameters(quantized_model)) ``` Reviewers: Subscribers: Tasks: Tags: Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-05-02 10:53:23 +02:00
Aritra Roy Gosthipaty	8a0a508f2b	Aligning modeling code for GPT2 to work with vLLM (fallback) (#36934 ) * aligning for vllm * using input shape rather than attn outputs * remove demo * revert Conv1D * style * style * Update src/transformers/models/gpt2/modeling_gpt2.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fix copies * Apply suggestions from code review Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> * adding docs about vllm * chore: style --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-05-02 09:55:16 +02:00
Federico Baldassarre	e94a4807df	Add usage example for DINOv2 (#37398 ) * Add usage example for DINOv2 * More explicit shape names * More verbose text * Moved example to Notes section * Indentation	2025-05-01 08:54:22 -07:00
Bogeum Kim	d20aa68193	🌐 [i18n-KO] Translated `gpu_selection.md` to Korean (#36757 ) * Add _toctree.yml * feat: serving.md draft * Add _toctree.yml * feat: gpu_selection.md nmt draft * fix: TOC edit * Update docs/source/ko/serving.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ko/gpu_selection.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ko/serving.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update _toctree.yml --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-01 08:44:12 -07:00
woctordho	ee25d57ed1	Improve performance of `load_state_dict` (#37902 ) Improve performance of load_state_dict	2025-05-01 16:35:17 +02:00
Joao Gante	410aa01901	[chat] clean code and add base help (#37892 )	2025-05-01 15:12:18 +01:00
co63oc	5b573bebb9	Fix typos in strings and comments (#37910 )	2025-05-01 14:58:58 +01:00
Ita Zaporozhets	c80f65265b	🚨 rm already deprecated pad_to_max_length arg (#37617 ) * rm already deprecated padding max length * truncate_strategy AS AN ARG is already deprecated for a few years * fix * rm test_padding_to_max_length * rm pad_to_max_length=True in other tests * rm from common * missed fnet	2025-05-01 15:21:55 +02:00
Diogo Glória-Silva	7a3e208892	fixed gemma3 collection path pointing to llama 2 collection. (#37899 )	2025-04-30 12:50:54 -07:00
Jerry Zhang	86777b5e2f	Support `AOPerModuleConfig` and `include_embedding` (#37802 ) * Support `AOPerModuleConfig` and include_embedding Summary: This PR adds support per module configuration for torchao Also added per module quantization examples: 1. Quantizing different layers with different quantization configs 2. Skip quantization for certain layers Test Plan: python tests/quantization/torchao_integration/test_torchao.py -k test_include_embedding python tests/quantization/torchao_integration/test_torchao.py -k test_per_module_config_skip Reviewers: Subscribers: Tasks: Tags: * format * format * inlcude embedding remove input embedding from module not to convert * more docs * Update docs/source/en/quantization/torchao.md Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> * Update src/transformers/quantizers/quantizer_torchao.py Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> * Update src/transformers/quantizers/quantizer_torchao.py Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-04-30 20:16:29 +02:00
Sifal	c3aeaa8060	Enhance documentation to explain chat-based few-shot prompting (#37828 ) * Enhance documentation to explain chat-based few-shot prompting Updates the documentation on few-shot prompting to illustrate how to structure examples using the chat-based format for instruction-tuned models. * Update docs/source/en/tasks/prompting.md Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update docs/source/en/tasks/prompting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tasks/prompting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tasks/prompting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tasks/prompting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix typos --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-04-30 11:00:10 -07:00
Mohamed Mekkouri	36e2e33bbe	Fix Qwen3 tp plan with FP8 (#37871 ) * update for qwen 3 * fix style * rm print	2025-04-30 18:14:10 +02:00
Joao Gante	8e8025b384	[tests] reset logs in `torch.compile` test (#37894 )	2025-04-30 16:04:28 +01:00
Joao Gante	1b222903c3	[tests] Test all cache implementations (#37873 )	2025-04-30 15:37:00 +01:00
Yan Zhao	2c1155519f	Support FlaxPreTrainedModel to load model checkpoint from local subfolder safetensors (#37732 ) Support FlaxPreTrainedModel to load model checkpoint from subfolder in local directory as safetensors format Signed-off-by: Yan Zhao <zhao.y4@northeastern.edu>	2025-04-30 16:13:23 +02:00
Arjuna Sky Kok	5b223bbc8c	update comment in image_processing_base.py to reference image_process… (#37864 ) update comment in image_processing_base.py to reference image_processing_utils_fast	2025-04-30 14:31:29 +01:00
LLinkedlist	0dffcb0967	Fix: reassign in qwen3 moe model (#37848 ) * Fix: reassign in qwen3 moe model Fix: reassign in qwen3 moe model * Remove redundant assignment to self.mlp * make fix-copies * Revert unwanted style change * Revert unwanted style change --------- Co-authored-by: li.ding <int.li.ding@enflame-tech.com> Co-authored-by: Matt <rocketknight1@gmail.com>	2025-04-30 13:49:59 +01:00
Tibor Reiss	6c5d374d56	uniformize kwargs for VisionTextDualEncoder (#34563 ) * Make kwargs uniform for VisionTextDualEncoder * Add bc for flipped args	2025-04-30 14:32:59 +02:00
湛露先生	4fc976779e	Fix qwen2-vl-docs. (#37879 ) Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-04-30 13:32:21 +01:00
Wing Lian	4eb6acc896	make sure lr is not a tensor (#37881 ) * make sure lr is not a tensor * revert change from #37704 * clean up to reduce extra LoC --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>	2025-04-30 14:23:39 +02:00
jiaqiw09	7be92f9a94	fix error for _register_pytree_node in torch2.1.0 and fix bf16 assertion in xpu and npu (#37839 ) * fix error for _register_pytree_node and bf16 assertion * fix format * update xpu available assert function	2025-04-30 14:22:53 +02:00
湛露先生	455c3a33b0	update Clean_up_tokenization_spaces typos. (#37865 ) Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-04-30 13:04:49 +01:00
Lysandre Debut	d538293f62	Transformers cli clean command (#37657 ) * transformers-cli -> transformers * Chat command works with positional argument * update doc references to transformers-cli * doc headers * deepspeed --------- Co-authored-by: Joao Gante <joao@huggingface.co>	2025-04-30 12:15:43 +01:00
Pedro Cuenca	63cd4c76f3	Llama Guard updates (#37872 ) * Unhardcode use_chunked_attention, fix no_rope_layers * Go back to exhaustive list of bools * Conversion and modeling updates * Fix rope * Unhardcode rope * Fix context length * style * Minor updates to conversion * Use StaticCache * Minor simplification * DynamicCache 🤦 * Style * Style	2025-04-30 10:34:43 +02:00
Yao Matrix	34f26e2c3e	enable internvl UTs on XPU (#37779 ) * enable internvl UTs on XPU Signed-off-by: YAO Matrix <matrix.yao@intel.com> * fix style Signed-off-by: YAO Matrix <matrix.yao@intel.com> * fix style per comments Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-30 10:29:40 +02:00
Guang Yang	a57274466f	Allow override inputs to export recipe (#37508 ) Add option to specify dynamic shapes during export Co-authored-by: Guang Yang <guangyang@fb.com>	2025-04-30 10:19:27 +02:00
Matt	481de7204c	Skip is_flaky tests in the CI (#37723 ) * No more red flaky tests in the CI! * Remove the CircleCI logic as well * Revert most changes including is_flaky behaviour * make fixup * Move to a more sensible place * Mark a flaky test that failed on this PR! * correct import * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-04-30 09:52:21 +02:00
Aaditya Ura	5f8d17268c	Update modeling_llama4.py (#37841 ) * Update modeling_llama4.py * Update modeling_llama4.py * do not pass device --------- Co-authored-by: raushan <raushan@huggingface.co>	2025-04-30 00:36:02 +02:00
Kim Juwon	50f8caaa48	🌐 [i18n-KO] Translated `electra.md` to Korean (#36763 ) * docs: ko: electra.md * feat: nmt draft * fix: manual edits * fix: manual edits	2025-04-29 14:03:39 -07:00
regisss	91f3e9422f	Add Intel Gaudi doc (#37855 ) * Add Intel Gaudi doc * Use "TIP" instead of "NOTE" * Address comments from reviews	2025-04-29 13:28:06 -07:00
Pedro Cuenca	c34afa5957	Processor chat template: pass custom kwargs (#37852 )	2025-04-29 21:22:10 +02:00
Yaner	66ad8b2db0	docs: Details for ambiguous channel dimension assignment (#37600 ) * docs: Details for ambiguous channel dimension inference * Update src/transformers/image_utils.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-04-29 08:12:38 -07:00
Mohamed Mekkouri	096f25ae1f	Fix Bitnet tokenizer in pipeline (#37861 ) add tokenizer	2025-04-29 15:35:02 +02:00
Chris	da7ae467c4	Fix cache get item return type hints (#37847 ) F: Fix cache return hints Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2025-04-29 14:23:52 +01:00
Hicham Tala	aa6b79db43	Fix check of unnecessary packages (issue #37626 ) (#37825 ) * Fix check of unnecessary packages (issue #37626) * Reformat using ruff * And a condition to avoid the risk of matching a random object in `import_utils` * Reformat	2025-04-29 14:21:05 +01:00
Matt	517367fe9a	Revert change that breaks on Torch 2.1 (#37531 ) * Revert change that breaks on Torch 2.1 * Add TODO * Trigger tests * Trigger tests	2025-04-29 13:27:09 +01:00
Joao Gante	755b0fa2fe	[tests] reorganize cache tests and clean memory between tests (#37684 )	2025-04-29 12:21:14 +01:00
Joao Gante	3a1acc36ed	[tests] fix flaky pattern in `test_generate_continue_from_past_key_values` (#37724 )	2025-04-29 12:20:42 +01:00
Vladislav Bronzov	4abeb50f6e	Add D-FINE Model into Transformers (#36261 ) * copy the last changes from broken PR * small format * some fixes and refactoring after review * format * add config attr for loss * some fixes and refactoring * fix copies * fix style * add test for d-fine resnet * fix decoder layer prop * fix dummies * format init * remove extra print * refactor modeling, move resnet into separate folder * fix resnet config * change resnet on hgnet_v2, add clamp into decoder * fix init * fix config doc * fix init * fix dummies * fix config docs * fix hgnet_v2 config typo * format modular * add image classification for hgnet, some refactoring * format tests * fix dummies * fix init * fix style * fix init for hgnet v2 * fix index.md, add init rnage for hgnet * fix conversion * add missing attr to encoder * add loss for d-fine, add additional output for rt-detr decoder * tests and docs fixes * fix rt_detr v2 conversion * some fixes for loos and decoder output * some fixes for loss * small fix for converted modeling * add n model config, some todo comments for modular * convert script adjustments and fixes, small refact * remove extra output for rt_detr * make some outputs optionsl, fix conversion * some posr merge fixes * small fix * last field fix * fix not split for hgnet_v2 * disable parallelism test for hgnet_v2 image classification * skip multi gpu for d-fine * adjust after merge init * remove extra comment * fix repo name references * small fixes for tests * Fix checkpoint path * Fix consistency * Fixing docs --------- Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>	2025-04-29 12:17:55 +01:00
Cyril Vallez	4602059aae	[modular] Fix the prefix-based renaming if the old and new model share a common name suffix (#37829 ) * first try * Fix and set examples * style * fix * Update modular_test_detr.py * Update image_processing_new_imgproc_model.py * Update modular_model_converter.py	2025-04-29 10:43:23 +02:00
Henrik Matthiesen	a847d4aa6b	Fast image processor for VitMatte added and bug in slow version fixed (#37616 ) * added fast image processor for VitMatte including updated and new tests, fixed a bug in the slow image processor that processed images incorrectly for input format ChannelDimension.FIRST in which case the trimaps were not added in the correct dimension, this bug was also reflected in the tests through incorretly shaped trimaps being passed * final edits for fast vitmatte image processor and tests * final edits for fast vitmatte image processor and tests --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>	2025-04-28 14:51:50 -04:00
sushmanth reddy	65e940208c	Samhq model addition (#35147 ) * added the configuartion for sam_hq * added the modeelling for sam_hq * added the sam hq mask decoder with hq features * added the code for the samhq * added the code for the samhq * added the code for the samhq * Delete src/transformers/models/sam_hq/modelling_sam_hq.py * added the code for the samhq * added the code for the samhq * added the chnages for the modeelling * added the code for sam hq for image processing * added code for the sam hq model * added the required changes * added the changes * added the key mappings for the sam hq * adding the working code of samhq * added the required files * adding the pt object * added the push to hub account * added the args for the sam maks decoder * added the args for the sam hq vision config * aded the some more documentation * removed the unecessary spaces * all required chnages * removed the image processor * added the required file * added the changes for the checkcopies * added the code for modular file * added the changes for the __init file * added the code for the interm embeds * added the code for sam hq * added the changes for modular file * added the test file * added the changes required * added the changes required * added the code for the * added the cl errors * added the changes * added the required changes * added the some code * added the code for the removing image processor * added the test dimensins * added the code for the removing extra used variables * added the code for modeluar file hf_mlp for a better name * removed abbrevaation in core functionality * removed abbrevaation in core functionality * .contiguous() method is often used to ensure that the tensor is stored in a contiguous block of memory * added the code which is after make fixup * added some test for the intermediate embeddings test * added the code for the torch support in sam hq * added the code for the updated modular file * added the changes for documentations as 
mentioned * removed the heading * add the changes for the code * first mentioned issue resolved * added the changes code to processor * added the easy loading to init file * added the changes to code * added the code to changes * added the code to work * added the code for sam hq * added the code for sam hq * added the code for the point pad value * added the small test for the image embeddings and intermediate embedding * added the code * added the code * added the code for the tests * added the code * added ythe code for the processor file * added the code * added the code * added the code * added the code * added the code * added the code for tests and some checks * added some code * added the code * added the code * added some code * added some code * added the changes for required * added the code * added the code * added the code * added the code * added the code * added the code * added the code * added the code * added the code * added the code * added some changes * added some changes * removed spaces and quality checks * added some code * added some code * added some code * added code quality checks * added the checks for quality checks * addded some code which fixes test_inference_mask_generation_no_point * added code for the test_inference_mask_generation_one_point_one_bb * added code for the test_inference_mask_generation_one_point_one_bb_zero * added code for the test_inference_mask_generation_one_box * added some code in modelling for testing * added some code which sort maks with high score * added some code * added some code * added some code for the move KEYS_TO_MODIFY_MAPPING * added some code for the unsqueeze removal * added some code for the unsqueeze removal * added some code * added some code * add some code * added some code * added some code * added some testign values changed * added changes to code in sam hq for readbility purpose * added pre commit checks * added the fix samvisionmodel for compatibilty * added the changes made on sam by 
cyyever * fixed the tests for samhq * added some the code * added some code related to init file issue during merge conflicts * remobved the merge conflicts * added changes mentioned by aruther and mobap * added changes mentioned by aruther and mobap * solving quality checks * added the changes for input clearly * added the changes * added changes in mask generation file rgearding model inputs and sam hq quargs in processor file * added changes in processor file * added the Setup -> setupclass conversion * added the code mentioned for processor * added changes for the code * added some code * added some code * added some code --------- Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>	2025-04-28 19:07:09 +02:00
Raushan Turganbay	9c5b1319d0	[config] revert #37603 (#37821 ) revert	2025-04-28 16:28:30 +02:00
Marc Sun	9e730689c3	change XLA deprecated api (#37741 ) * deprecated api * fix	2025-04-28 16:27:41 +02:00
Yuan Wu	2933894985	Fix error of HPU TP (#37782 ) * Fix error of HPU TP Signed-off-by: yuanwu <yuan.wu@intel.com> * Add the init distributed for hpu Signed-off-by: yuanwu <yuan.wu@intel.com> * Fix error of make style Signed-off-by: yuanwu <yuan.wu@intel.com> --------- Signed-off-by: yuanwu <yuan.wu@intel.com>	2025-04-28 15:47:16 +02:00

18854 Commits