transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-02 19:21:31 +06:00

Author	SHA1	Message	Date
Arthur	7266aafab7	remove the **flash stuff in favor of noraml kwargs	2025-06-30 17:10:56 +02:00
Arthur	3fb6b710f2	update	2025-06-30 16:49:11 +02:00
Arthur	a74974d989	update	2025-06-30 15:44:17 +02:00
Arthur	e7705c981a	update models based on qwen2	2025-06-30 15:25:03 +02:00
Arthur	113219becd	update modularqwen2	2025-06-30 15:22:39 +02:00
Arthur	3caf7d76a0	fix other models as well!	2025-06-30 14:55:01 +02:00
Arthur	8c66f4d0bb	this fixes more tests	2025-06-30 14:50:34 +02:00
Arthur	ea87eb700b	very small nits	2025-06-30 14:49:01 +02:00
Arthur	4a14287a60	finally	2025-06-30 14:38:25 +02:00
Arthur	124cd82968	fix	2025-06-30 14:36:56 +02:00
Arthur	98739ba418	update	2025-06-30 14:23:04 +02:00
Arthur	fca73ad7ce	update	2025-06-30 12:43:56 +02:00
Arthur	0dc082627c	update	2025-06-30 12:38:58 +02:00
Arthur	cb5da530c0	updates	2025-06-30 12:37:16 +02:00
Arthur	c9bb39ef87	update	2025-06-30 12:32:03 +02:00
Arthur	a7e0ce238e	update other models	2025-06-30 12:31:07 +02:00
Arthur	98f402cd5d	update	2025-06-30 12:28:24 +02:00
Arthur	63df15bb24	update	2025-06-30 12:26:51 +02:00
Arthur	96aabd77c7	use transformers kwargs instead	2025-06-30 12:23:39 +02:00
Arthur	e437edd7fc	update other modelings	2025-06-30 12:18:09 +02:00
Arthur	0f1d7e0a6f	update generic, fix to use config value	2025-06-30 12:11:13 +02:00
Arthur	eb6747bca9	nits and fixes	2025-06-30 12:03:41 +02:00
Arthur	abf9d39d12	put this on the pretrained model instead	2025-06-30 11:44:42 +02:00
Arthur	7f113b43cc	also add the changes needed to modeling utils	2025-06-30 11:39:29 +02:00
Arthur	37b4ef022e	update other models as well just making fix-copies	2025-06-30 11:35:37 +02:00
Arthur	7433c44376	just update 2 files	2025-06-30 11:25:07 +02:00
Yih-Dar	ccf2ca162e	skip some `test_sdpa_can_dispatch_on_flash` (#39092 ) * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-27 23:08:14 +02:00
st81	a11f692895	Fixes the failing test `test_is_split_into_words` in `test_pipelines_token_classification.py` (#39079 ) * Fix test pipelines token classification for is_split_into_words * Fix incorrect import format	2025-06-27 19:25:32 +01:00
Sandeep Yadav	18143c76bf	Sandeepyadav1478/2025 06 19 deberta v2 model card update (#38895 ) * [docs]: update deberta-v2.md model card * chore: req updates * chore: address code review feedback and update docs * chore: review feedback and updates * chore: model selection updates * chores: quantizations review updates	2025-06-27 10:35:30 -07:00
Steven Liu	02a769b058	[fix] Add FastSpeech2ConformerWithHifiGan (#38207 ) * add to mapping * oops * oops * add to config_mapping_names * revert * fix? * config-mapping-names * fix? * fix?	2025-06-27 09:38:21 -07:00
Benjamin Bossan	c2dc72bb5f	TST Fix PEFT integration test bitsandbytes config (#39082 ) TST Fix PEFT integration test bitsandbytes config The PEFT integration tests still used load_in_{4,8}_bit, which is deprecated, moving to properly setting BitsAndBytesConfig. For 4bit, also ensure that nf4 is being used to prevent > RuntimeError: quant_type must be nf4 on CPU, got fp4	2025-06-27 18:33:11 +02:00
Matej Sirovatka	c8064bea9a	Fix: unprotected import of tp plugin (#39083 )	2025-06-27 17:28:05 +02:00
farrosalferro	dd7dc4a4a2	Add Fast Image Processor for Chameleon (#37140 ) * Add Fast Image Processor for Chameleon * add warning to resize and move blend_rgba to convert_to_rgb * Remove unrelated files * Update image_processing_chameleon_fast to use auto_docstring * fix equivalence test --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>	2025-06-27 15:26:57 +00:00
Yih-Dar	6d773fc3bc	fix `dots1` tests (#39088 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-27 16:54:11 +02:00
Tijana Vukovic	c8764ab935	guard torch distributed check (#39057 ) * guard torch distributed check * Update src/transformers/pipelines/base.py --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2025-06-27 14:49:47 +00:00
MinJu-Ha	49d9fd49bd	Add Fast Image Processor for mobileViT (#37143 ) * Add image_processing_mobilevit_fast.py * Fix copies * update _preprocess for channel_flip * Update for batched image processing * Resolve merge conflicts with main * Fix import order and remove trailing whitespace (ruff clean-up) * Fix copy inconsistencies * Add NotImplementedError for post_process_semantic_segmentation to satisfy repo checks * Add auto_docstring * Adjust style * Update docs/source/en/model_doc/mobilevit.md Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Update src/transformers/models/mobilevit/image_processing_mobilevit_fast.py Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Update src/transformers/models/mobilevit/image_processing_mobilevit_fast.py Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Delete not used function * test: add missing tests for and * Add post_process_semantic_segmentation to mobilevit_fast.py * Add preprocess function to image_processing_mobilebit_fast.py * ruff check for formatting * fix: modify preprocess method to handle BatchFeature correctly * Remove logic for default value assignment Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Remove normalization adn RGB conversion logic not used in slow processor Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Simplify return_tensors logic using one-liner conditional expression Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Remove unused normalization and format parameters Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * add *kwargs and remove default values in _preprocess add slow_fast equivalence tests for segmentation * style: autoformat code with ruff * Fix slow_fast equivalence test * merge + remove skipped test --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>	2025-06-27 14:40:24 +00:00
Nahieli	4336ecd1ea	add fast image processor nougat (#37661 ) * add fast image processor nougat * test fixes * docstring white space * last fixes * docstring_type * tolerance unit test * fix tolerance * fix rtol * remove traling white space * remove white space * note for tolerance unit test * fix tests * remove print --------- Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co> Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>	2025-06-27 14:39:43 +00:00
Benjamin Bossan	0c35280e58	TST PEFT integration tests with pipeline generate (#39086 ) Some PEFT integration tests involving text generation pipelines were failing since #38129 because the base model is too small to generate longer sequences. Setting max_new_tokens fixes this.	2025-06-27 15:58:10 +02:00
JINO ROHIT	993665a5ff	fixed typo for docstring in prepare_inputs method (#39071 )	2025-06-27 13:57:56 +00:00
Yih-Dar	839893c86b	fix `mistral3` tests (#38989 ) * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-27 15:44:10 +02:00
eustlb	2b85b6ce19	[Whisper] 🚨 Fix pipeline word timestamp: timestamp token is end of token time !!! (#36632 ) * timestamp token is end of token time !!! * ensure correct alignment between tokens and timestamp tokens * ignore input tokens for DTW computation * use num_frames to avoid token timestamp hallucinations * token timestamps test updates ! * num_frames: deprecate and use attention_mask instead * avoid breaking change * fix the pipeline usage for chunk approach * make style * better logging * better logging * make style * update tests with correct values	2025-06-27 12:51:43 +00:00
eustlb	9c8d3a70b8	Pipeline: fix unnecessary warnings (#35753 ) * return attention mask * use correct model input name * fix * make	2025-06-27 14:32:03 +02:00
Yaswanth Gali	1750c518dd	✨ Add EoMT Model || 🚨 Fix Mask2Former loss calculation (#37610 ) * Initial Commit * up * More changes * up * Only mask_logits mismatch * close enough logits debug later * fixes * format * Add dummy loss * Close enough processing for semantic seg * nit * Added panoptic postprocessor * refactor * refactor * finally fixed panoptic postprocessor * temp update * Refactor ForUniversalSegmentation class * nits and config update * Few fixes and inference matches * change mapping * Added training support but loss slightly off 🥲 * Loss is matching 😀 * update * Initial tests skelton * changes * tests update * more modular * initial tests * updates * better docstrings * changes * proc tests passing :) * Image processor update * tiny change * QOL changes * Update test w.r.t latest attn refactor * repo-consistency fixes * up * Image proc fix and integration tests :) * docs update * integration tests * fix * docs update 🥰 * minor fix * Happy CI * fix * obvious refactoring * refactoring w.r.t review * Add fask image proc skelton * Fast Image proc and cleanups * Use more modular * tests update * Add more tests * Nit * QOL updates * change init_weights to torch default * add eager func coz of make style * up * changes * typo fix * Updates * More deterministic tests * More modular * go more modular 🚀 * up * dump * add supprot for giant ckpts * overhaul * modular * refactor * instace seg is ready * cleanup * forgot this * docs cleanup * minor changes * EoMT - > Eomt * Happy CI * remove redundant comment * Change model references * final change * check annealing per block * My other PR changes 😂 --------- Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>	2025-06-27 14:18:18 +02:00
Yao Matrix	0106a50a6b	fix a bunch of XPU UT failures on stock PyTorch 2.7 and 2.8 (#39069 ) * fix a bunch of XPU UT failures on stock PyTorch 2.7 and 2.8 Signed-off-by: YAO Matrix <matrix.yao@intel.com> * qwen3 Signed-off-by: YAO Matrix <matrix.yao@intel.com> * quanto Signed-off-by: YAO Matrix <matrix.yao@intel.com> * models Signed-off-by: YAO Matrix <matrix.yao@intel.com> * fix style Signed-off-by: YAO Matrix <matrix.yao@intel.com> * idefics2 Signed-off-by: YAO Matrix <matrix.yao@intel.com> --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com>	2025-06-27 14:01:53 +02:00
Mohamed Mekkouri	cb17103bd5	Uninstallling Flash attention from quantization docker (#39078 ) * update * revert	2025-06-27 13:51:46 +02:00
BUI Van Tuan	371c471113	Fix initialization of OneFormer (#38901 ) * fix initialization of OneFormer * remove redundant initializations * remove redundant initializations * remove redundant initializations * keep BC	2025-06-27 12:39:37 +02:00
Yih-Dar	540a10848c	fix `Gemma3nProcessorTest` (#39068 ) * fix * fix * oups forgot style --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>	2025-06-27 12:28:10 +02:00
Yaswanth Gali	0d66ef7792	Cleanup Attention class for Siglip and dependent models (#39040 ) * cleanup attention class * More models * more models * Changes * make style * Should fix CI * This should work 🙏	2025-06-27 12:14:09 +02:00
eustlb	1ccc73dee9	[Whisper] fix shape mismatch in tests (#39074 ) fix shape mismatch	2025-06-27 09:27:42 +00:00
Steven Liu	a52478253b	[docs] Tensor parallelism (#38241 ) * updates * feedback * badges * fix? * fix? * fix? * fix?	2025-06-26 14:40:45 -07:00

19494 Commits