transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Vasqu	8fa32ca900	xmod + cache position fixes	2025-07-02 12:57:15 +02:00
Vasqu	baaa3ecccc	tmp disable	2025-07-02 10:54:48 +02:00
Vasqu	52d2052b4e	modular data2vec text	2025-07-02 10:53:12 +02:00
Vasqu	ad3ffe55a9	data2vectext, making it modular tomorrow Some checks are pending Secret Leaks / trufflehog (push) Waiting to run Details	2025-07-01 18:41:29 +02:00
Vasqu	dd7aeca424	albert	2025-07-01 17:55:05 +02:00
Vasqu	11de15bda4	modular roberta	2025-07-01 16:47:04 +02:00
Vasqu	5120ca6c8e	fix test Some checks are pending Secret Leaks / trufflehog (push) Waiting to run Details	2025-07-01 15:58:43 +02:00
Vasqu	38e8de3104	fix encoder decoder	2025-07-01 14:52:46 +02:00
Vasqu	306a5c2a5c	attention split, simplify args and kwargs, better typing	2025-07-01 12:50:45 +02:00
Vasqu	b82b47e5d5	Merge branch 'main' into vas-bert-attn-refactors	2025-07-01 11:32:01 +02:00
Raushan Turganbay	e435574721	🚨 Don't use cache in non-generative models (#38751 ) * deprecate for 1 version * style * fix some tests * fix esm * skip for now, GC requires positional args but we have keyword args * remove transpose for scores in modified models only * skip fx trace tests	2025-07-01 09:08:21 +00:00
Cyril Vallez	dbc98328da	Several fixes for Gemma3n (#39135 ) * remove the skips * fix the epsilon to a small value (does not make sense otherwise) * safeguard * overload test_eager_matches_sdpa * Update test_modeling_common.py * skip appropriate tests * correct no_split_layer * fix all devices issue * fix backward * fix	2025-07-01 10:34:53 +02:00
BUI Van Tuan	d53518c5f2	Fix key mapping for VLMs (#39029 ) * fix key mapping for VLMs * use __mro__ instead * update key mapping in save_pretrained	2025-07-01 09:47:53 +02:00
eustlb	3457e8e73e	[Whisper] update token timestamps tests (#39126 ) Some checks are pending Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run Details Build documentation / build (push) Waiting to run Details Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run Details Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run Details Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions Details Secret Leaks / trufflehog (push) Waiting to run Details Update Transformers metadata / build_and_package (push) Waiting to run Details * fixes * update comment * update for A10 * all a10 * all a10 * all a10 * all a10 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-30 21:55:36 +02:00
Drew Ross	fe35eca7bd	Update BigBirdPegasus model card (#39104 ) * Update igbird_pegasus.md * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-06-30 10:42:56 -07:00
Vasqu	d1c76901b4	fixup sdpa remains Some checks are pending Secret Leaks / trufflehog (push) Waiting to run Details	2025-06-30 18:00:02 +02:00
Yao Matrix	29a3f5ed8c	switch default xpu tp backend to pytorch built-in XCCL from pytorch 2.8 (#39024 ) * switch default xpu tp backend to pytorch built-in XCCL from pytorch 2.8 Signed-off-by: YAO Matrix <matrix.yao@intel.com> * Update docs/source/en/perf_infer_gpu_multi.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update perf_infer_gpu_multi.md * Update perf_infer_gpu_multi.md * Update perf_infer_gpu_multi.md --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-06-30 08:54:05 -07:00
Vladimir Gutuev	9e0c865b8b	docs: correct two typos in awesome-transformers.md (#39102 ) * docs(awesome-projects): fix typo “Itt leverages” → “It leverages” (#39101) closes #39101 * docs(awesome-projects): fix grammar “We provides” → “We provide” (#39101) closes #39101	2025-06-30 08:53:43 -07:00
Vasqu	6a7357de4a	roberta	2025-06-30 17:53:27 +02:00
jiqing-feng	03db2700ab	Enable XPU doc (#38929 ) * fix example with dataset Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update torchao doc Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update torchao doc Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix device type Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * revert torchao change Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix torchao doc Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * revert torchao change Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update xpu torchao doc Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update chat_templating_multimodal.md Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * use full name for int8 Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * revert int8 title Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-06-30 07:56:55 -07:00
Vasqu	775573e07f	Merge branch 'main' into vas-bert-attn-refactors	2025-06-30 15:57:51 +02:00
Vasqu	41ddb5726d	fix flash attention tests, flex attention requires torch 2.7.x to work with multiple classes (as recompile strats force a size call which is wrongly interpreted before)	2025-06-30 15:56:30 +02:00
Joao Gante	ea0ea392e5	Fix chat (#39128 )	2025-06-30 13:47:48 +00:00
Lysandre Debut	ed36f8490e	Licenses (#39127 ) Some checks are pending Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run Details Build documentation / build (push) Waiting to run Details New model PR merged notification / Notify new model (push) Waiting to run Details Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run Details Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run Details Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions Details Secret Leaks / trufflehog (push) Waiting to run Details Update Transformers metadata / build_and_package (push) Waiting to run Details * Licenses * Licenses	2025-06-30 15:25:36 +02:00
Lysandre Debut	e8f90b5397	Split `transformers chat` and `transformers serve` (#38443 ) * Next token * Split chat and serve * Support both generation methods * Style * Generation Config * temp * temp * Finalize serving.py Co-authored-by: =?UTF-8?q?c=C3=A9lina?= <hanouticelina@gmail.com> * Finalize chat.py * Update src/transformers/commands/serving.py Co-authored-by: célina <hanouticelina@gmail.com> * Lucain's comments Co-authored-by: Lucain <lucain@huggingface.co> * Update * Last comments on PR * Better error handling * Better error handling * CI errors * CI errors * Add tests * Fix tests * Fix tests * [chat] Split chat/serve (built on top of lysandre's PR) (#39031) * Next token * Split chat and serve * Support both generation methods * Style * Generation Config * temp * temp * Finalize serving.py Co-authored-by: =?UTF-8?q?c=C3=A9lina?= <hanouticelina@gmail.com> * Finalize chat.py * Update src/transformers/commands/serving.py Co-authored-by: célina <hanouticelina@gmail.com> * Lucain's comments Co-authored-by: Lucain <lucain@huggingface.co> * Update * Last comments on PR * Better error handling * Better error handling * CI errors * CI errors * Add tests * Fix tests * Fix tests * streaming tool call * abstract tool state; set tool start as eos * todos * server working on models without tools * rm chat's deprecated flags * chat defaults * kv cache persists across calls * add server docs * link * Update src/transformers/commands/serving.py * Apply suggestions from code review * i love merge conflicts * solve multi turn with tiny-agents * On the fly switching of the models * Remove required positional arg --------- Co-authored-by: Lysandre <hi@lysand.re> Co-authored-by: =?UTF-8?q?c=C3=A9lina?= <hanouticelina@gmail.com> Co-authored-by: Lucain <lucain@huggingface.co> * Protect names * Fix tests --------- Co-authored-by: =?UTF-8?q?c=C3=A9lina?= <hanouticelina@gmail.com> Co-authored-by: Lucain <lucain@huggingface.co> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2025-06-30 15:10:53 +02:00
Yih-Dar	539c6c2fa8	All CI jobs with A10 (#39119 ) all a10 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-30 14:23:27 +02:00
Ryan Mullins	ed9f252608	docs: Gemma 3n audio encoder (#39087 ) Updating Gemma 3n docs and docstrings to clarify the relationship between the newly trained audio encoder used in Gemma 3n and the USM model from the original paper.	2025-06-30 14:10:51 +02:00
Vasqu	82633afe2b	style Some checks are pending Secret Leaks / trufflehog (push) Waiting to run Details	2025-06-30 13:04:45 +02:00
Vasqu	13f5b49fb3	this time	2025-06-30 12:55:09 +02:00
Yuxuan Zhang	4a79bf947d	Fix some bug for finetune and batch infer For GLM-4.1V (#39090 ) * update * 1	2025-06-30 12:16:22 +02:00
Vasqu	f46d6a48eb	?	2025-06-30 11:56:16 +02:00
Yao Matrix	2100ee6545	fix UT failures on XPU w/ stock PyTorch 2.7 & 2.8 (#39116 ) * fix UT failures on XPU w/ stock PyTorch 2.7 & 2.8 Signed-off-by: YAO Matrix <matrix.yao@intel.com> * zamba2 Signed-off-by: YAO Matrix <matrix.yao@intel.com> * xx Signed-off-by: YAO Matrix <matrix.yao@intel.com> * internvl Signed-off-by: YAO Matrix <matrix.yao@intel.com> * tp cases Signed-off-by: YAO Matrix <matrix.yao@intel.com> --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com>	2025-06-30 11:49:03 +02:00
Vasqu	01227640ae	flex attn, static cache support, round of fixes	2025-06-30 11:35:16 +02:00
Yih-Dar	ccf2ca162e	skip some `test_sdpa_can_dispatch_on_flash` (#39092 ) Some checks failed Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Has been cancelled Details Build documentation / build (push) Has been cancelled Details Slow tests on important models (on Push - A10) / Get all modified files (push) Has been cancelled Details Self-hosted runner (push-caller) / Check if setup was changed (push) Has been cancelled Details Secret Leaks / trufflehog (push) Has been cancelled Details Update Transformers metadata / build_and_package (push) Has been cancelled Details Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Has been cancelled Details Self-hosted runner (push-caller) / build-docker-containers (push) Has been cancelled Details Self-hosted runner (push-caller) / Trigger Push CI (push) Has been cancelled Details * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-27 23:08:14 +02:00
Vasqu	e376e3cff9	simplify masks, fix tests for gen Some checks failed Secret Leaks / trufflehog (push) Has been cancelled Details	2025-06-27 21:17:04 +02:00
st81	a11f692895	Fixes the failing test `test_is_split_into_words` in `test_pipelines_token_classification.py` (#39079 ) Some checks failed Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run Details Build documentation / build (push) Waiting to run Details Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run Details Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run Details Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions Details Secret Leaks / trufflehog (push) Waiting to run Details Update Transformers metadata / build_and_package (push) Waiting to run Details New model PR merged notification / Notify new model (push) Has been cancelled Details * Fix test pipelines token classification for is_split_into_words * Fix incorrect import format	2025-06-27 19:25:32 +01:00
Sandeep Yadav	18143c76bf	Sandeepyadav1478/2025 06 19 deberta v2 model card update (#38895 ) * [docs]: update deberta-v2.md model card * chore: req updates * chore: address code review feedback and update docs * chore: review feedback and updates * chore: model selection updates * chores: quantizations review updates	2025-06-27 10:35:30 -07:00
Vasqu	3e591058f4	more cache fixes, new causal API	2025-06-27 19:08:23 +02:00
Steven Liu	02a769b058	[fix] Add FastSpeech2ConformerWithHifiGan (#38207 ) * add to mapping * oops * oops * add to config_mapping_names * revert * fix? * config-mapping-names * fix? * fix?	2025-06-27 09:38:21 -07:00
Benjamin Bossan	c2dc72bb5f	TST Fix PEFT integration test bitsandbytes config (#39082 ) TST Fix PEFT integration test bitsandbytes config The PEFT integration tests still used load_in_{4,8}_bit, which is deprecated, moving to properly setting BitsAndBytesConfig. For 4bit, also ensure that nf4 is being used to prevent > RuntimeError: quant_type must be nf4 on CPU, got fp4	2025-06-27 18:33:11 +02:00
Matej Sirovatka	c8064bea9a	Fix: unprotected import of tp plugin (#39083 )	2025-06-27 17:28:05 +02:00
farrosalferro	dd7dc4a4a2	Add Fast Image Processor for Chameleon (#37140 ) * Add Fast Image Processor for Chameleon * add warning to resize and move blend_rgba to convert_to_rgb * Remove unrelated files * Update image_processing_chameleon_fast to use auto_docstring * fix equivalence test --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>	2025-06-27 15:26:57 +00:00
Yih-Dar	6d773fc3bc	fix `dots1` tests (#39088 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-27 16:54:11 +02:00
Tijana Vukovic	c8764ab935	guard torch distributed check (#39057 ) * guard torch distributed check * Update src/transformers/pipelines/base.py --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2025-06-27 14:49:47 +00:00
MinJu-Ha	49d9fd49bd	Add Fast Image Processor for mobileViT (#37143 ) * Add image_processing_mobilevit_fast.py * Fix copies * update _preprocess for channel_flip * Update for batched image processing * Resolve merge conflicts with main * Fix import order and remove trailing whitespace (ruff clean-up) * Fix copy inconsistencies * Add NotImplementedError for post_process_semantic_segmentation to satisfy repo checks * Add auto_docstring * Adjust style * Update docs/source/en/model_doc/mobilevit.md Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Update src/transformers/models/mobilevit/image_processing_mobilevit_fast.py Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Update src/transformers/models/mobilevit/image_processing_mobilevit_fast.py Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Delete not used function * test: add missing tests for and * Add post_process_semantic_segmentation to mobilevit_fast.py * Add preprocess function to image_processing_mobilebit_fast.py * ruff check for formatting * fix: modify preprocess method to handle BatchFeature correctly * Remove logic for default value assignment Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Remove normalization adn RGB conversion logic not used in slow processor Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Simplify return_tensors logic using one-liner conditional expression Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * Remove unused normalization and format parameters Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * add *kwargs and remove default values in _preprocess add slow_fast equivalence tests for segmentation * style: autoformat code with ruff * Fix slow_fast equivalence test * merge + remove skipped test --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>	2025-06-27 14:40:24 +00:00
Nahieli	4336ecd1ea	add fast image processor nougat (#37661 ) * add fast image processor nougat * test fixes * docstring white space * last fixes * docstring_type * tolerance unit test * fix tolerance * fix rtol * remove traling white space * remove white space * note for tolerance unit test * fix tests * remove print --------- Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co> Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>	2025-06-27 14:39:43 +00:00
Benjamin Bossan	0c35280e58	TST PEFT integration tests with pipeline generate (#39086 ) Some PEFT integration tests involving text generation pipelines were failing since #38129 because the base model is too small to generate longer sequences. Setting max_new_tokens fixes this.	2025-06-27 15:58:10 +02:00
JINO ROHIT	993665a5ff	fixed typo for docstring in prepare_inputs method (#39071 )	2025-06-27 13:57:56 +00:00
Vasqu	1eaca54b02	cache support	2025-06-27 15:49:08 +02:00
Yih-Dar	839893c86b	fix `mistral3` tests (#38989 ) * fix * fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-27 15:44:10 +02:00

1 2 3 4 5 ...

19510 Commits