Arthur
01d4da8510
support cross attention edge case
2025-07-01 10:56:06 +02:00
Cyril Vallez
dbc98328da
Several fixes for Gemma3n ( #39135 )
...
* remove the skips
* fix the epsilon to a small value (does not make sense otherwise)
* safeguard
* overload test_eager_matches_sdpa
* Update test_modeling_common.py
* skip appropriate tests
* correct no_split_layer
* fix all devices issue
* fix backward
* fix
2025-07-01 10:34:53 +02:00
BUI Van Tuan
d53518c5f2
Fix key mapping for VLMs ( #39029 )
...
* fix key mapping for VLMs
* use __mro__ instead
* update key mapping in save_pretrained
2025-07-01 09:47:53 +02:00
Arthur
8c96926f60
Merge branch 'main' of github.com:huggingface/transformers into clean-llamas
2025-07-01 08:20:39 +02:00
eustlb
3457e8e73e
[Whisper] update token timestamps tests ( #39126 )
...
* fixes
* update comment
* update for A10
* all a10
* all a10
* all a10
* all a10
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-06-30 21:55:36 +02:00
Drew Ross
fe35eca7bd
Update BigBirdPegasus model card ( #39104 )
...
* Update bigbird_pegasus.md
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2025-06-30 10:42:56 -07:00
Arthur
063e510dc8
propagate
2025-06-30 18:02:53 +02:00
Arthur
1303470aa4
remove output attentions
2025-06-30 18:01:58 +02:00
Arthur
e63ef640ea
propagate gemma?
2025-06-30 18:01:02 +02:00
Arthur
c7d195feee
update
2025-06-30 17:58:35 +02:00
Yao Matrix
29a3f5ed8c
switch default xpu tp backend to pytorch built-in XCCL from pytorch 2.8 ( #39024 )
...
* switch default xpu tp backend to pytorch built-in XCCL from pytorch 2.8
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* Update docs/source/en/perf_infer_gpu_multi.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update perf_infer_gpu_multi.md
* Update perf_infer_gpu_multi.md
* Update perf_infer_gpu_multi.md
---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2025-06-30 08:54:05 -07:00
Vladimir Gutuev
9e0c865b8b
docs: correct two typos in awesome-transformers.md ( #39102 )
...
* docs(awesome-projects): fix typo “Itt leverages” → “It leverages” (#39101 )
closes #39101
* docs(awesome-projects): fix grammar “We provides” → “We provide” (#39101 )
closes #39101
2025-06-30 08:53:43 -07:00
Arthur
7266aafab7
remove the **flash stuff in favor of normal kwargs
2025-06-30 17:10:56 +02:00
jiqing-feng
03db2700ab
Enable XPU doc ( #38929 )
...
* fix example with dataset
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update torchao doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update torchao doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix device type
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* revert torchao change
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix torchao doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* revert torchao change
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update xpu torchao doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update chat_templating_multimodal.md
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* use full name for int8
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* revert int8 title
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
2025-06-30 07:56:55 -07:00
Arthur
3fb6b710f2
update
2025-06-30 16:49:11 +02:00
Joao Gante
ea0ea392e5
Fix chat ( #39128 )
2025-06-30 13:47:48 +00:00
Arthur
a74974d989
update
2025-06-30 15:44:17 +02:00
Lysandre Debut
ed36f8490e
Licenses ( #39127 )
...
* Licenses
* Licenses
2025-06-30 15:25:36 +02:00
Arthur
e7705c981a
update models based on qwen2
2025-06-30 15:25:03 +02:00
Arthur
113219becd
update modularqwen2
2025-06-30 15:22:39 +02:00
Lysandre Debut
e8f90b5397
Split transformers chat and transformers serve ( #38443 )
...
* Next token
* Split chat and serve
* Support both generation methods
* Style
* Generation Config
* temp
* temp
* Finalize serving.py
Co-authored-by: célina <hanouticelina@gmail.com>
* Finalize chat.py
* Update src/transformers/commands/serving.py
Co-authored-by: célina <hanouticelina@gmail.com>
* Lucain's comments
Co-authored-by: Lucain <lucain@huggingface.co>
* Update
* Last comments on PR
* Better error handling
* Better error handling
* CI errors
* CI errors
* Add tests
* Fix tests
* Fix tests
* [chat] Split chat/serve (built on top of lysandre's PR) (#39031 )
* Next token
* Split chat and serve
* Support both generation methods
* Style
* Generation Config
* temp
* temp
* Finalize serving.py
Co-authored-by: célina <hanouticelina@gmail.com>
* Finalize chat.py
* Update src/transformers/commands/serving.py
Co-authored-by: célina <hanouticelina@gmail.com>
* Lucain's comments
Co-authored-by: Lucain <lucain@huggingface.co>
* Update
* Last comments on PR
* Better error handling
* Better error handling
* CI errors
* CI errors
* Add tests
* Fix tests
* Fix tests
* streaming tool call
* abstract tool state; set tool start as eos
* todos
* server working on models without tools
* rm chat's deprecated flags
* chat defaults
* kv cache persists across calls
* add server docs
* link
* Update src/transformers/commands/serving.py
* Apply suggestions from code review
* i love merge conflicts
* solve multi turn with tiny-agents
* On the fly switching of the models
* Remove required positional arg
---------
Co-authored-by: Lysandre <hi@lysand.re>
Co-authored-by: célina <hanouticelina@gmail.com>
Co-authored-by: Lucain <lucain@huggingface.co>
* Protect names
* Fix tests
---------
Co-authored-by: célina <hanouticelina@gmail.com>
Co-authored-by: Lucain <lucain@huggingface.co>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2025-06-30 15:10:53 +02:00
Arthur
3caf7d76a0
fix other models as well!
2025-06-30 14:55:01 +02:00
Arthur
8c66f4d0bb
this fixes more tests
2025-06-30 14:50:34 +02:00
Arthur
ea87eb700b
very small nits
2025-06-30 14:49:01 +02:00
Arthur
4a14287a60
finally
2025-06-30 14:38:25 +02:00
Arthur
124cd82968
fix
2025-06-30 14:36:56 +02:00
Yih-Dar
539c6c2fa8
All CI jobs with A10 ( #39119 )
...
all a10
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-06-30 14:23:27 +02:00
Arthur
98739ba418
update
2025-06-30 14:23:04 +02:00
Ryan Mullins
ed9f252608
docs: Gemma 3n audio encoder ( #39087 )
...
Updating Gemma 3n docs and docstrings to clarify the relationship
between the newly trained audio encoder used in Gemma 3n and the USM
model from the original paper.
2025-06-30 14:10:51 +02:00
Arthur
fca73ad7ce
update
2025-06-30 12:43:56 +02:00
Arthur
0dc082627c
update
2025-06-30 12:38:58 +02:00
Arthur
cb5da530c0
updates
2025-06-30 12:37:16 +02:00
Arthur
c9bb39ef87
update
2025-06-30 12:32:03 +02:00
Arthur
a7e0ce238e
update other models
2025-06-30 12:31:07 +02:00
Arthur
98f402cd5d
update
2025-06-30 12:28:24 +02:00
Arthur
63df15bb24
update
2025-06-30 12:26:51 +02:00
Arthur
96aabd77c7
use transformers kwargs instead
2025-06-30 12:23:39 +02:00
Arthur
e437edd7fc
update other modelings
2025-06-30 12:18:09 +02:00
Yuxuan Zhang
4a79bf947d
Fix some bugs for finetune and batch infer for GLM-4.1V ( #39090 )
...
* update
* 1
2025-06-30 12:16:22 +02:00
Arthur
0f1d7e0a6f
update generic, fix to use config value
2025-06-30 12:11:13 +02:00
Arthur
eb6747bca9
nits and fixes
2025-06-30 12:03:41 +02:00
Yao Matrix
2100ee6545
fix UT failures on XPU w/ stock PyTorch 2.7 & 2.8 ( #39116 )
...
* fix UT failures on XPU w/ stock PyTorch 2.7 & 2.8
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* zamba2
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* xx
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* internvl
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* tp cases
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
2025-06-30 11:49:03 +02:00
Arthur
abf9d39d12
put this on the pretrained model instead
2025-06-30 11:44:42 +02:00
Arthur
7f113b43cc
also add the changes needed to modeling utils
2025-06-30 11:39:29 +02:00
Arthur
37b4ef022e
update other models as well just making fix-copies
2025-06-30 11:35:37 +02:00
Arthur
7433c44376
just update 2 files
2025-06-30 11:25:07 +02:00
Yih-Dar
ccf2ca162e
skip some test_sdpa_can_dispatch_on_flash ( #39092 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-06-27 23:08:14 +02:00
st81
a11f692895
Fixes the failing test test_is_split_into_words in test_pipelines_token_classification.py ( #39079 )
...
* Fix test pipelines token classification for is_split_into_words
* Fix incorrect import format
2025-06-27 19:25:32 +01:00
Sandeep Yadav
18143c76bf
Sandeepyadav1478/2025 06 19 deberta v2 model card update ( #38895 )
...
* [docs]: update deberta-v2.md model card
* chore: req updates
* chore: address code review feedback and update docs
* chore: review feedback and updates
* chore: model selection updates
* chores: quantizations review updates
2025-06-27 10:35:30 -07:00
Steven Liu
02a769b058
[fix] Add FastSpeech2ConformerWithHifiGan ( #38207 )
...
* add to mapping
* oops
* oops
* add to config_mapping_names
* revert
* fix?
* config-mapping-names
* fix?
* fix?
2025-06-27 09:38:21 -07:00