ydshieh
ff9034ffda
fix
2025-07-02 13:35:46 +02:00
ydshieh
9bb036d736
fix
2025-07-02 13:27:02 +02:00
ydshieh
7cd5f16822
fix
2025-07-02 12:56:00 +02:00
ydshieh
41de695fa3
fix
2025-07-02 12:41:37 +02:00
ydshieh
571aa68422
fix
2025-07-02 12:30:30 +02:00
ydshieh
ca688e7449
fix
2025-07-02 12:25:43 +02:00
ydshieh
50d48aaa8a
fix
2025-07-02 12:20:33 +02:00
ydshieh
1c407778e2
fix
2025-07-02 12:19:01 +02:00
ydshieh
a92786d77a
fix
2025-07-02 12:13:30 +02:00
ydshieh
1efce2dbf8
fix
2025-07-02 12:08:22 +02:00
ydshieh
fce61367b5
fix
2025-07-02 12:05:30 +02:00
ydshieh
e23c242848
fix
2025-07-02 11:50:31 +02:00
ydshieh
2ec3fdcf2a
fix
2025-07-02 11:22:48 +02:00
ydshieh
39d61f8c8f
[skip ci]
2025-07-02 08:39:33 +02:00
ydshieh
35bd19eda8
empty
2025-07-02 08:39:22 +02:00
ydshieh
53c409ed49
fix
2025-07-02 08:36:01 +02:00
ydshieh
15a413576e
fix
2025-07-02 08:30:19 +02:00
ydshieh
17d4b80e3e
fix
2025-07-02 08:21:11 +02:00
ydshieh
ab8726bdd3
fix
2025-07-02 08:15:20 +02:00
ydshieh
7e1913820c
fix
2025-07-02 08:11:17 +02:00
ydshieh
b58800f83f
fix
2025-07-02 08:05:45 +02:00
ydshieh
2e63d0b1f4
fix
2025-07-02 08:02:17 +02:00
ydshieh
bec9d4fbab
fix
2025-07-02 07:53:19 +02:00
ydshieh
8ebb6de590
fix
2025-07-02 07:47:19 +02:00
ydshieh
ca5410f3e8
fix
2025-07-02 07:42:13 +02:00
Yih-Dar
4c1715b610
Update expected values (after switching to A10) ( #39157 )
...
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* empty
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-01 20:54:31 +02:00
Yih-Dar
ab59cc27fe
Suggest jobs to use in run-slow ( #39100 )
...
* pr
* pr
* pr
* pr
* pr
* pr
* pr
* pr
* pr
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-01 20:19:06 +02:00
jiqing-feng
db2f535443
update bnb ground truth ( #39117 )
...
* update bnb resulte
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* set seed to avoid sampling different results
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix int8 tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* add comments
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
2025-07-01 20:06:37 +02:00
ybkurt
260846efad
fix: remove undefined variable ( #39146 )
2025-07-01 19:10:29 +02:00
rasmi
cdfe49a4d0
Change @lru_cache() to @lru_cache to match styles from #38883. ( #39093 )
...
Match styles in #38883
2025-07-01 18:29:16 +02:00
DavidS2106
f46798193e
Fix: Ensure wandb logs config in offline mode ( #38992 )
...
* Fix: Ensure wandb logs config in offline mode
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
2025-07-01 16:17:58 +00:00
Yih-Dar
fe838d6631
Fix missing fsdp & trainer jobs in daily CI ( #39153 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-01 18:10:30 +02:00
StevenBucaille
1283877571
[superglue] fix wrong concatenation which made batching results wrong ( #38850 )
2025-07-01 12:14:44 +00:00
Raushan Turganbay
f8b88866f5
[VLMs] support passing embeds along with pixels ( #38467 )
...
* VLMs can work with embeds now
* update more models
* fix tests
* fix copies
* fixup
* fix
* style
* unskip tests
* fix copies
* fix tests
* style
* omni modality models
* qwen models had extra indentation
* fix some other tests
* fix copies
* fix test last time
* unrelated changes revert
* we can't rely only on embeds
* delete file
* de-flake mistral3
* fix qwen models
* fix style
* fix tests
* fix copies
* deflake the test
* modular reverted by fixes, fix again
* flaky test, overwritten
* fix copies
* style
2025-07-01 11:33:20 +00:00
Ayush Singh
20901f1d68
[typing] LlamaAttention return typehint ( #38998 )
...
* helo llama
* helo llama
* helo llama
* apply modular
* fix dia
---------
Co-authored-by: qubvel <qubvel@gmail.com>
2025-07-01 11:29:52 +01:00
Raushan Turganbay
7a25f8dfdb
[qwen2-vl] fix FA2 inference ( #39121 )
...
* fix FA2
* update is causal flag and remove mask for FA2
* update for FA2 with varlen path
* how the tests were passing with different devices?
* add comment and ref to the PR
* move mask preparation to base pretrained model
* seq len is the first dim, not second
* fix copies to fix GLM4V
2025-07-01 10:18:37 +00:00
Mehant Kammakomati
def9663239
feat: support indivisible shards for TP model loading and TPlizing. ( #37220 )
...
* feat: support uneven loading and sharding
resolve merge conflicts
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
* fix: allow for empty tensor computations
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
* test: add llama1b test case
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
* due to q_proj colwise it has to be multi of 2
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
* refactor: use slice API
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
* refactor: use slice API
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
* refactor: use slice API
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
* refactor: use slice API
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
---------
Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
2025-07-01 10:03:22 +00:00
jiqing-feng
06c4a4d499
fix caching_allocator_warmup with tie weights ( #39070 )
...
* fix caching_allocator_warmup with tie weights
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix comment
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
2025-07-01 11:32:20 +02:00
Raushan Turganbay
e435574721
🚨 Don't use cache in non-generative models ( #38751 )
...
* deprecate for 1 version
* style
* fix some tests
* fix esm
* skip for now, GC requires positional args but we have keyword args
* remove transpose for scores in modified models only
* skip fx trace tests
2025-07-01 09:08:21 +00:00
Cyril Vallez
dbc98328da
Several fixes for Gemma3n ( #39135 )
...
* remove the skips
* fix the epsilon to a small value (does not make sense otherwise)
* safeguard
* overload test_eager_matches_sdpa
* Update test_modeling_common.py
* skip appropriate tests
* correct no_split_layer
* fix all devices issue
* fix backward
* fix
2025-07-01 10:34:53 +02:00
BUI Van Tuan
d53518c5f2
Fix key mapping for VLMs ( #39029 )
...
* fix key mapping for VLMs
* use __mro__ instead
* update key mapping in save_pretrained
2025-07-01 09:47:53 +02:00
eustlb
3457e8e73e
[Whisper] update token timestamps tests ( #39126 )
...
* fixes
* update comment
* update for A10
* all a10
* all a10
* all a10
* all a10
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-06-30 21:55:36 +02:00
Drew Ross
fe35eca7bd
Update BigBirdPegasus model card ( #39104 )
...
* Update igbird_pegasus.md
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2025-06-30 10:42:56 -07:00
Yao Matrix
29a3f5ed8c
switch default xpu tp backend to pytorch built-in XCCL from pytorch 2.8 ( #39024 )
...
* switch default xpu tp backend to pytorch built-in XCCL from pytorch 2.8
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* Update docs/source/en/perf_infer_gpu_multi.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update perf_infer_gpu_multi.md
* Update perf_infer_gpu_multi.md
* Update perf_infer_gpu_multi.md
---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2025-06-30 08:54:05 -07:00
Vladimir Gutuev
9e0c865b8b
docs: correct two typos in awesome-transformers.md ( #39102 )
...
* docs(awesome-projects): fix typo “Itt leverages” → “It leverages” (#39101 )
closes #39101
* docs(awesome-projects): fix grammar “We provides” → “We provide” (#39101 )
closes #39101
2025-06-30 08:53:43 -07:00
jiqing-feng
03db2700ab
Enable XPU doc ( #38929 )
...
* fix example with dataset
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update torchao doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update torchao doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix device type
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* revert torchao change
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix torchao doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* revert torchao change
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update xpu torchao doc
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* update chat_templating_multimodal.md
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* use full name for int8
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* revert int8 title
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
2025-06-30 07:56:55 -07:00
Joao Gante
ea0ea392e5
Fix chat ( #39128 )
2025-06-30 13:47:48 +00:00
Lysandre Debut
ed36f8490e
Licenses ( #39127 )
...
* Licenses
* Licenses
2025-06-30 15:25:36 +02:00
Lysandre Debut
e8f90b5397
Split transformers chat and transformers serve ( #38443 )
...
* Next token
* Split chat and serve
* Support both generation methods
* Style
* Generation Config
* temp
* temp
* Finalize serving.py
Co-authored-by: célina <hanouticelina@gmail.com>
* Finalize chat.py
* Update src/transformers/commands/serving.py
Co-authored-by: célina <hanouticelina@gmail.com>
* Lucain's comments
Co-authored-by: Lucain <lucain@huggingface.co>
* Update
* Last comments on PR
* Better error handling
* Better error handling
* CI errors
* CI errors
* Add tests
* Fix tests
* Fix tests
* [chat] Split chat/serve (built on top of lysandre's PR) (#39031 )
* Next token
* Split chat and serve
* Support both generation methods
* Style
* Generation Config
* temp
* temp
* Finalize serving.py
Co-authored-by: célina <hanouticelina@gmail.com>
* Finalize chat.py
* Update src/transformers/commands/serving.py
Co-authored-by: célina <hanouticelina@gmail.com>
* Lucain's comments
Co-authored-by: Lucain <lucain@huggingface.co>
* Update
* Last comments on PR
* Better error handling
* Better error handling
* CI errors
* CI errors
* Add tests
* Fix tests
* Fix tests
* streaming tool call
* abstract tool state; set tool start as eos
* todos
* server working on models without tools
* rm chat's deprecated flags
* chat defaults
* kv cache persists across calls
* add server docs
* link
* Update src/transformers/commands/serving.py
* Apply suggestions from code review
* i love merge conflicts
* solve multi turn with tiny-agents
* On the fly switching of the models
* Remove required positional arg
---------
Co-authored-by: Lysandre <hi@lysand.re>
Co-authored-by: célina <hanouticelina@gmail.com>
Co-authored-by: Lucain <lucain@huggingface.co>
* Protect names
* Fix tests
---------
Co-authored-by: célina <hanouticelina@gmail.com>
Co-authored-by: Lucain <lucain@huggingface.co>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2025-06-30 15:10:53 +02:00
Yih-Dar
539c6c2fa8
All CI jobs with A10 ( #39119 )
...
all a10
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-06-30 14:23:27 +02:00