Cyril Vallez
056fa73fae
[modular] Simplify logic and docstring handling ( #39185 )
...
* simplify a lot
* Update modular_model_converter.py
* finalize
* remove outdated functions
* apply it
* and examples
2025-07-07 14:52:57 +02:00
Xavier Dupré
f16fbfb89a
Make _compute_dynamic_ntk_parameters exportable ( #39171 )
...
* Make _compute_dynamic_ntk_parameters exportable
* add unit test
2025-07-07 14:48:31 +02:00
kaixuanliu
4243bb844d
fix bug using FSDP V1 will lead to model device not properly set ( #39177 )
...
* fix bug using FSDP V1 will lead to model device not properly set
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
* update the code
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
---------
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
2025-07-07 14:47:04 +02:00
Yih-Dar
34c16167eb
Don't send new comment if the previous one is less than 30 minutes (unless the content is changed) ( #39170 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-07 14:43:50 +02:00
Daniel van Strien
b8f397e456
fix typo in Gemma3n notes ( #39196 )
2025-07-07 14:41:33 +02:00
Cyril Vallez
5348fbc005
[modular] Follow global indexing and attribute setting, and their dependencies ( #39180 )
...
* export global indexing statements
* add example
* style
* examples
2025-07-07 14:36:43 +02:00
Isotr0py
8570bc29f3
Fix missing fast tokenizer/image_processor in whisper/qwen2.5-omni processor ( #39244 )
...
* fix missing fast tokenizer in whisper processor
Signed-off-by: Isotr0py <2037008807@qq.com>
* fix processor test
Signed-off-by: Isotr0py <2037008807@qq.com>
* fix qwen2.5 omni processor
Signed-off-by: Isotr0py <2037008807@qq.com>
---------
Signed-off-by: Isotr0py <2037008807@qq.com>
2025-07-07 13:54:18 +02:00
Joshua Lochner
b283d52f7f
[vjepa2] replace einsum with unsqueeze ( #39234 )
2025-07-07 11:14:08 +01:00
Rémi Ouazan
a325409a50
Expectations re-order and corrected FA3 skip ( #39195 )
...
* Fix Expectations and a FA3 skip
* Fixed docstring
* Added context for Default expectation
2025-07-07 11:42:33 +02:00
zrohyun
b0a8e0b8d7
[video processors] Support float fps for precise frame sampling ( #39134 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
* [video processors] Support float fps for precise frame sampling
Enable fractional fps values (e.g., 1.5, 29.97) in video processors
for more precise frame sampling control.
- Change fps type from int to float across all video processors
- Maintain backward compatibility with integer values
Extends: #38105
* [video processors] Refine fps typing to Union[int, float]
Change fps type from Optional[float] to Optional[Union[int, float]]
for more explicit type information about supporting both integer
and floating-point frame rates.
- Update type hints and docstrings across 8 files
- Maintain backward compatibility
- Clarify support for both int and float values
Extends: #38105
* Revert "[video processors] Support float fps for precise frame sampling"
This reverts commit 7360d6e661
.
2025-07-07 03:43:43 +00:00
Arthur
ca7e1a3756
Refactor the way we handle outputs for new llamas and new models ( #39120 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Has been cancelled
Build documentation / build (push) Has been cancelled
New model PR merged notification / Notify new model (push) Has been cancelled
Slow tests on important models (on Push - A10) / Get all modified files (push) Has been cancelled
Self-hosted runner (push-caller) / Check if setup was changed (push) Has been cancelled
Secret Leaks / trufflehog (push) Has been cancelled
Update Transformers metadata / build_and_package (push) Has been cancelled
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Has been cancelled
Self-hosted runner (push-caller) / build-docker-containers (push) Has been cancelled
Self-hosted runner (push-caller) / Trigger Push CI (push) Has been cancelled
* just update 2 files
* update other models as well just making fix-copies
* also add the changes needed to modeling utils
* put this on the pretrained model instead
* nits and fixes
* update generic, fix to use config value
* update other modelings
* use transformers kwargs instead
* update
* update
* update other models
* update
* updates
* update
* update
* update
* fix
* finally
* very small nits
* this fixes more tests
* fix other models as well!
* update modularqwen2
* update models based on qwen2
* update
* update
* remove the **flash stuff in favor of noraml kwargs
* update
* propagate gemma?
* remove output attentions
* propagate
* support cross attention edge case
* same
* test this
* fixes
* more fix
* update
* update
* fix conflicts
* update
* fix emu3
* fix emu3
* move the fix a bit
* quel enfer
* some fixes, loss_kwargs should never had been
* finish fixing gemma3n
* fix small lm3
* fix another one
* fix csm now
* fux csm and mistral
* fix mistral now
* small fixes
* fix janusss
* only for some models
* fixup
* phix phi3
* more fixes?
* dose this fix it?
* update
* holy shit it was just graph breaks
* protect torch
* updates
* fix samhq?
* fix moonshine
* more moonshine fixes, 3 failures left!
* nits
* generic needs to support more
* more fixes to moonshine!
* fix cross attention outputs!
* fix csm!
* nits
* fix stupid kosmos2
* current updates
* fixes
* use output recorder?
* nicer!
* a little bit of magic
* update
* fix protect
* fix
* small fixes
* protect import
* fix a bunch of more models
* fix fixups
* fix some of the last ones
* nit
* partly fix phi
* update
* fix import path
* make something that is fullgraph compatible just to be sure
* typing was wrong on llama so the rest was wrong as well
* fucking ugly but at least it is still exportable
* syle
* supposed to fix moonshine, it still breaks
* fix some default
* fix the last bits of sam
* update samhq
* more fixes to am hq
* nit
* fix all output+hidden states and output_attentions!
* fix?
* fix diffllama
* updates to fix initialization on the sam pips
* ups there was a bug
* fix the last sam hq test
* fix gotocr
* fix gotocr2!
* fixes
* skip stupid tests
* there was one left :)
* fixup
* fix fix copies issues with this test file
* fix copies for sam_hq
* rm some comments
* skip 2 more failing tests
* fix
* fix everything
* Apply suggestions from code review
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
* add more doc!
* fix public init
* fix modular qwen3
---------
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
2025-07-05 11:34:28 +02:00
Yih-Dar
e6a8063ef1
Update expected values (after switching to A10) - part 8 - Final ( #39220 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-04 13:35:53 +02:00
Yih-Dar
cd8a041a4f
Update expected values (after switching to A10) - part 7 ( #39218 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-04 12:48:10 +02:00
Cyril Vallez
0cf27916f0
Add packed tensor format support for flex/sdpa/eager through the mask! ( #39194 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
New model PR merged notification / Notify new model (push) Has been cancelled
* Add the necesary logic to mask_utils
* add it everywhere
* Update masking_utils.py
* style
* Update masking_utils.py
* Update modeling_mimi.py
* Update masking_utils.py
* add support for more than batch size 1
* Update masking_utils.py
* add test
* style
* Update test_masking_utils.py
* Update masking_utils.py
* add require_token
* fix tests
* fix
2025-07-04 09:01:56 +02:00
Yih-Dar
037755ed54
Update expected values (after switching to A10) - part 6 ( #39207 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-03 22:45:30 +02:00
Yih-Dar
1168f57abf
Update expected values (after switching to A10) - part 5 ( #39205 )
...
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-03 19:56:02 +02:00
Lysandre Debut
7d9e52f376
Fix continuous batching in transformers serve
( #39149 )
...
* Fix CB
* Nit
* Update src/transformers/commands/serving.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Add todos
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
2025-07-03 18:15:31 +02:00
Joao Gante
85d93cc6e3
[serve] Cursor support, move docs into separate page, add more examples ( #39133 )
...
* jan docs
* rm
* [cursor] tmp commit
* Cursor working :D
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update src/transformers/commands/serving.py
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* cursor docs
* try to fix agents/tools docs?
* try to fix agents/tools docs?
* Update docs/source/en/serving.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* add transformers chat example with transformers serve
---------
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
2025-07-03 17:04:16 +01:00
Pavel Iakubovskii
e15b06d8dc
[typing] better return typehints for from_pretrained
( #39184 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
New model PR merged notification / Notify new model (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
* config
* processor
* feature-extractor
* jukebox
* fixup
* update other methods in config
* remove "PretrainedConfig" annotations
2025-07-03 14:22:47 +00:00
Yih-Dar
a25fc3592e
Update expected values (after switching to A10) - part 4 ( #39189 )
...
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-03 15:13:06 +02:00
Anton Vlasjuk
b31e9d19a6
[Dia
] Change ckpt path in docs ( #39181 )
...
fix ckpt path
2025-07-03 10:02:58 +00:00
Ilyas Moutawwakil
18e0cae207
Fix many HPU failures in the CI ( #39066 )
...
* more torch.hpu patches
* increase top_k because it results in flaky behavior when Tempreture, TopP and TopK are used together, which ends up killing beams early.
* remove temporal fix
* fix scatter operation when input and src are the same
* trigger
* fix and reduce
* skip finding batch size as it makes the hpu go loco
* fix fsdp (yay all are passing)
* fix checking equal nan values
* style
* remove models list
* order
* rename to cuda_extensions
* Update src/transformers/trainer.py
2025-07-03 11:17:27 +02:00
Marc Sun
bff964c429
Decouple device_map='auto' and tp_plan='auto' ( #38942 )
...
* dissociate
* better place
* fix
2025-07-03 11:07:11 +02:00
Wing Lian
8178c43112
when delaying optimizer creation only prepare the model ( #39152 )
2025-07-03 09:04:16 +02:00
Raushan Turganbay
91221da2f1
[glm4v] fix video inference ( #39174 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
New model PR merged notification / Notify new model (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
fix video inference
2025-07-03 05:20:41 +00:00
Rémi Ouazan
ebfbcd42da
Test fixes for Aria (and some Expectation for llava_next_video) ( #39131 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
* Expectations for llava_next_video
* Updated image src in aria
* Fix test_small_model_integration_test
* Fix small model integration llama
* Fix a bunch of tests
* Style
* Shortened generation in test from 900 to 90
2025-07-02 23:41:14 +02:00
Yih-Dar
37a239ca50
Update expected values (after switching to A10) - part 3 ( #39179 )
...
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-02 22:48:30 +02:00
Yih-Dar
9326fc332d
Update expected values (after switching to A10) - part 2 ( #39165 )
...
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* empty
* [skip ci]
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-02 22:47:55 +02:00
Pedro Cuenca
25cd65ac43
Random serve fixes ( #39176 )
...
* Fix index out of bounds exception on wrong kv reuse
* Prevent loading same model twice
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Lysandre Debut <hi@lysand.re>
2025-07-02 22:09:58 +02:00
Lysandre Debut
548794b886
[serve] Model name or path should be required ( #39178 )
...
* Model name or path should be required
* Fix + add tests
* Change print to log so it doesn't display in transformers chat
2025-07-02 22:06:47 +02:00
Joao Gante
2d561713f8
[generate] document non-canonical beam search default behavior ( #39000 )
2025-07-02 18:29:16 +01:00
Steven Liu
df12d87d18
[docs] ViTPose ( #38630 )
...
* vitpose
* fix?
* fix?
* feedback
* fix
* feedback
* feedback
* update sample image
2025-07-02 07:56:29 -07:00
Cyril Vallez
2b4a12b5bf
Reduce Glm4v model test size significantly ( #39173 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
New model PR merged notification / Notify new model (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
* fix test size
* Update test_modeling_glm4v.py
2025-07-02 15:55:05 +02:00
BUI Van Tuan
e355c0a11c
Fix missing initializations for models created in 2024 ( #38987 )
...
* fix GroundingDino
* fix SuperGlue
* fix GroundingDino
* fix MambaModel
* fix OmDetTurbo
* fix SegGpt
* fix Qwen2Audio
* fix Mamba2
* fix DabDetr
* fix Dac
* fix FalconMamba
* skip timm initialization
* fix Encodec and MusicgenMelody
* fix Musicgen
* skip timm initialization test
* fix OmDetTurbo
* clean the code
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>
* add reviewed changes
* add back timm
* style
* better check for parametrizations
---------
Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>
2025-07-02 15:03:57 +02:00
Rémi Ouazan
1125513a8d
Blip2 fixes ( #39080 )
...
* Fixed some devices errors
* Fixed other device issues and more expectations
* Reverted support flags
* style
* More granular support
* Fixed some rebase stuff
* add a not None check before .to
2025-07-02 14:39:39 +02:00
Isotr0py
28df7f854a
Fix multimodal processor get duplicate arguments when receive kwargs for initialization ( #39125 )
...
* fix processor tokenizer override
Signed-off-by: Isotr0py <2037008807@qq.com>
* code format
Signed-off-by: Isotr0py <2037008807@qq.com>
* add regression test
Signed-off-by: Isotr0py <2037008807@qq.com>
* fix
Signed-off-by: Isotr0py <2037008807@qq.com>
* check image processor same
Signed-off-by: Isotr0py <2037008807@qq.com>
---------
Signed-off-by: Isotr0py <2037008807@qq.com>
2025-07-02 19:57:15 +08:00
Yaswanth Gali
b61023a1b7
🚨 🚨 🚨 [eomt] make EoMT compatible with pipeline ( #39122 )
...
* Make EoMT compatible with pipeline
* Implicit patch offsets
* remove patch offsets from arg
* Modify tests
* Update example
* fix proc testcase
* Add few more args
* add pipeline test suite
* fix
* docstring fixes
* add pipeline test
* changes w.r.t review
* 🙈 MB
* should fix device mismatch
* debug
* Fixes device mismatch
* use decorator
* we can split mlp
* expected values update
---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
2025-07-02 12:25:26 +01:00
Raushan Turganbay
4d5822e65d
[smolvlm] fix video inference ( #39147 )
...
* fix smolvlm
* better do as before, set sampling params in overwritten `apply_chat_template`
* style
* update with `setdefault`
2025-07-02 12:05:10 +02:00
वेदांत
9b2f5b66d8
fix default value of config to match checkpionts in LLaVa-OV models ( #39163 )
2025-07-02 09:45:50 +00:00
Chong You
e8e0c76162
Add activation sparsity reference in gemma3n doc ( #39160 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
Add activation sparsity reference in the description of gemma3n
2025-07-02 04:11:03 +02:00
Yih-Dar
8e87adc45f
fix llama
tests ( #39161 )
...
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
New model PR merged notification / Notify new model (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
* fix
* fix
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-01 23:27:22 +02:00
Yih-Dar
4c1715b610
Update expected values (after switching to A10) ( #39157 )
...
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* empty
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-01 20:54:31 +02:00
Yih-Dar
ab59cc27fe
Suggest jobs to use in run-slow
( #39100 )
...
* pr
* pr
* pr
* pr
* pr
* pr
* pr
* pr
* pr
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-01 20:19:06 +02:00
jiqing-feng
db2f535443
update bnb ground truth ( #39117 )
...
* update bnb resulte
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* set seed to avoid sampling different results
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix int8 tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix typo
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* add comments
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
2025-07-01 20:06:37 +02:00
ybkurt
260846efad
fix: remove undefined variable ( #39146 )
2025-07-01 19:10:29 +02:00
rasmi
cdfe49a4d0
Change @lru_cache()
to @lru_cache
to match styles from #38883 . ( #39093 )
...
Match styles in #38883
2025-07-01 18:29:16 +02:00
DavidS2106
f46798193e
Fix: Ensure wandb logs config in offline mode ( #38992 )
...
* Fix: Ensure wandb logs config in offline mode
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
2025-07-01 16:17:58 +00:00
Yih-Dar
fe838d6631
Fix missing fsdp & trainer jobs in daily CI ( #39153 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2025-07-01 18:10:30 +02:00
StevenBucaille
1283877571
[superglue] fix wrong concatenation which made batching results wrong ( #38850 )
Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run
Build documentation / build (push) Waiting to run
New model PR merged notification / Notify new model (push) Waiting to run
Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run
Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions
Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run
Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions
Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions
Secret Leaks / trufflehog (push) Waiting to run
Update Transformers metadata / build_and_package (push) Waiting to run
2025-07-01 12:14:44 +00:00
Raushan Turganbay
f8b88866f5
[VLMs] support passing embeds along with pixels ( #38467 )
...
* VLMs can work with embeds now
* update more models
* fix tests
* fix copies
* fixup
* fix
* style
* unskip tests
* fix copies
* fix tests
* style
* omni modality models
* qwen models had extra indentation
* fix some other tests
* fix copies
* fix test last time
* unrelated changes revert
* we can't rely only on embeds
* delete file
* de-flake mistral3
* fix qwen models
* fix style
* fix tests
* fix copies
* deflake the test
* modular reverted by fixes, fix again
* flaky test, overwritten
* fix copies
* style
2025-07-01 11:33:20 +00:00