transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-08-03 03:31:05 +06:00

Author	SHA1	Message	Date
ydshieh	ff9034ffda	fix	2025-07-02 13:35:46 +02:00
ydshieh	9bb036d736	fix	2025-07-02 13:27:02 +02:00
ydshieh	7cd5f16822	fix	2025-07-02 12:56:00 +02:00
ydshieh	41de695fa3	fix	2025-07-02 12:41:37 +02:00
ydshieh	571aa68422	fix	2025-07-02 12:30:30 +02:00
ydshieh	ca688e7449	fix	2025-07-02 12:25:43 +02:00
ydshieh	50d48aaa8a	fix	2025-07-02 12:20:33 +02:00
ydshieh	1c407778e2	fix	2025-07-02 12:19:01 +02:00
ydshieh	a92786d77a	fix	2025-07-02 12:13:30 +02:00
ydshieh	1efce2dbf8	fix	2025-07-02 12:08:22 +02:00
ydshieh	fce61367b5	fix	2025-07-02 12:05:30 +02:00
ydshieh	e23c242848	fix	2025-07-02 11:50:31 +02:00
ydshieh	2ec3fdcf2a	fix	2025-07-02 11:22:48 +02:00
ydshieh	39d61f8c8f	[skip ci]	2025-07-02 08:39:33 +02:00
ydshieh	35bd19eda8	empty	2025-07-02 08:39:22 +02:00
ydshieh	53c409ed49	fix	2025-07-02 08:36:01 +02:00
ydshieh	15a413576e	fix	2025-07-02 08:30:19 +02:00
ydshieh	17d4b80e3e	fix	2025-07-02 08:21:11 +02:00
ydshieh	ab8726bdd3	fix	2025-07-02 08:15:20 +02:00
ydshieh	7e1913820c	fix	2025-07-02 08:11:17 +02:00
ydshieh	b58800f83f	fix	2025-07-02 08:05:45 +02:00
ydshieh	2e63d0b1f4	fix	2025-07-02 08:02:17 +02:00
ydshieh	bec9d4fbab	fix	2025-07-02 07:53:19 +02:00
ydshieh	8ebb6de590	fix	2025-07-02 07:47:19 +02:00
ydshieh	ca5410f3e8	fix	2025-07-02 07:42:13 +02:00
Yih-Dar	4c1715b610	Update expected values (after switching to A10) (#39157 ) * fix * fix * fix * fix * fix * fix * fix * fix * fix * empty * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-07-01 20:54:31 +02:00
Yih-Dar	ab59cc27fe	Suggest jobs to use in `run-slow` (#39100 ) * pr * pr * pr * pr * pr * pr * pr * pr * pr --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-07-01 20:19:06 +02:00
jiqing-feng	db2f535443	update bnb ground truth (#39117 ) * update bnb resulte Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * set seed to avoid sampling different results Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix int8 tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix typo Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * add comments Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>	2025-07-01 20:06:37 +02:00
ybkurt	260846efad	fix: remove undefined variable (#39146 )	2025-07-01 19:10:29 +02:00
rasmi	cdfe49a4d0	Change `@lru_cache()` to `@lru_cache` to match styles from #38883 . (#39093 ) Match styles in #38883	2025-07-01 18:29:16 +02:00
DavidS2106	f46798193e	Fix: Ensure wandb logs config in offline mode (#38992 ) * Fix: Ensure wandb logs config in offline mode * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-07-01 16:17:58 +00:00
Yih-Dar	fe838d6631	Fix missing fsdp & trainer jobs in daily CI (#39153 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-07-01 18:10:30 +02:00
StevenBucaille	1283877571	[superglue] fix wrong concatenation which made batching results wrong (#38850 ) Some checks are pending Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run Details Build documentation / build (push) Waiting to run Details New model PR merged notification / Notify new model (push) Waiting to run Details Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run Details Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run Details Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions Details Secret Leaks / trufflehog (push) Waiting to run Details Update Transformers metadata / build_and_package (push) Waiting to run Details	2025-07-01 12:14:44 +00:00
Raushan Turganbay	f8b88866f5	[VLMs] support passing embeds along with pixels (#38467 ) * VLMs can work with embeds now * update more models * fix tests * fix copies * fixup * fix * style * unskip tests * fix copies * fix tests * style * omni modality models * qwen models had extra indentation * fix some other tests * fix copies * fix test last time * unrelated changes revert * we can't rely only on embeds * delete file * de-flake mistral3 * fix qwen models * fix style * fix tests * fix copies * deflake the test * modular reverted by fixes, fix again * flaky test, overwritten * fix copies * style	2025-07-01 11:33:20 +00:00
Ayush Singh	20901f1d68	[typing] LlamaAttention return typehint (#38998 ) * helo llama * helo llama * helo llama * apply modular * fix dia --------- Co-authored-by: qubvel <qubvel@gmail.com>	2025-07-01 11:29:52 +01:00
Raushan Turganbay	7a25f8dfdb	[qwen2-vl] fix FA2 inference (#39121 ) * fix FA2 * update is causal flag and remove mask for FA2 * update for FA2 with varlen path * how the tests were passing with different devices? * add comment and ref to the PR * move mask preparation to base pretrained model * seq len is the first dim, not second * fix copies to fix GLM4V	2025-07-01 10:18:37 +00:00
Mehant Kammakomati	def9663239	feat: support indivisible shards for TP model loading and TPlizing. (#37220 ) * feat: support uneven loading and sharding resolve merge conflicts Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * fix: allow for empty tensor computations Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * test: add llama1b test case Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * due to q_proj colwise it has to be multi of 2 Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * refactor: use slice API Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * refactor: use slice API Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * refactor: use slice API Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> * refactor: use slice API Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com> --------- Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>	2025-07-01 10:03:22 +00:00
jiqing-feng	06c4a4d499	fix caching_allocator_warmup with tie weights (#39070 ) * fix caching_allocator_warmup with tie weights Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix comment Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>	2025-07-01 11:32:20 +02:00
Raushan Turganbay	e435574721	🚨 Don't use cache in non-generative models (#38751 ) * deprecate for 1 version * style * fix some tests * fix esm * skip for now, GC requires positional args but we have keyword args * remove transpose for scores in modified models only * skip fx trace tests	2025-07-01 09:08:21 +00:00
Cyril Vallez	dbc98328da	Several fixes for Gemma3n (#39135 ) * remove the skips * fix the epsilon to a small value (does not make sense otherwise) * safeguard * overload test_eager_matches_sdpa * Update test_modeling_common.py * skip appropriate tests * correct no_split_layer * fix all devices issue * fix backward * fix	2025-07-01 10:34:53 +02:00
BUI Van Tuan	d53518c5f2	Fix key mapping for VLMs (#39029 ) * fix key mapping for VLMs * use __mro__ instead * update key mapping in save_pretrained	2025-07-01 09:47:53 +02:00
eustlb	3457e8e73e	[Whisper] update token timestamps tests (#39126 ) Some checks are pending Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run Details Build documentation / build (push) Waiting to run Details Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run Details Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run Details Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions Details Secret Leaks / trufflehog (push) Waiting to run Details Update Transformers metadata / build_and_package (push) Waiting to run Details * fixes * update comment * update for A10 * all a10 * all a10 * all a10 * all a10 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-30 21:55:36 +02:00
Drew Ross	fe35eca7bd	Update BigBirdPegasus model card (#39104 ) * Update igbird_pegasus.md * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-06-30 10:42:56 -07:00
Yao Matrix	29a3f5ed8c	switch default xpu tp backend to pytorch built-in XCCL from pytorch 2.8 (#39024 ) * switch default xpu tp backend to pytorch built-in XCCL from pytorch 2.8 Signed-off-by: YAO Matrix <matrix.yao@intel.com> * Update docs/source/en/perf_infer_gpu_multi.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update perf_infer_gpu_multi.md * Update perf_infer_gpu_multi.md * Update perf_infer_gpu_multi.md --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-06-30 08:54:05 -07:00
Vladimir Gutuev	9e0c865b8b	docs: correct two typos in awesome-transformers.md (#39102 ) * docs(awesome-projects): fix typo “Itt leverages” → “It leverages” (#39101) closes #39101 * docs(awesome-projects): fix grammar “We provides” → “We provide” (#39101) closes #39101	2025-06-30 08:53:43 -07:00
jiqing-feng	03db2700ab	Enable XPU doc (#38929 ) * fix example with dataset Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update torchao doc Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update torchao doc Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix device type Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * revert torchao change Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix torchao doc Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * revert torchao change Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update xpu torchao doc Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * update chat_templating_multimodal.md Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * use full name for int8 Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * revert int8 title Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-06-30 07:56:55 -07:00
Joao Gante	ea0ea392e5	Fix chat (#39128 )	2025-06-30 13:47:48 +00:00
Lysandre Debut	ed36f8490e	Licenses (#39127 ) Some checks are pending Self-hosted runner (benchmark) / Benchmark (aws-g5-4xlarge-cache) (push) Waiting to run Details Build documentation / build (push) Waiting to run Details New model PR merged notification / Notify new model (push) Waiting to run Details Slow tests on important models (on Push - A10) / Get all modified files (push) Waiting to run Details Slow tests on important models (on Push - A10) / Slow & FA2 tests (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Check if setup was changed (push) Waiting to run Details Self-hosted runner (push-caller) / build-docker-containers (push) Blocked by required conditions Details Self-hosted runner (push-caller) / Trigger Push CI (push) Blocked by required conditions Details Secret Leaks / trufflehog (push) Waiting to run Details Update Transformers metadata / build_and_package (push) Waiting to run Details * Licenses * Licenses	2025-06-30 15:25:36 +02:00
Lysandre Debut	e8f90b5397	Split `transformers chat` and `transformers serve` (#38443 ) * Next token * Split chat and serve * Support both generation methods * Style * Generation Config * temp * temp * Finalize serving.py Co-authored-by: =?UTF-8?q?c=C3=A9lina?= <hanouticelina@gmail.com> * Finalize chat.py * Update src/transformers/commands/serving.py Co-authored-by: célina <hanouticelina@gmail.com> * Lucain's comments Co-authored-by: Lucain <lucain@huggingface.co> * Update * Last comments on PR * Better error handling * Better error handling * CI errors * CI errors * Add tests * Fix tests * Fix tests * [chat] Split chat/serve (built on top of lysandre's PR) (#39031) * Next token * Split chat and serve * Support both generation methods * Style * Generation Config * temp * temp * Finalize serving.py Co-authored-by: =?UTF-8?q?c=C3=A9lina?= <hanouticelina@gmail.com> * Finalize chat.py * Update src/transformers/commands/serving.py Co-authored-by: célina <hanouticelina@gmail.com> * Lucain's comments Co-authored-by: Lucain <lucain@huggingface.co> * Update * Last comments on PR * Better error handling * Better error handling * CI errors * CI errors * Add tests * Fix tests * Fix tests * streaming tool call * abstract tool state; set tool start as eos * todos * server working on models without tools * rm chat's deprecated flags * chat defaults * kv cache persists across calls * add server docs * link * Update src/transformers/commands/serving.py * Apply suggestions from code review * i love merge conflicts * solve multi turn with tiny-agents * On the fly switching of the models * Remove required positional arg --------- Co-authored-by: Lysandre <hi@lysand.re> Co-authored-by: =?UTF-8?q?c=C3=A9lina?= <hanouticelina@gmail.com> Co-authored-by: Lucain <lucain@huggingface.co> * Protect names * Fix tests --------- Co-authored-by: =?UTF-8?q?c=C3=A9lina?= <hanouticelina@gmail.com> Co-authored-by: Lucain <lucain@huggingface.co> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>	2025-06-30 15:10:53 +02:00
Yih-Dar	539c6c2fa8	All CI jobs with A10 (#39119 ) all a10 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-06-30 14:23:27 +02:00

1 2 3 4 5 ...

19521 Commits