transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 12:50:06 +06:00

Author	SHA1	Message	Date
Raushan Turganbay	1a5be2f5c0	[aya vision] fix processor for vLLM (#38371 ) accidentally merged two PRs in one (；－＿－)	2025-05-27 09:43:53 +00:00
Raushan Turganbay	19fdb75cf0	[video utils] group and reorder by number of frames (#38374 ) fix	2025-05-27 11:32:33 +02:00
Raushan Turganbay	b0735dc0c1	[paligemma] fix processor with suffix (#38365 ) fix pg processor	2025-05-27 11:31:56 +02:00
Raushan Turganbay	9e1017b479	[transformers x vLLM] standardize processors (#37915 ) * standardize * fix tests * batch update some processors, not final yet * oke, now I tested that everything indeed runs. Still needs prettification * emu3 * fixup * gemma3 but it doesn't generate anything * fuyu * update * why? * Update src/transformers/models/aya_vision/processing_aya_vision.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * address comments * bc * why do we need to guard import this every time? * i hate guarded imports * i am blind --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-05-27 11:30:30 +02:00
Cyril Vallez	b5ececb900	Fix image token mask in Gemma3 (#38295 ) fix mask	2025-05-27 11:15:52 +02:00
Jitesh Gupta	c4e71e8fff	Add AMD MI300 CI caller leveraging self-hosted runner scale set workflow in hf-workflows (#38132 )	2025-05-26 23:13:02 +02:00
Matt	706b00928f	Stop autoconverting custom code checkpoints (#37751 ) * Stop autoconverting custom code checkpoints * make fixup * Better auto class detection * Match the kwarg ordering	2025-05-26 19:15:28 +01:00
Yih-Dar	07848a8405	update gemma tests (#38384 ) * update * update * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-26 19:54:04 +02:00
Joao Gante	cd0f3ce73b	[cli] cli usable without torch (#38386 ) cli without torch	2025-05-26 16:54:18 +00:00
Matt	ba6d72226d	🚨 🚨 Fix custom code saving (#37716 ) * Firstly: Better detection of when we're a custom class * Trigger tests * Let's break everything * make fixup * fix mistaken line doubling * Let's try to get rid of it from config classes at least * Let's try to get rid of it from config classes at least * Fixup image processor * no more circular import * Let's go back to setting `_auto_class` again * Let's go back to setting `_auto_class` again * stash commit * Revert the irrelevant changes until we figure out AutoConfig * Change tests since we're breaking expectations * make fixup * do the same for all custom classes * Cleanup for feature extractor tests * Cleanup tokenization tests too * typo * Fix tokenizer tests * make fixup * fix image processor test * make fixup * Remove warning from register_for_auto_class * Stop adding model info to auto map entirely * Remove todo * Remove the other todo * Let's start slapping _auto_class on models why not * Let's start slapping _auto_class on models why not * Make sure the tests know what's up * Make sure the tests know what's up * Completely remove add_model_info_to_* * Start adding _auto_class to models * Start adding _auto_class to models * Add a flaky decorator * Add a flaky decorator and import * stash commit * More message cleanup * make fixup * fix indent * Fix trust_remote_code prompts * make fixup * correct indentation * Reincorporate changes into dynamic_module_utils * Update call to trust_remote_code * make fixup * Fix video processors too * Fix video processors too * Remove is_flaky additions * make fixup	2025-05-26 17:37:30 +01:00
Matt	701caef704	Stop TF weight rename reDOS (#38325 ) * let's try a non-regex solution * make fixup * Slight adjustment * Let's just use the original code with a check * slight tweak to conditional * slight tweak to conditional	2025-05-26 16:58:51 +01:00
Judd	0a4e8e2855	fix typo: `tokenizer` -> `tokenize` (#38357 )	2025-05-26 15:29:16 +00:00
Ragnar	63964b7c67	fix typos (#38336 ) * Update video_processor.md * Update deepseek_v3.md	2025-05-26 14:42:37 +00:00
Cyril Vallez	8b03c8eaf2	Better check in `initialize_weights` (#38382 ) * Update modeling_utils.py * CIs * CIs	2025-05-26 16:20:23 +02:00
Yih-Dar	eb74cf977b	Use one `utils/notification_service.py` (#38379 ) * step 1 * step 2 * step 3 * step 4 * step 5 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-26 16:15:29 +02:00
Arthur	98328fd9a1	for now disable compile (#38383 )	2025-05-26 15:57:11 +02:00
Manuel de Prada Corral	78079abeff	Improved cache docs (#38060 ) * improved cache docs Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-26 13:53:41 +00:00
Dhia Eddine Rhaiem	7a9b071bfd	[Falcon H1] Fix slow path forward pass (#38320 ) * Create push-important-models.yml * feat: add falcon-h1 * fixup * address comment * fix * fix copies * fix copies * fix * fix * fix * fix * fix copies * fix * fix copies * fix test import to at least trigget the cis * yups * update * fix make fix copies * fix inits? * fix style * skip annoying test * add integration test for Falcon H1 * fix copies * fix * fix typo * make style * fix slow path generations * clean debug traces * debug * remove debug traces final confirmation * clean debug traces final * fix format and lineup * make style * debug * Update src/transformers/models/falcon_h1/modular_falcon_h1.py Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> * adress comments * fix fix-copies * fix integration test * Merge pull request #7 from ydshieh/fix-slow-path update * another update (#8) * update * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Younes Belkada <younesbelkada@gmail.com> Co-authored-by: younesbelkada <younes.belkada@tii.ae> Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-26 15:30:35 +02:00
Cyril Vallez	b5b76b5561	Protect `get_default_device` for torch<2.3 (#38376 ) * Update modeling_utils.py * CIs	2025-05-26 15:00:09 +02:00
Isotr0py	bff32678cc	Fix incorrect batching audio index calculation for Phi-4-Multimodal (#38103 ) * fix Signed-off-by: Isotr0py <2037008807@qq.com> * add tests Signed-off-by: Isotr0py <2037008807@qq.com> * code format Signed-off-by: Isotr0py <2037008807@qq.com> * Update src/transformers/models/phi4_multimodal/feature_extraction_phi4_multimodal.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Signed-off-by: Isotr0py <2037008807@qq.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-05-26 12:41:31 +00:00
Cyril Vallez	9f0402bc4d	Fix all import errors based on older torch versions (#38370 ) * Update masking_utils.py * fix * fix * fix * Update masking_utils.py * Update executorch.py * fix	2025-05-26 12:11:54 +02:00
Anton Vlasjuk	d03a3ca692	[`OPT`] Fix attention scaling (#38290 ) * fix opt attention scaling * add comment to why we do this	2025-05-26 11:02:16 +02:00
Yao Matrix	a5a0c7b888	switch to device agnostic device calling for test cases (#38247 ) * use device agnostic APIs in test cases Signed-off-by: Matrix Yao <matrix.yao@intel.com> * fix style Signed-off-by: Matrix Yao <matrix.yao@intel.com> * add one more Signed-off-by: YAO Matrix <matrix.yao@intel.com> * xpu now supports integer device id, aligning to CUDA behaviors Signed-off-by: Matrix Yao <matrix.yao@intel.com> * update to use device_properties Signed-off-by: Matrix Yao <matrix.yao@intel.com> * fix style Signed-off-by: Matrix Yao <matrix.yao@intel.com> * update comment Signed-off-by: Matrix Yao <matrix.yao@intel.com> * fix comments Signed-off-by: Matrix Yao <matrix.yao@intel.com> * fix style Signed-off-by: Matrix Yao <matrix.yao@intel.com> --------- Signed-off-by: Matrix Yao <matrix.yao@intel.com> Signed-off-by: YAO Matrix <matrix.yao@intel.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-26 10:18:53 +02:00
Raushan Turganbay	cba279f46c	[VLMs] add helpers for get/set embedding (#38144 ) * add helpers in VLMs * fix tied weight key test	2025-05-26 09:50:32 +02:00
Yih-Dar	6e3063422c	Uninstall `kernels` for AMD docker images (#38354 ) Uninstall kernels for AMD docker images Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-25 19:42:25 +02:00
Yih-Dar	4a03044ddb	Hot fix for AMD CI workflow (#38349 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-25 11:15:31 +02:00
Yih-Dar	d0c9c66d1c	new failure CI reports for all jobs (#38298 ) * new failures * report_repo_id * report_repo_id * report_repo_id * More fixes * More fixes * More fixes * ruff --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-24 19:15:02 +02:00
Kseniya Parkhamchuk	31f8a0fe8a	[docs]: update roformer.md model card (#37946 ) * Update roformer model card * fix example purpose description * fix model description according to the comments * revert changes for autodoc * remove unneeded tags * fix review issues * fix hfoption --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-23 16:27:56 -07:00
Bryan C.	36f97ae15b	docs(swinv2): Update SwinV2 model card to new standard format (#37942 ) * docs(swinv2): Update SwinV2 model card to new standard format * docs(swinv2): Apply review suggestions Incorporates feedback from @stevhliu to: - Enhance the introductory paragraph with more details about scaling and SimMIM. - Generalize the tip from "image classification tasks" to "vision tasks". Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-23 13:04:13 -07:00
Aguedo	33d23c39ed	Update BioGPT model card (#38214 ) * Update BioGPT model card * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/biogpt.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * correction for CPU fallback * added quantization code and method * fixed transformers-cli call --------- Co-authored-by: Aguedo <aguedo@fakeemail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-23 13:03:47 -07:00
Cheery	dffb118013	Remove duplicate docstring: resample (#38305 ) Duplicate of the line above.	2025-05-23 13:02:58 -07:00
Cyril Vallez	e0aad278fe	Never fallback to eager implicitly (#38327 ) * remove arg everywhere * Update warnings * add more models * Update sdpa_attention.py * fix style * fix * readd warnings but not for flex * Update test_modeling_common.py * skip * fix --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-05-23 19:48:01 +02:00
Alex Brooks	e64ed0304c	Use Gradient Checkpointing Layer in Jamba & Blip Related Models (#38310 ) * Use gradient checkpointing class in blip classes * Use gradient checkpointing class in jamba/bamba	2025-05-23 19:35:25 +02:00
Matt	53fb245eb6	🚨 🚨 Inherited CausalLM Tests (#37590 ) * stash commit * Experiment 1: Try just Gemma * Experiment 1: Just try Gemma * make fixup * Trigger tests * stash commit * Try adding Gemma3 as well * make fixup * Correct attrib names * Correct pipeline model mapping * Add in all_model_classes for Gemma1 again * Move the pipeline model mapping around again * make fixup * Revert Gemma3 changes since it's a VLM * Let's try Falcon * Correct attributes * Correct attributes * Let's try just overriding get_config() for now * Do Nemotron too * And Llama! * Do llama/persimmon * Correctly skip tests * Fix Persimmon * Include Phimoe * Fix Gemma2 * Set model_tester_class correctly * Add GLM * More models! * models models models * make fixup * Add Qwen3 + Qwen3MoE * Correct import * make fixup * Add the QuestionAnswering classes * Add the QuestionAnswering classes * Move pipeline mapping to the right place * Jetmoe too * Stop RoPE testing models with no RoPE * Fix up JetMOE a bit * Fix up JetMOE a bit * Can we just force pad_token_id all the time? * make fixup * fix starcoder2 * Move pipeline mapping * Fix RoPE skipping * Fix RecurrentGemma tests * Fix Falcon tests * Add MoE attributes * Fix values for RoPE testing * Make sure we set bos_token_id and eos_token_id in an appropriate range * make fixup * Fix GLM4 * Add mamba attributes * Revert bits of JetMOE * Re-add the JetMOE skips * Update tests/causal_lm_tester.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Add licence --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-05-23 18:29:31 +01:00
Aaron V	d5f992f5e6	Enhance Model Loading By Providing Parallelism, Uses Optional Env Flag (#36835 ) * Get parallel loader working. Include tests. * Update the tests for parallel loading * Rename env variables. * Add docs for parallel model weight loading. * Touch up parallel model loading docs. * Touch up parallel model loading docs again. * Edit comment in test_modeling_utils_parallel_loading.py * Make sure HF_PARALLEL_LOADING_WORKERS is spelled correctly in modeling_utils.py * Correct times for parallelized loading, previous times were for a "hot" filesystem * Update parallel model loading so the spawn method is encapsulated. DRY up the code by leveraging get_submodule. * Update docs on model loading parallelism so that details on setting the multiprocessing start method are removed, now that the package handles this step internally. * Fix style on model loading parallelism changes. * Merge latest version of master's modeling_utils. * Removed unused variable. * Fix argument packing for the parallel loader. * Fix state dict being undefined in the parallel model loader. * Rename variables used in parallel model loading for clarity. Use get_module_from_name(). * Switch to the use of threads for parallel model loading. * Update docs for parallel loading. * Remove the use of json.loads when evaluating HF_ENABLE_PARALLEL_LOADING. Prefer simple casting. * Move parallelized shard loading into its own function. * Remove use of is_true(). Favor checking env var true values for HF_ENABLE_PARALLEL_LOADING. * Update copyright to 2025 in readme for paralell model loading. * Remove garbage collection line in load_shard_file, implicit garbage collection already occurs. * Run formatter on modeling_utils.py * Apply style fixes * Delete tests/utils/test_modeling_utils_parallel_loading.py --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>	2025-05-23 16:39:47 +00:00
Anton Vlasjuk	1ed19360b1	[`FlexAttention`] Reenable flex for encoder-decoder and make the test more robust (#38321 ) * reenable most flex attention test cases * style * trigger * trigger	2025-05-23 18:16:43 +02:00
Ita Zaporozhets	bb567d85a4	refactor can_save_slow_tokenizer (#37722 ) * refactor to rm property can_save_slow_tokenizer, it can be done within the if of save_vocab * move property to fast * revert if * check if vocab_file is attr * fix check for sp * fix if condition * fix if condition * fix if condition	2025-05-23 17:29:38 +02:00
Zhen	3c289e2104	[performance_optim] reduce frequency of declaring attention_mask in Ascend NPU flash attention (#38278 ) [performance_optim] reduce frequency of declaring attention_mask in ASCEND NPU flash attention	2025-05-23 17:24:51 +02:00
Arthur	f5d45d89c4	🚨Early-error🚨 config will error out if `output_attentions=True` and the attn implementation is wrong (#38288 ) * Protect ParallelInterface * early error out on output attention setting for no wraning in modeling * modular update * fixup * update model tests * update * oups * set model's config * more cases * ?? * properly fix * fixup * update * last onces * update * fix? * fix wrong merge commit * fix hub test * nits * wow I am tired * updates * fix pipeline! --------- Co-authored-by: Lysandre <hi@lysand.re>	2025-05-23 17:17:38 +02:00
Cyril Vallez	896833c183	Fix some tests (especially compile with fullgraph=True on Python<3.11) (#38319 ) * fix tests * better fix for python<3.11 * fixes * style	2025-05-23 17:11:40 +02:00
Yih-Dar	a63bc17416	add `vasqu` to `self-comment-ci.yml` (#38324 ) add vasqu Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-23 17:09:44 +02:00
Joao Gante	54cd86708d	[custom_generate] don't forward `custom_generate` and `trust_remote_code` (#38304 ) * prevent infinite loops * docs * more links to custom generation methods	2025-05-23 14:49:39 +00:00
Jinan Zhou	135163e9c5	Expose AutoModelForTimeSeriesPrediction for import (#38307 ) * expose AutoModelForTimeSeriesPrediction for import * add in docs	2025-05-23 13:09:29 +00:00
Joao Gante	a6b51e7341	[Whisper + beam search] fix usage of `beam_indices` (#38259 ) * tmp * fix test_tiny_token_timestamp_batch_generation * better comments * test * comments * Apply suggestions from code review Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> --------- Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>	2025-05-23 10:05:44 +00:00
Joao Gante	3e960e032d	[tf/flax] handle `forced_decoder_ids` deletion (#38316 ) fix tf/flax, attr checks	2025-05-23 09:44:58 +00:00
Ryan Mullins	9eb0a37c9e	Adds use_repr to model_addition_debugger_context (#37984 ) * Adds use_repr to model_addition_debugger_context * Updating docs for use_repr option	2025-05-23 09:35:13 +00:00
Abdessamad Enabih	38f9c5b15b	Fix typo: change 'env' to 'environment' in .circleci/config.yml (#38273 ) * Fix typo: change 'env' to 'environment' in .circleci/config.yml * Remove CIRCLE_TOKEN environment variable from artifact retrieval step --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2025-05-23 10:45:27 +02:00
Yuanyuan Chen	11b670a282	Fix run_slow (#38314 ) Signed-off-by: cyy <cyyever@outlook.com>	2025-05-23 10:18:30 +02:00
Raushan Turganbay	b01984a51d	[emu3] fix conversion script (#38297 ) * fix conversion script and update weights * fixup * remove commented line	2025-05-23 09:49:56 +02:00
Yaswanth Gali	2b585419b4	[Tests] Cleanup Janus Testcase (#38311 ) * Cleanup janus testcase * shift code to setup	2025-05-23 09:29:16 +02:00

19096 Commits