transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-08 23:30:08 +06:00

Author	SHA1	Message	Date
Eon Kim	5c47d08b0d	Add Swin2SR ImageProcessorFast (#37169 ) * Add fast image processor support for Swin2SR * Add Swin2SR tests of fast image processing * Update docs and remove unnecessary test func * Fix docstring formatting * Skip fast vs slow processing test --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>	2025-05-07 12:20:16 -04:00
Raushan Turganbay	17742bd9c8	🔴 [VLM] Add base model without head (#37033 ) * i guessreverted all CdGen classes * style * llava onevision * fix copies * fix some tests * some more tests * dump * skip these * nevermind, i am dumb * revert fix not needed * fixup * fixup * another fixup * more fixup to make ci finally happy * fixup after rebasing * fix qwen tests * add internVL + typos here and there * image token index -> id * style * fix init weights * revert blip-2 not supported * address comments * fix copies * revert blip2 test file as well * as discussed internally, revert back CdGen models * fix some tests * fix more tests for compile * CI red * fix copies * enumerate explicitly allowed models * address comments * fix tests * fixup * style again * add tests for new model class * another fixup ( x _ x ) * [fixup] unused attributes can be removed post-deprecation	2025-05-07 17:47:51 +02:00
eustlb	3fa8d9c20e	[CSM] tiny fix on generation (#38001 ) nit	2025-05-07 11:45:23 -04:00
eustlb	798f948e88	Add CSM model (#36719 ) * draft structure * depth decoder with forward pre hook * full model forward draft * draft update * depth decoder update * ConversationalSpeechModelForCausalLM udpates * add generate * max length criteria small fix * udpate * updates * generation update * update in loss compute * conversion script * update for correct input embeddings * handle interleaved rope * update * update * update * support compile * update training * add doc * update doc * correct inits * ConversationalSpeechModel -> Csm * conf update * name update * tests CsmForCausalLMTest * convert use cached_file * conf + modeling updates * generate utils handle third dim shape * integration test * modeling + conf updates * common test handle more than 2 dims * add nested audio list utils * processing handle nested audio list * csm processing draft * mimi util * init updates * modular update * convert modular * processing update * csm tests update * generate tests handle third dim * generate utils handle third dim * propagate _get_initial_cache_position update * tied_weight_keys update + convert correctly * fix inputs_embeds * revert audio nested list * batch inference update + return audio * audio_utils update * processor update * some more integration tests * remove old test * porcessing output labels * improve * fix * update rope values with equivalent ones * conversion update * udpate tests * handle depth decoder generation config * remove default eos_token_id * make style * revert modeling_mimi * add default generation_config * remove sdpa since handled by default * make * fix conflict * fix conflicts * correct naming * correct imports * make * causal -> conditional naming * causal -> conditional naming * auto update * make * make * add doc * test update * fix weight init * audio tokens offsets as buffer * 4d mask in conditional class * make * doc update * fix causal mask * fix causal mask * doc update * doc update * add processor doc * update doc * fix 4d causal mask * update make_list_of_audio * do not default to mutable * remove duplicates * remove useless reset_parameters * use GradientCheckpointingLayer * use can_return_tuple * formatting * prepend placeholder in _sample * torch compile fix * some more fixies * convert modular * fix * default max_length in convert * handle depth decoder generation config correctly * clearer formulation * handle output_loading_info * handle softmax warning * add doc * propagate _get_initial_cache_position changes * generation in its own module * add processor tests * fix compile witu cuda graphs * fix compile with cuda graphs * add csm.md * include CSM loss * doc nit * doc nit * doc nit * Update docs/source/en/model_doc/csm.md Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * add save_audio to processor * Update src/transformers/models/csm/modular_csm.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * doc update * simplify audio_codes_mask computation * doc update * simplify loss computation * fix static cache test * fix * remove comment * simplify encoded length computation * use hf-internal-testing * doc update * cast to float before numpy * nit * mem efficient codebook head * nit * cat input values with cutoffs --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2025-05-07 10:20:13 -04:00
Fiona Waters	c8607a17cb	Add a check to import_utils.py to allow for use of faiss_gpu installation (#37997 ) Adding check to import_utils.py for faiss_gpu	2025-05-07 14:27:41 +01:00
kaixuanliu	fb1e3a4daa	remove duplicate code (#37991 ) Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>	2025-05-07 13:46:45 +01:00
Raushan Turganbay	8a9441d26d	[chat template] separate jinja logic from tokenizers (#37602 ) * split oit jinja * raise error	2025-05-07 14:18:03 +02:00
Yao Matrix	038f8fc159	make aya vision 5 integration tests pass on xpu (#37990 ) * 5 aya vision integration pass on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com> Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2025-05-07 11:16:38 +02:00
Joao Gante	a9384f849a	[offload] respect `max_memory` argument when factoring in unused reserved memory (#37982 )	2025-05-07 09:49:31 +01:00
Guang Yang	0b037fd425	Fix Qwen models export with torch 2.7 (#37985 ) Co-authored-by: Guang Yang <guangyang@fb.com>	2025-05-07 09:13:08 +02:00
Aritra Roy Gosthipaty	3c0796aaea	[Fast Processor] BEiT (#37005 ) * adding fast processor for beit * adding resample * address review issues and add segmentation maps logic * style * chore: adding tests * reduce label test * adding batched tests * Update src/transformers/models/beit/image_processing_beit_fast.py Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> * fix imports and make segmentation masks * fix tests * build segmentation maps * all tests pass * style * style fix * style * chore: delete demo.py file * review suggestions * Update docs/source/en/model_doc/beit.md Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com> --------- Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>	2025-05-06 17:40:28 -04:00
Matt	ebbe9b12dd	Fix donut backtracking (#37788 ) * Fix donut backtracking * make fixup * Trigger tests * Remove old line * Update code * Fix reversed slice	2025-05-06 17:39:04 +01:00
Alex Brooks	06c4d05fe6	Enable granite speech 3.3 tests (#37560 ) * Enable granite speech 3.3 tests * skip sdpa test for granite speech * Explicitly move model to device * Use granite speech 2b in tests --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2025-05-06 17:56:18 +02:00
Joaquin Caballero	031ef8802c	fix FSDP + torch.compile bug when saving pretrained model (#37725 ) * args keep_torch_compile=False in _save and _wwrap_method * Fix FSDP execution on evaluation for torch_compile mode * add test trainer FSDP + Torch Compile * fix quality code * make style * Revert " make style" This reverts commit 77e797f8829c50992cc21496be3d9a3e480e1c97. * make style	2025-05-06 17:51:28 +02:00
Yao Matrix	5534b80b7f	enable xpu in test_trainer (#37774 ) * enable xpu in test_trainer Signed-off-by: YAO Matrix <matrix.yao@intel.com> * fix style Signed-off-by: YAO Matrix <matrix.yao@intel.com> * enhance _device_agnostic_dispatch to cover value Signed-off-by: Yao Matrix <matrix.yao@intel.com> * add default values for torch not available case Signed-off-by: YAO Matrix <matrix.yao@intel.com> --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-05-06 17:13:35 +02:00
Kyungmin Lee	7db5d5b9ea	Fix typo (#37964 )	2025-05-06 14:59:00 +01:00
Joao Gante	af2866a8b1	[speech2text] fix init of sinusoidal embeddings (#37931 ) * fix init (meta device -> bad numbers) * fast test * dont init sinusoidal twice * make fixup	2025-05-06 14:49:00 +01:00
omahs	274e79b326	Fix typos (#37978 ) fix typos	2025-05-06 14:45:20 +01:00
nlhm	057ae00504	Small typo lines 47 and 199 perf_infer_gpu_one.md (#37938 ) * Small typo line 199 perf_infer_gpu_one.md * Typo l. 47 perf_infer_gpu_one.md	2025-05-06 14:32:55 +01:00
湛露先生	cc68070d41	fix docs serving typos. (#37936 ) Signed-off-by: zhanluxianshen <zhanluxianshen@163.com>	2025-05-06 14:32:44 +01:00
Yih-Dar	b1375177fc	add job links to new model failure report (#37973 ) * update for job link * stye --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-06 15:10:29 +02:00
youngrok cha	acded47fe7	[llava] one pixel is missing from padding when length is odd (#37819 ) * [fix] one pixel should be added when length is odd * [fix] add vision_aspect_ratio args & typo * [fix] style * [fix] do not fix fast file directly * [fix] convert using modular * remove duplicate codes * match unpad logic with pad logic * test odd-sized images for llava & aria * test unpad odd-sized padding for llava family * fix style * add kwarg to onvision modular * move vision_aspect_ratio from image_processor to processor (llava_onevision)	2025-05-06 13:11:26 +02:00
Joao Gante	9981214d32	[tests] Smaller model in slow cache tests (#37922 )	2025-05-06 11:15:25 +01:00
Fanli Lin	ff5ef95db7	add xpu memory check (#37969 ) add xpu check	2025-05-06 11:57:49 +02:00
Pedro Sandoval	7cc78804ba	🚨🚨🚨 Fix forward of Dinov2ForImageClassification for models with registers (#37836 ) * add num_tokens_to_discard to the forward of Dinov2ForImageClassification * redefine forward in modular file, remove change to modeling_dinov2 file * run make fixup --------- Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>	2025-05-06 11:55:53 +02:00
Sukriti Sharma	471958b620	Add GraniteMoeHybrid support for 4.0 (#37658 ) * initial config and MLA layer Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * first pass at decoder Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * completion of layers Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * modeling class Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * adding hybrid class to imports Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix imports granitemoehybrid Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix granitehybrid imports Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix granitehybrid import Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix generated modeling file Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * add some comments Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * minor fixes in layers Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * add sharedMLP layer Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * correct layer names Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fixes in mamba config Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix mamba config Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * change name of MLP layer Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix seq mizer layers Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * correct mamba config Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fixes in param names Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * enable hybrid model Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * update config Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix config granite hybrid Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix attention layer Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * cleanup to re-use mamba code Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * keep layer types Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * attention bias cleanup Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * update mamba layer name Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * first pass at tests Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * first pass at tests Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * use granite attention Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * fix: self attn weights Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * pass at making pos_emb optional Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * initialize self_attn only as needed Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * overwrite forward to create HybridMambaCache Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> * Log invalid layer types * Add attention outputs test * Only emit attentions/logits if not None * Fix config test hidden size divisibility * mark granitmoehybrid as stateful * Initialize mamba convolutional layers * Formatting fixes * config docstring, removed some unused attrs * Fix missing arg in models test * Fix create and check decoder model test * support logits to keep in granitemoe * regen to pass logits_to_keep * Allow None or rope * Fix gradient checkpointing * Add granitemoehybrid as special cache for generate check * Remove unused MLA refs * Fix mamba layer mask * Remove logits to keep from config * Minor docstring nits * Update licenses * Enable cache by default * map layer types to layer block type * First pass at granite moe hybrid docs * Ignore granite moe hybrid in valid checkpoint check * Align attention interfaces * regenerate modular granitemoeshared attention interface * Align granite moe hybrid attn interface * run formatting * Handle mamba initialization * avoid conditional attr defs * Move hybrid layer validation to config * Add placeholder integration tests * Docs nits / Update model names * Clean up forward conditions * Use gradient checkpointing layer * Remove some copied bamba tests + inherit align test init delete more tests Use common layer init with bamba tests finish test consolidation * avoid redundant intermediate std var * use @can_return_tuple * Remove unused moe state * make skipped test names consistent * Fix docstring order * Add missing toc * Always create the shared mlp * Fix name in docstring * link preview model in docs --------- Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com> Co-authored-by: Alex-Brooks <Alex.Brooks@ibm.com>	2025-05-06 06:47:43 +02:00
Kyle Sayers	fe29b8c487	[Ready to Merge][HFQuantizer] Squelch pydantic warnings (#37726 ) replace dict with model_dump Signed-off-by: Kyle Sayers <kylesayrs@gmail.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>	2025-05-05 20:38:49 +02:00
Tanuj Rai	46c0e1ff80	Fix incorrect type annotation in get_auxiliary_logits (#37955 ) Correct type annotation from Dict(str, Tensor) to Dict[str, Tensor]	2025-05-05 19:00:49 +01:00
Jonas	d80f53fa50	[generate] Fix `vocab_size` access for multimodal models (#37937 ) Implements last migrations for generation from `config.vocab_size` to `config.get_text_config().vocab.size` In doing so, we enable multimodal models to fully leverage all existing generation features.	2025-05-05 15:56:56 +01:00
Yih-Dar	7819911b0c	Use T4 single GPU runner with more CPU RAM (#37961 ) larger T4 single GPU Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2025-05-05 16:17:45 +02:00
Joao Gante	3b067a15dd	[core] reuse unused reserved cuda memory when loading models (#37920 )	2025-05-05 15:14:05 +01:00
ivarflakstad	afbc293e2b	More fault tolerant notification service (#37924 ) * Let notification service succeed even when artifacts and reported jobs on github have mismatch * Use default trace msg if no trace msg available * Add pop_default helper fn * style	2025-05-05 15:19:48 +02:00
NielsRogge	36ca58bf4f	[D-FINE] Update names (#37957 ) * Update names * Fix modular --------- Co-authored-by: qubvel <qubvel@gmail.com>	2025-05-05 13:05:46 +01:00
Joao Gante	2932f318a2	[docs] logits docstring (#37929 )	2025-05-02 16:38:35 +01:00
Jerry Zhang	fa3c3f9cab	Break weight tying when quantizing input embedding (#37905 ) Summary: Currently when we try to quantize input_embedding for some models, the output embedding (lm_head) will also be quantized the same way, since they are tied, and this may not be what we want. To break the tie, we added the option to allow people to 1. load unquantized weight 2. tie weights 3. quantize so that the tie will be broken Test Plan: ``` from transformers import ( AutoModelForCausalLM, AutoProcessor, AutoTokenizer, TorchAoConfig, ) from torchao.quantization.quant_api import ( IntxWeightOnlyConfig, Int8DynamicActivationIntxWeightConfig, AOPerModuleConfig ) from torchao.quantization.granularity import PerGroup, PerAxis import torch model_id = "microsoft/Phi-4-mini-instruct" embedding_config = IntxWeightOnlyConfig( weight_dtype=torch.int8, granularity=PerAxis(0), ) linear_config = Int8DynamicActivationIntxWeightConfig( weight_dtype=torch.int4, weight_granularity=PerGroup(32), weight_scale_dtype=torch.bfloat16, ) quant_config = AOPerModuleConfig({"_default": linear_config, "model.embed_tokens": embedding_config}) quantization_config = TorchAoConfig(quant_type=quant_config, include_embedding=True, untie_embedding_weights=True) quantized_model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32, device_map="auto", quantization_config=quantization_config) tokenizer = AutoTokenizer.from_pretrained(model_id) print(quantized_model) print("embed_tokens.weight:", quantized_model.model.embed_tokens.weight) print("lm head weight:", quantized_model.lm_head.weight) from transformers.modeling_utils import find_tied_parameters print(find_tied_parameters(quantized_model)) ``` Reviewers: Subscribers: Tasks: Tags: Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-05-02 10:53:23 +02:00
Aritra Roy Gosthipaty	8a0a508f2b	Aligning modling code for GPT2 to work with vLLM (fallback) (#36934 ) * aligning for vllm * using input shape rather than attn outputs * remove demo * revert Conv1D * style * style * Update src/transformers/models/gpt2/modeling_gpt2.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * fix copies * Apply suggestions from code review Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> * adding docs about vllm * chore: style --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>	2025-05-02 09:55:16 +02:00
Federico Baldassarre	e94a4807df	Add usage example for DINOv2 (#37398 ) * Add usage example for DINOv2 * More explicit shape names * More verbose text * Moved example to Notes section * Indentation	2025-05-01 08:54:22 -07:00
Bogeum Kim	d20aa68193	🌐 [i18n-KO] Translated `gpu_selection.md` to Korean (#36757 ) * Add _toctree.yml * feat: serving.md draft * Add _toctree.yml * feat: gpu_selection.md nmt draft * fix: TOC edit * Update docs/source/ko/serving.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ko/gpu_selection.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/ko/serving.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update _toctree.yml --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-05-01 08:44:12 -07:00
woctordho	ee25d57ed1	Improve performance of `load_state_dict` (#37902 ) Improve performance of load_state_dict	2025-05-01 16:35:17 +02:00
Joao Gante	410aa01901	[chat] clean code and add base help (#37892 )	2025-05-01 15:12:18 +01:00
co63oc	5b573bebb9	Fix typos in strings and comments (#37910 )	2025-05-01 14:58:58 +01:00
Ita Zaporozhets	c80f65265b	🚨 rm already deprecated pad_to_max_length arg (#37617 ) * rm already deprecated padding max length * truncate_strategy AS AN ARG is already deprecated for a few years * fix * rm test_padding_to_max_length * rm pad_to_max_length=True in other tests * rm from common * missed fnet	2025-05-01 15:21:55 +02:00
Diogo Glória-Silva	7a3e208892	fixed gemma3 collection path pointing to llama 2 collection. (#37899 )	2025-04-30 12:50:54 -07:00
Jerry Zhang	86777b5e2f	Support `AOPerModuleConfig` and `include_embedding` (#37802 ) * Support `AOPerModuleConfig` and include_embedding Summary: This PR adds support per module configuration for torchao Also added per module quantization examples: 1. Quantizing different layers with different quantization configs 2. Skip quantization for certain layers Test Plan: python tests/quantization/torchao_integration/test_torchao.py -k test_include_embedding python tests/quantization/torchao_integration/test_torchao.py -k test_per_module_config_skip Reviewers: Subscribers: Tasks: Tags: * format * format * inlcude embedding remove input embedding from module not to convert * more docs * Update docs/source/en/quantization/torchao.md Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> * Update src/transformers/quantizers/quantizer_torchao.py Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> * Update src/transformers/quantizers/quantizer_torchao.py Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>	2025-04-30 20:16:29 +02:00
Sifal	c3aeaa8060	Enhance documentation to explain chat-based few-shot prompting (#37828 ) * Enhance documentation to explain chat-based few-shot prompting Updates the documentation on few-shot prompting to illustrate how to structure examples using the chat-based format for instruction-tuned models. * Update docs/source/en/tasks/prompting.md Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Update docs/source/en/tasks/prompting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tasks/prompting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tasks/prompting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tasks/prompting.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix typos --------- Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-04-30 11:00:10 -07:00
Mohamed Mekkouri	36e2e33bbe	Fix Qwen3 tp plan with FP8 (#37871 ) * update for qwen 3 * fix style * rm print	2025-04-30 18:14:10 +02:00
Joao Gante	8e8025b384	[tests] reset logs in `torch.compile` test (#37894 )	2025-04-30 16:04:28 +01:00
Joao Gante	1b222903c3	[tests] Test all cache implementations (#37873 )	2025-04-30 15:37:00 +01:00
Yan Zhao	2c1155519f	Support FlaxPreTrainedModel to load model checkpoint from local subfolder safetensors (#37732 ) Support FlaxPreTrainedModel to load model checkpoint from subfolder in local directory as safetensors format Signed-off-by: Yan Zhao <zhao.y4@northeastern.edu>	2025-04-30 16:13:23 +02:00
Arjuna Sky Kok	5b223bbc8c	update comment in image_processing_base.py to reference image_process… (#37864 ) update comment in image_processing_base.py to reference image_processing_utils_fast	2025-04-30 14:31:29 +01:00

... 9 10 11 12 13 ...

19383 Commits