* Updated Albert model card
* Update docs/source/en/model_doc/albert.md
added the quotes in <hfoption id="Pipeline">
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/albert.md
updated checkpoints
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/albert.md
changed !Tips description
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/albert.md
updated text
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/albert.md
updated transformers-cli implementation
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/albert.md
changed text
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/albert.md
removed repeated description
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update albert.md
removed lines
* Update albert.md
updated pipeline code
* Update albert.md
updated the AutoModel code, removed quantization since the model is not large, and removed the attention visualizer part
* Update docs/source/en/model_doc/albert.md
updated notes
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update albert.md
removed a repeated point in the notes
* Update docs/source/en/model_doc/albert.md
updated transformers-cli
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/albert.md
removed extra notes
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* adding model and conversion scripts
* add imports to test vjepa conversion
* fix imports and make conversion work
* fix computation for short side
* replace attention with library attention function
* cleanup more attention classes
* remove config overrides
* add test cases, fix some of the failing ones
* fix the model outputs
* fix outputs of the model per review
* fix the too-big model test case
* fix styling in __init__.py
* fix initialization test
* remove all asserts per review
* update sorting/unsorting logic per feedback
* remove is_video per review
* remove another is_video segment
* remove unwanted stuff
* small fixes
* add docstrings for the model
* revert adding vjepa2 config here
* update styling
* add config docstrings (wip)
* fix dpr issue
* resolved failing test issues
* update styles
* merge predictor configs into main config
* remove processing code, add video processor
* remove permute which is not necessary now
* fix styles
* updated vjepa2 to be in video_processing_auto
* update comment for preprocessing
* test integration test and fix the outputs
* update test values, change test to look at repeated frames for a given image
* add a simple video processing test
* refactor pixel_values_videos and upload checkpoints to original
* fix torch_fx test cases
* remove unused config
* add all config docstrings
* add more integration tests
* add basic doc
* revert unwanted styling changes
* working make fixup
* Fix model_type in config
* Add ForVideoClassification model
* update attention implementation to fit new hf standards
* fix the preprocessing logic, ensure it matches the original model
* remove use_rope logic, cleanup
* fix docstrings
* Further cleanup, update doc
* Fix model prefix
* fix get_vision_features
* VJEPA2Embeddings style refactor
* nit, style comment
* change modules default values
* Only `str` activation in config
* GradientCheckpointingLayer
* fixup
* fix conversion script
* Remove return_dict
* remove None return typehint
* Refactor VJEPA2Layer, remove use_SiLU
* Fix fx tests
* dpr -> drop_path_rates
* move *ModelOutput on top
* format docs bit
* update docs
* update docs
* update doc example
* remove prune_heads from model
* remove unused config params
* refactor embed signature
* Add vjepa to docs
* Fix config docstring
* attention head
* update defaults
* Update docs/source/en/model_doc/vjepa2.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Update docs/source/en/model_doc/vjepa2.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
* Fix import
* Min refactoring
* Update HUB_SOURCE and HUB_REPO in conversion script
* Add missing headers
* VJEPA -> V-JEPA in docs
* Add image to doc
* fix style
* fix init weights
* change checkpoint name in modeling tests
* Initial cls head setup
* remove rope attention from head (not needed)
* remove swigluffn - not needed
* Add siglip layer
* Replace with siglip layer
* Rename Siglip - VJEPA2
* remove unused modules
* remove siglip mlp
* nit
* remove MLP
* Refactor head cross attention
* refactor VJEPA2HeadCrossAttentionLayer
* nit renaming
* fixup
* remove commented code
* Add cls head params to config
* depth from config
* move pooler + classifier to the model
* Update for cls model signature
* move layers, rename a bit
* fix docs
* update weights init
* remove typehint for init
* add to auto-mapping
* enable tests
* Add conversion script
* fixup
* add to docs
* fix docs
* nit
* refactor for mapping
* clean
* Add integration test
* Fixing multi-GPU test
* update not-split-modules
* update video cls test tolerance
* Increase test_inference_image tolerance
* Update no-split modules for multi gpu
* Apply suggestions from code review
* fixing multi-GPU
* fix docstring
* Add cls snippet to docs (see the sketch below)
* Update checkpoint
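For reference, the classification snippet mentioned above might look roughly like this sketch; the checkpoint name, input shape, and processor call are assumptions rather than the exact docs snippet:

```python
import torch
from transformers import AutoVideoProcessor, VJEPA2ForVideoClassification

model_id = "facebook/vjepa2-vitl-fpc16-256-ssv2"  # assumed checkpoint name
processor = AutoVideoProcessor.from_pretrained(model_id)
model = VJEPA2ForVideoClassification.from_pretrained(model_id)

# Stand-in clip: 16 random frames of shape (channels, height, width)
video = torch.randint(0, 256, (16, 3, 256, 256), dtype=torch.uint8)
inputs = processor(video, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits
print(model.config.id2label[logits.argmax(-1).item()])
```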
* Refactor DBRX tests to use CausalLMModelTest base classes
- Changed DbrxModelTester to inherit from CausalLMModelTester
- Changed DbrxModelTest to inherit from CausalLMModelTest
- Removed duplicate methods that are already in base classes
- Added required class attributes for model classes
- Updated pipeline_model_mapping to include feature-extraction
- Kept DBRX-specific configuration and test methods
- Disabled RoPE tests as DBRX's rotary embedding doesn't accept a config parameter
This refactoring reduces code duplication and follows the pattern established
in other causal LM model tests like Gemma.
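A minimal sketch of the pattern described above, assuming the base tester exposes hooks along these lines (the attribute names mirror the notes, not the exact test-suite API):

```python
import unittest

from transformers import DbrxConfig, DbrxForCausalLM, DbrxModel

# In the real test suite these bases come from the shared causal LM tester
# module; the import path and attribute names here are illustrative.
from causal_lm_tester import CausalLMModelTest, CausalLMModelTester


class DbrxModelTester(CausalLMModelTester):
    config_class = DbrxConfig  # DBRX-specific configuration kept here


class DbrxModelTest(CausalLMModelTest, unittest.TestCase):
    model_tester_class = DbrxModelTester
    pipeline_model_mapping = {
        "feature-extraction": DbrxModel,  # added per the refactor notes
        "text-generation": DbrxForCausalLM,
    }
```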
* Apply style fixes
* Trigger tests
* Refactor DBRX test
* Make sure the DBRX-specific settings are handled
* Use the attribute_map
* Fix attribute map
---------
Co-authored-by: openhands <openhands@all-hands.dev>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
* Simplify and update trl examples
* Remove optim_args from SFTConfig in Trainer documentation
* Update docs/source/en/trainer.md
* Apply suggestions from code review
* Update docs/source/en/trainer.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Quentin Gallouédec <qgallouedec@Quentins-MacBook-Pro.local>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Unbreak optimum-executorch
* use static cache if the config has layer_types but no sliding_window (sketched below)
* revert view on kv_arange
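The cache-selection heuristic above might look roughly like this sketch; the function name and the non-static fallback are assumptions, not the actual optimum-executorch code:

```python
def choose_cache_implementation(config) -> str:
    # Hypothetical sketch of the heuristic described above.
    layer_types = getattr(config, "layer_types", None)
    sliding_window = getattr(config, "sliding_window", None)
    if layer_types is not None and sliding_window is None:
        # Every layer is effectively full attention, so a static cache is
        # safe and export-friendly for ExecuTorch.
        return "static"
    return "hybrid"  # mixed sliding-window / full-attention layers
```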
---------
Co-authored-by: Guang Yang <guangyang@fb.com>
* update docs with new info
* Update docs/source/en/kv_cache.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Moved the sources to the right
* small changes
* Some changes to Moonshine
* Added the install to the pipeline
* updated the Moonshine model card
* Update docs/source/en/model_doc/moonshine.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/moonshine.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/moonshine.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/moonshine.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/model_doc/moonshine.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Updated documentation according to changes
* Fixed the model with the commits
* Update moonshine.md
* Update moshi.md
---------
Co-authored-by: Your Name <sohamprabhu@Mac.fios-router.home>
Co-authored-by: Your Name <sohamprabhu@Sohams-MacBook-Air.local>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* remove it from all py files
* remove it from the doc
* remove it from examples
* style
* remove traces of _fast_init
* Update test_peft_integration.py
* CIs
* apply updates to SmolVLM (still needs a workaround for the chat template)
* add other models
* dump qwen omni for now, come back later
* port qwen omni from their impl
* wait, all Qwens sample videos the same way!
* clean up
* make smolvlm backwards compatible and fix padding
* fix some tests
* fix smolvlm tests
* more clean up and test fixing
* delete unused arg
* fix
* address comments
* style
* fix test
* chore(pixtral): omit block attention mask when using flash attention
Since flash_attention_2 relies solely on position_ids, omitting the block attention mask avoids unnecessary memory usage and prevents OOM on large inputs (see the sketch below).
* remove unnecessary attention_mask assignment
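A sketch of the idea, not the exact Pixtral code; build_block_attention_mask is a hypothetical helper standing in for the existing mask construction:

```python
def get_attention_mask(config, image_sizes):
    # flash_attention_2 infers sequence boundaries from position_ids, so
    # materializing the block mask would only waste memory and risk OOM.
    if config._attn_implementation == "flash_attention_2":
        return None
    return build_block_attention_mask(image_sizes)  # hypothetical helper
```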
* Update Pegasus model card
* Fix transformers-cli command
* Update code examples to use bfloat16
* Reverted code examples to use float16
* Fix typo, update checkpoints link
* Update str formatting in code examples
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Fix typo
* Remove inaccurate badges
* Revert badge removal
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Include cache_implementation argument in quantization example
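Such an example might look like the sketch below; the exact checkpoint and arguments in the model card may differ, and static-cache support for this model is an assumption:

```python
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer, BitsAndBytesConfig

tokenizer = AutoTokenizer.from_pretrained("google/pegasus-xsum")
model = AutoModelForSeq2SeqLM.from_pretrained(
    "google/pegasus-xsum",
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
)

text = "PEGASUS was pre-trained with gap sentence generation."
inputs = tokenizer(text, return_tensors="pt").to(model.device)
summary_ids = model.generate(**inputs, cache_implementation="static")
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```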
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>