transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-16 11:08:23 +06:00

History

Ryan Mullins 50d3530aa0 Gemma3 (#36658 ) * Fix converter * [Broken] Adds Gemma 3 to Hugging Face Transformers * Consolidating Config and Processor params across impls * Sorting out configuration parameters. Adds qk_norm before RoPE. Still not sure if RoPE is right. * Additional plumbing for CausalLM and ConditionalGeneration variants * incomplete draft of Orbax conversion script * More complete checkpoint conversion * Supporting Gemma 3 1B checkpoints * Updating RoPE for multiple frequencies * Adjustments to rotary embedder * Proof of life for text-only operation * Updating the conversion script to handle multimodal projection weights * Fixing tet-only conversions * Cleaner conversion script with multimodal support and a simpler processor * Additional refatcors to the Gemma3Processor * Simplified Processor to work over text representations * Updated conversion script to join text and vision embeddings at converion time * Logging for debugging * Update src/transformers/models/gemma2/modeling_gemma2.py Co-authored-by: Joshua Lochner <admin@xenova.com> * Removed extraneous Config params * Switching to fast tokenizer for checkpoint conversions * isolating siglip for performance tetsing * Minor changes for debugging tests against baselines * Adding average pooling for soft tokens * Updating processor code to enable simpler embedding interleaving for arbitrary number of images in prompts * Updating conversion script for ShieldGemma 2 conversion compatibility * Allow disable_compile to be provided as a kwarg * Refresh from modular * Updated conversion script and corrected sliding window * Fix type mismatch in cache_position (#4) * Fix dtype (#5) * Fix type mismatch in cache_position * Actually fix in the modular file Co-authored-by: Aritra Roy Gosthipaty <aritra.born2fly@gmail.com> --------- Co-authored-by: Aritra Roy Gosthipaty <aritra.born2fly@gmail.com> * fixes for embedding table overflow and missing image_soft_token_mask from Gemma3Processor * Adding 2D pooling for image embeddings * Revert "Adding 2D pooling for image embeddings" This reverts commit `65350cf531`. * Gemma3 average pooling changed from 1D to 2D * Major refactor to Gemma3MultimodalInputProjection * Updating Gemm 3 Auto* registrations * Add option to save Gemma 3 chat template with tokenizer during weights conversion * Removing unused imports * Moving out-of-vocab handling from Gemma3Processor to Gemma3ForConditionalGeneration * Removing duplicate config property * Removing final logit softcapping and 1-indexing of position ids * Fixing image processor config and none --> None typo * Fixing sliding window size for 1B * Updating image_mean and image_std in Image Processor * Attention masking changed to lower triangular * Moving image special tokens to conversion script * Mirror image processor defaults from conversion script into Gemma3ProcessorKwargs * Remove special token variables from symbol space * Moving image soft token mask computation from Gemma3Processor to Gemma3ForConditionalGeneration * tie lm_head and embedding weights Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> * Correct tied weights in Gemma3CausalLM * iterative bidirectional attention * resolving merge conflicts * Reverting to Gemma 2 HybridCache with sldiing window support and a sliding_window_pattern of 6 * Correcting RoPE scaling * clean up first pass, dummy model geenration works * final clean up before fixing tests * causal lm test works, so fine * Fix conversion * Update src/transformers/models/gemma3/processing_gemma3.py * model tests are happy * processor tests are happy * image processing tests added * fixup * Fix pre-processing in conversion * Inputs merging * Do not normalize vision embeddings * Apply Ryan's (and team) changes to attention * token type ids + mask * template * move embed scale, add rope scale, fix tests * Add chat template to tokenizer * Use prefix for causal model loading * use existing code for sliding mask from gemma2 * self.embed_tokens already normalizes * Correcting Gemma3TextConfig parameters in conversion script * typo, modular overwrites my fixes * enable device map for text model * Conversion updates * ultra nit: no einsums * update image token * copy deepcopy config + some docs * add some test, still WIP * Refactoring --include_chat_tempalte logic in converter * Update src/transformers/models/gemma3/modular_gemma3.py Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com> * Add eos tokens for instruct models * dump so i can work on dgx * Removing add_bos by default * dump * add fast im proc * docs for PaS + fixup * another fixup * one more fixup * fix tests * Inverting prior BOS change * ultra nit * Reverting to Tokenizer saved with add_bos_token=True and chat template starting with BOS * resize embeds, remove sqrt, add slow test outputs * FA2 but quality is meh * nit * skip FA2, no idea what happened * last bit for green CI * please, green CI for docs * T_T * Fix for Gemma3 logits * Support both options for system prompt * Update src/transformers/models/gemma3/image_processing_gemma3_fast.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/model_doc/gemma3.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/model_doc/gemma3.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/model_doc/gemma3.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/model_doc/gemma3.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update docs/source/en/model_doc/gemma3.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Docs updates now that assets are live * Style fixes --------- Co-authored-by: Joshua Lochner <admin@xenova.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Aritra Roy Gosthipaty <aritra.born2fly@gmail.com> Co-authored-by: Mayank Chaturvedi <imayank@google.com> Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com> Co-authored-by: raushan <raushan@huggingface.co> Co-authored-by: Raushan Turganbay <raushan.turganbay@alumni.nu.edu.kz> Co-authored-by: Xuan-Son Nguyen <thichthat@gmail.com> Co-authored-by: Lysandre <hi@lysand.re>		2025-03-12 09:06:17 +01:00
..
internal	Implement AsyncTextIteratorStreamer for asynchronous streaming (#34931 )	2024-12-20 12:08:12 +01:00
main_classes	Integrate SwanLab for offline/online experiment tracking and local visualization (#36433 )	2025-03-06 17:35:30 +01:00
model_doc	Gemma3 (#36658 )	2025-03-12 09:06:17 +01:00
quantization	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
tasks	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
_config.py	Add optimized `PixtralImageProcessorFast` (#34836 )	2024-11-28 16:04:05 +01:00
_redirects.yml	Docs / Quantization: Redirect deleted page (#31063 )	2024-05-28 18:29:22 +02:00
_toctree.yml	[docs] Serving LLMs (#36522 )	2025-03-10 13:14:19 -07:00
accelerate.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
add_new_model.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
add_new_pipeline.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
agents.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
attention.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
backbones.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
cache_explanation.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
chat_extras.md	Update chat_extras.md with content correction (#36599 )	2025-03-07 13:09:02 +00:00
chat_templating_multimodal.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
chat_templating_writing.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
chat_templating.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
community.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
contributing.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
conversations.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
custom_models.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
debugging.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
deepspeed.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
executorch.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
fast_tokenizers.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
feature_extractors.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
fsdp.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
generation_features.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
generation_strategies.md	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
gguf.md	Fix gguf docs (#36601 )	2025-03-11 15:29:14 +01:00
glossary.md	Fix typos (#31819 )	2024-07-08 11:52:47 +01:00
gpu_selection.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
how_to_hack_models.md	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
hpo_train.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
image_processors.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
index.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
installation.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
kv_cache.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
llm_optims.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
llm_tutorial_optimization.md	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
llm_tutorial.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
model_memory_anatomy.md	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
model_sharing.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
model_summary.md	model_summary.md - Restore link to Harvard's Annotated Transformer. (#29702 )	2024-03-23 18:29:39 -07:00
models.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
modular_transformers.md	fix: argument (#36558 )	2025-03-06 13:11:19 -08:00
notebooks.md	Enable doc in Spanish (#16518 )	2022-04-04 10:25:46 -04:00
optimizers.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pad_truncation.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
peft.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_hardware.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
perf_infer_cpu.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_infer_gpu_multi.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_infer_gpu_one.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_torch_compile.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_cpu_many.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_cpu.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_gpu_many.md	Mention UltraScale Playbook 🌌 in docs (#36589 )	2025-03-06 14:48:11 -08:00
perf_train_gpu_one.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_special.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perf_train_tpu_tf.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
perplexity.md	[docs] use device-agnostic API instead of cuda (#34913 )	2024-11-26 09:23:34 -08:00
philosophy.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
pipeline_gradio.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pipeline_tutorial.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
pipeline_webserver.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pr_checks.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
processors.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
quicktour.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
run_scripts.md	Remove research projects (#36645 )	2025-03-11 13:47:38 +00:00
serialization.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
serving.md	[docs] Serving LLMs (#36522 )	2025-03-10 13:14:19 -07:00
task_summary.md	[doctest] Fixes (#35863 )	2025-01-26 15:26:38 -08:00
tasks_explained.md	fix: Wrong task mentioned in docs (#34757 )	2024-11-18 18:42:28 +00:00
testing.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
tf_xla.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
tflite.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
tokenizer_summary.md	[docs] Spanish translation of tokenizer_summary.md (#31154 )	2024-06-03 16:52:23 -07:00
tools.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
torchscript.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
trainer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
training.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
troubleshooting.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00