transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-30 09:42:22 +06:00

Author	SHA1	Message	Date
Ahmed Almaghz	64b73e61f8	[i18n-ar] Translated file : `docs/source/ar/benchmarks.md` into Arabic (#33023 ) * Add docs/source/ar/benchmarks.md to Add_docs_source_ar_benchmarks.md * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/benchmarks.md Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update _toctree.yml * Update benchmarks.md --------- Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>	2024-11-26 09:23:11 -08:00
vansin	a0ba631519	Update the Python version in the Chinese README to match the English README. (#34870 ) Update Python Version	2024-11-26 09:22:34 -08:00
Joshua Lochner	1f6b423f0c	Fix torch.onnx.export of Qwen2-VL vision encoder (#34852 ) * Fix torch.onnx.export of Qwen2-VL vision encoder This PR fixes onnx export support for the vision encoder of Qwen2-VL, which converts the `cu_seqlens` to `torch.int32`, leading to errors later on when using the values for slicing. `c57eafdaa1/src/transformers/models/qwen2_vl/modeling_qwen2_vl.py (L1044-L1046)` ## Error: ``` onnx.onnx_cpp2py_export.shape_inference.InferenceError: [ShapeInferenceError] (op_type:Slice, node name: /blocks.0/attn/Slice_4): axes has inconsistent type tensor(int64) ``` ## Code to reproduce issue: ```py import requests from PIL import Image import torch from transformers import ( AutoProcessor, Qwen2VLForConditionalGeneration, ) # Constants VISION_MODEL_NAME = "vision_encoder.onnx" # Load model and processor model_id = "hf-internal-testing/tiny-random-Qwen2VLForConditionalGeneration" model = Qwen2VLForConditionalGeneration.from_pretrained(model_id).eval() processor = AutoProcessor.from_pretrained(model_id) # Prepare inputs url = "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-VL/assets/demo.jpeg" image = Image.open(requests.get(url, stream=True).raw) conversation = [ { "role": "user", "content": [ { "type": "image" }, { "type": "text", "text": "Describe this image."}, ], }, ] images = [image] text_prompt = processor.apply_chat_template(conversation, add_generation_prompt=True) inputs = processor(text=[text_prompt], images=images, padding=True, return_tensors="pt") ## Vision model vision_inputs = dict( pixel_values=inputs["pixel_values"], grid_thw=inputs["image_grid_thw"], ) vision_inputs_positional = tuple(vision_inputs.values()) vision_outputs = model.visual.forward(vision_inputs_positional) # Test forward pass torch.onnx.export( model.visual, args=vision_inputs_positional, f=VISION_MODEL_NAME, export_params=True, opset_version=14, do_constant_folding=True, input_names=list(vision_inputs.keys()), output_names=["image_features"], dynamic_axes={ 
"pixel_values": { 0: "batch_size grid_t * grid_h * grid_w", 1: "channel * temporal_patch_size * patch_size * patch_size", }, "grid_thw": {0: "batch_size"}, "image_features": {0: "batch_size * grid_t * grid_h * grid_w"}, }, ) # Load and check the exported model model import onnx model = onnx.load(VISION_MODEL_NAME) onnx.checker.check_model(model, full_check=True) inferred = onnx.shape_inference.infer_shapes(model, check_type=True) ``` * Formatting * [run-slow] qwen2_vl	2024-11-26 16:14:36 +01:00
Matt	d5cf91b346	Separate chat templates into a single file (#33957 ) * Initial draft * Add .jinja file loading for processors * Add processor saving of naked chat template files * make fixup * Add save-load test for tokenizers * Add save-load test for tokenizers * stash commit * Try popping the file * make fixup * Pop the arg correctly * Pop the arg correctly * Add processor test * Fix processor code * stash commit * Processor clobbers child tokenizer's chat template * Processor clobbers child tokenizer's chat template * make fixup * Split processor/tokenizer files to avoid interactions * fix test * Expand processor tests * Rename arg to "save_raw_chat_template" across all classes * Update processor warning * Move templates to single file * Move templates to single file * Improve testing for processor/tokenizer clashes * Improve testing for processor/tokenizer clashes * Extend saving test * Test file priority correctly * make fixup * Don't pop the chat template file before the slow tokenizer gets a look * Remove breakpoint * make fixup * Fix error	2024-11-26 14:18:04 +00:00
Yuxuan.Zhang	5a45617887	change apply_rotary_pos_emb of Glmmodel for GLM-Edge Series model (#34629 ) * change apply_rotary_pos_emb * upload for glm-edge * remove useless part * follow the suggestion * fix * format * format * test * format again * format again * remove modular change * remove modular change * this apply_rotary_pos_emb need modify? * fix with this * format * format * ruff check * modify modular_glm failed * remove partial_rotary_factor of function partial_rotary_factor * fix wrong change of examples/research_projects * revert * remove line 118 * use q_rot	2024-11-26 15:05:42 +01:00
Vladislav Bronzov	1141eff1bd	Add Pytorch Tensor Parallel support for Mistral (#34927 ) add base tp support	2024-11-26 14:28:07 +01:00
eustlb	4d1d0f29a4	[Whisper] Fix whisper integration tests (#34111 ) * fix test_tiny_timestamp_generation * fix test_large_timestamp_generation * fix test_whisper_shortform_single_batch_prev_cond * fix test_whisper_shortform_multi_batch_hard_prev_cond * return_timestamps necessary with long form * fix test_default_multilingual_transcription_long_form * fix test_tiny_token_timestamp_generation_longform * fix test_whisper_longform_multi_batch_hard * Update tests/models/whisper/test_modeling_whisper.py Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com> * fix typo * do not expect special tokens * fix test_whisper_longform_single_batch_beam * fix test_whisper_longform_multi_batch_hard_prev_cond * update test_whisper_longform_multi_batch_hard_prev_cond * update test_whisper_longform_multi_batch_hard_prev_cond * these tests does not make sense anymore * this test does not make sense anymore * make fixup * suggested nits * add test with forced_decoder_ids * this test does not make sense anymore * change assert for unittest test cases * make fixup * test with prompt_ids and task and language * fix unittest test case call * fix test_tiny_generation * fix test_tiny_en_generation * fix test_tiny_en_batched_generation * fix test_tiny_longform_timestamps_generation * fix test_tiny_timestamp_generation * fix test_large_generation * fix test_large_batched_generation * fix test_large_generation_multilingual * fix test_large_timestamp_generation * fix test_large_timestamp_generation * fix test_tiny_token_timestamp_generation_longform * fix test_tiny_en_batched_generation * make fixup * [run-slow] whisper --------- Co-authored-by: Yoach Lacombe <52246514+ylacombe@users.noreply.github.com>	2024-11-26 12:23:08 +01:00
Mohamed Mekkouri	0e805e6d1e	Skipping aqlm non working inference tests till fix merged (#34865 )	2024-11-26 11:09:30 +01:00
Raushan Turganbay	73b4ab1085	VideoLLaVA: add default values (#34916 ) add default values	2024-11-26 08:20:06 +01:00
Yoni Gozlan	bdb29ff9f3	Fix import structure for Fast Image processors (#34859 ) * Fix import structure image_processor_fast * update to new inits	2024-11-25 16:27:56 -05:00
xuzifei-dmatrix	bfc3556b20	making gpt2 fx traceable (#34633 ) * making gpt2 fx traceable * running make fix-copies * Revert "running make fix-copies" This reverts commit `5a3437cb5b`.	2024-11-25 19:30:38 +01:00
Viktor Scherbakov	95c10fedb3	Updated documentation and added conversion utility (#34319 ) * Updated documentation and added conversion utility * Update docs/source/en/tiktoken.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/tiktoken.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Moved util function to integration folder + allow for str * Update formatting Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Updated formatting * style changes --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-11-25 18:44:09 +01:00
Mohamed Mekkouri	890ea7de93	Fix failing GGML test (#34871 ) fix_test	2024-11-25 18:04:52 +01:00
Mohamed Mekkouri	b76a292bde	Upgrade torch version to 2.5 in dockerfile for quantization CI (#34924 ) * Upgrade Torch 2.5 * uncomment	2024-11-25 17:38:20 +01:00
Yih-Dar	a830df2909	Fix `test_auto_backbone_timm_model_from_pretrained` (#34877 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-11-25 17:20:41 +01:00
jiqing-feng	a464afbe2a	fix static cache data type mismatch (#34799 ) * fix gptj data type mismatch Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * add low precision static cache tests Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix low-precision static cache tests * fix format Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * avoid config change Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * change data type convert in cache copy Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix comment Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * cast key value after k v out Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>	2024-11-25 16:59:38 +01:00
Benjamin Bossan	b13916c09d	[AWQ, CI] Bump AWQ version used in docker image (#34922 ) The old AWQ version is failing with the latest (unreleased) transformers, giving the error: > ImportError: cannot import name 'shard_checkpoint' from 'transformers.modeling_utils' This has been resolved in awq v0.2.7: https://github.com/casper-hansen/AutoAWQ/pull/644	2024-11-25 16:49:57 +01:00
Mohamed Mekkouri	4e6b19cd95	Fix : BitNet tests (#34895 ) * fix_tests_bitnet * fix format	2024-11-25 16:47:14 +01:00
Shane A	9121ab8fe8	Rename OLMo November to OLMo2 (#34864 ) * Rename/move OLMo Nov files to OLMo2 * Rename Olmo1124 and its variants to Olmo2	2024-11-25 16:31:22 +01:00
dependabot[bot]	1de3598d30	Bump tornado from 6.4.1 to 6.4.2 in /examples/research_projects/lxmert (#34917 ) Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.4.1 to 6.4.2. - [Changelog](https://github.com/tornadoweb/tornado/blob/v6.4.2/docs/releases.rst) - [Commits](https://github.com/tornadoweb/tornado/compare/v6.4.1...v6.4.2) --- updated-dependencies: - dependency-name: tornado dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-11-25 15:19:29 +00:00
Jacky Lee	f4c04ba32b	Fix Qwen2 failing tests (#34819 ) * fix: qwen2 model ids * fix: line * fix: more format * update: reformat	2024-11-25 15:53:04 +01:00
Tom Aarsen	11cc2295c7	[`peft`] Given that `self.active_adapter` is deprecated, avoid using it (#34804 ) * Given that self.active_adapter is deprecated, avoid using it * Remove misleading comment - `self.active_adapter` is not used (and deprecated)	2024-11-25 15:29:52 +01:00
Donald Szeto	74db22f905	Fix convert_tokens_to_string when decoder is None (#34569 ) * Fix convert_tokens_to_string when decoder is None * revert unrelated changes --------- Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>	2024-11-25 14:35:24 +01:00
wanxiangchwng	97514a8ba3	chore: fix some typos (#34891 ) Signed-off-by: wanxiangchwng <cui.shuang@foxmail.com>	2024-11-25 13:05:59 +00:00
dependabot[bot]	62ab94dea8	Bump tornado from 6.4.1 to 6.4.2 in /examples/research_projects/visual_bert (#34887 ) Bump tornado in /examples/research_projects/visual_bert Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.4.1 to 6.4.2. - [Changelog](https://github.com/tornadoweb/tornado/blob/v6.4.2/docs/releases.rst) - [Commits](https://github.com/tornadoweb/tornado/compare/v6.4.1...v6.4.2) --- updated-dependencies: - dependency-name: tornado dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-11-25 12:54:55 +00:00
Meliksah Turker	c50b5675d6	prepare_fa2_from_position_ids function bugfix (#33269 ) contiguous() is called before view() for key and value within prepare_fa2_from_position_ids function	2024-11-25 13:51:26 +01:00
VictorAtIfInsurance	a0f4f3174f	allow unused input parameters passthrough when chunking in asr pipelines (#33889 ) * allow unused parameter passthrough when chunking in asr pipelines * format code * format * run fixup * update tests * update parameters to pipeline in test * updates parameters in tests * change spelling in gitignore * revert .gitignore to main * add git ignore of devcontainer folder * assert asr output follows expected inference output type * run fixup * Remove .devcontainer from .gitignore * remove compliance check	2024-11-25 11:36:44 +01:00
kang sheng	4dc1a69349	Sum gathered input tokens (#34554 ) * sum gathered input tokens * ruff line-length is 119, format the code --------- Co-authored-by: kangsheng <kangsheng@meituan.com>	2024-11-25 11:27:13 +01:00
Raushan Turganbay	1e492afd61	🔴 Mllama: fix base prefix (#34874 ) fix base prefix	2024-11-25 11:20:20 +01:00
Arthur	857d46ca0c	[`Deberta/Deberta-v2`] Refactor code base to support compile, export, and fix LLM (#22105 ) * some modification for roadmap * revert some changes * yups * weird * make it work * sttling * fix-copies * fixup * renaming * more fix-copies * move stuff around * remove torch script warnings * ignore copies * revert bad changes * woops * just styling * nit * revert * style fixup * nits configuration style * fixup * nits * will this fix the tf pt issue? * style * ??????? * update * eval? * update error message * updates * style * grumble grumble * update * style * nit * skip torch fx tests that were failing * style * skip the failing tests * skip another test and make style	2024-11-25 10:43:16 +01:00
Raushan Turganbay	098962dac2	BLIP: fix generation after hub update (#34876 ) * fix blip generation * dont remove it yet * Update src/transformers/models/blip_2/modeling_blip_2.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * address comments * modular --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-11-25 10:41:55 +01:00
Raushan Turganbay	c1a8520419	Cache: init empty cache when `use_cache` (#34274 ) * fix * fix tests * fix copies * add docs * Revert "add docs" This reverts commit `32d35634f1`. * qwen move deltas * mllama can potentially fullgraph compile * enable mllama compile and fix tests * remove mllama fixes	2024-11-25 10:11:33 +01:00
Dmitry Rogozhkin	1339a14dca	Add safe_globals to resume training on PyTorch 2.6 (#34632 ) Starting from version 2.4 PyTorch introduces a stricter check for the objects which can be loaded with torch.load(). Starting from version 2.6 loading with weights_only=True requires allowlisting of such objects. This commit adds allowlist of some numpy objects used to load model checkpoints. Usage is restricted by context manager. User can still additionally call torch.serialization.add_safe_globals() to add other objects into the safe globals list. Accelerate library also stepped into same problem and addressed it with PR-3036. Fixes: #34631 See: https://github.com/pytorch/pytorch/pull/137602 See: https://pytorch.org/docs/stable/notes/serialization.html#torch.serialization.add_safe_globals See: https://github.com/huggingface/accelerate/pull/3036 Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>	2024-11-25 10:03:43 +01:00
jeongin601	318fe25f22	Fix: Enable prefill phase key value caching of nemotron/minitron models (#34742 ) * modeling nemotron kv caching bugfix Signed-off-by: jeongin601 <0200angela@gmail.com> * test file deleted Signed-off-by: jeongin601 <0200angela@gmail.com> * code refinement Signed-off-by: jeongin601 <0200angela@gmail.com> * remove unused variables Signed-off-by: jeongin601 <0200angela@gmail.com> * import block sorted * removed deprecation warning Signed-off-by: jeongin601 <0200angela@gmail.com> * removed support for tuple shape past_key_values Signed-off-by: jeongin601 <0200angela@gmail.com> * Update conditional statement for cache initialization Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Signed-off-by: jeongin601 <0200angela@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2024-11-25 09:45:35 +01:00
Yoni Gozlan	3a8eb74668	Fix support for image processors modifications in modular (#34866 ) * add fix and examples * fix camel case naming	2024-11-22 18:14:24 -05:00
Mohamed Mekkouri	54be2d7ae8	Bitnet test fix to avoid using gated model (#34863 ) small test fix	2024-11-22 17:18:49 +01:00
Benjamin Bossan	286ffaaf0a	[CI] Skip EETQ tests while package is broken with latest transformers (#34854 ) * CI Skip EETQ tests while package is broken EETQ tries to import the shard_checkpoint function from transformers but the function has been removed. Therefore, trying to use EETQ currently results in an import error. This fix results in EETQ tests being skipped if there is an import error. The issue has been reported to EETQ: https://github.com/NetEase-FuXi/EETQ/issues/34 * Raise helpful error when trying to use eetq * Forget to raise the error in else clause	2024-11-22 17:13:30 +01:00
Andrés Marafioti	861758e235	smol improvements to support more flexible usage (#34857 ) * smol improvements to support more flexible usage * ruff	2024-11-22 16:34:38 +01:00
Nadav Timor	42b36d7395	Speculative decoding: Test the target distribution (to prevent issues like #32867 ) (#34553 ) * Update test_utils.py * formatting * Update test_utils.py * formatting * formatting * Update test_utils.py * formatting * Update test_utils.py * formatting * format * comments at standard positions	2024-11-22 16:02:37 +01:00
Arthur	597efd21d2	Auto compile when static cache (#34247 ) * generate with compile * nits * simple * generate with compile * nits * simple * safe * style * Update src/transformers/generation/utils.py Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co> * remove TOKENIZER forked warning --------- Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>	2024-11-22 15:33:35 +01:00
Konrad Kalita	d9e6f307e7	Remove quantization related config from dequantized model (#34856 ) * Remove quantization related config from dequantized model * Fix whitespace	2024-11-22 10:06:29 +01:00
Logan Adams	1867be666d	Update checks for torch.distributed.tensor to require torch >= 2.5 (#34816 ) * Update checks for torch.distributed.tensor * Update PR with feedback * Formatting fix for import order * Remove unused function	2024-11-22 10:05:26 +01:00
Raushan Turganbay	6a912ff2c5	Watermarking: fix order (#34849 ) fix watermarking order	2024-11-22 08:25:14 +01:00
Cyril Vallez	4e90b99ed9	Refactor StarCoder2 using modular (#34015 ) * Create modular_starcoder2.py * Update modular_starcoder2.py * update * finalize modular * revert # no-unravel * Add support * style * Update modular_model_converter.py * update docstring	2024-11-21 14:52:39 +01:00
Jonathan Mamou	18871599c9	Fix heuristic scheduling for UAG (#34805 ) * fix heuristic schedule * fix style * fix format	2024-11-21 14:46:35 +01:00
AbdelKarim ELJANDOUBI	d6a5c23f71	Fix ds nvme (#34444 ) * skip nested deepspeed.zero.Init call * make fixup * solve conflict * solve conflict * put back local * use context managers instead of local thread * Skip recursive calls to deepspeed.zero.Init * Skip recursive calls to deepspeed.zero.Init * back to old notebooks * make style	2024-11-21 13:52:22 +01:00
Vladislav Bronzov	ae5cbf804b	Improve gguf tensor processing (#34515 ) * add tensor processing system to separate logic for models * format refactoring * small fix * make some methods private * move custom methods to processors * refactor tensor processing * format fix	2024-11-21 13:40:49 +01:00
farrosalferro	c57eafdaa1	Add Nemotron GGUF Loading Support (#34725 ) * Add Nemotron GGUF Loading Support * fix the Nemotron architecture assignation --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>	2024-11-21 11:37:34 +01:00
Quentin Gallouédec	d4e1acbb7c	Change logging level from warning to info for `max_steps` overriding `num_train_epochs` (#34810 ) Update trainer.py	2024-11-21 11:37:02 +01:00
Raushan Turganbay	28fb02fc05	VLMs: enable generation tests - last batch (#34484 ) * add tests for 3 more vlms * fix fuyu back * skip test	2024-11-21 11:00:22 +01:00


17807 Commits