transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Matt	6e56b83453	Update chat template docs and bump Jinja version (#31455 ) * Update chat template docs * Minor bug in the version check * Update docs/source/en/chat_templating.md Co-authored-by: Joshua Lochner <admin@xenova.com> * Update docs/source/en/chat_templating.md Co-authored-by: Joshua Lochner <admin@xenova.com> * Update docs/source/en/chat_templating.md Co-authored-by: Joshua Lochner <admin@xenova.com> * Replace backticks with bolding because the doc builder was trying to parse them * Replace backticks with bolding because the doc builder was trying to parse them * Replace backticks with bolding because the doc builder was trying to parse them * More cleanups to avoid upsetting the doc builder * Add one more tip at the end --------- Co-authored-by: Joshua Lochner <admin@xenova.com>	2024-06-18 14:16:30 +01:00
Matt	28316d0e8b	Fix single letter stop strings (#31448 ) * Fix single letter stop strings * Change the 0 to a 1 to avoid potential empty vector headaches later * Restructure for clarity * Update tests/generation/test_stopping_criteria.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Add the unsqueeze --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-06-18 14:07:16 +01:00
Matt	dabf01973a	Make "tool_use" the default chat template key when tools are passed (#31429 ) * Make "tool_use" the default when tools are passed * Add some opinionated text to the docs * Add some opinionated text to the docs	2024-06-18 13:54:42 +01:00
Joao Gante	cd71f9381b	Donut: fix `generate` call from local path (#31470 ) * local donut path fix * engrish * Update src/transformers/generation/utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-06-18 13:28:06 +01:00
dependabot[bot]	76289fbc7c	Bump urllib3 from 1.26.18 to 1.26.19 in /examples/research_projects/decision_transformer (#31459 ) Bump urllib3 in /examples/research_projects/decision_transformer Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.18 to 1.26.19. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/1.26.19/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.18...1.26.19) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-18 12:22:25 +01:00
Aymeric Roucher	b38612d312	Agents: Improve python interpreter (#31409 ) * Improve Python interpreter * Add with and assert statements * Prevent overwriting existing tools * Check interpreter errors are well logged in code agent * Add lazy evaluation for and and or * Improve variable assignment * Fix early return statements in functions * Add small import fix on interpreter tool	2024-06-18 11:55:36 +02:00
Kevin Hu	1f9387d33d	Fix typing errors in `Qwen2ForTokenClassification` (#31440 ) * Update modeling_qwen2.py * Fix llama * More fixes	2024-06-18 10:27:18 +01:00
Kerim	9ba9369a25	simple fix (#31456 )	2024-06-17 22:30:37 +01:00
Ella Charlaix	02300273e2	🚨 Remove dataset with restrictive license (#31452 ) remove dataset with restrictive license	2024-06-17 17:56:51 +01:00
Albert Villanova del Moral	a14b055b65	Pass datasets trust_remote_code (#31406 ) * Pass datasets trust_remote_code * Pass trust_remote_code in more tests * Add trust_remote_dataset_code arg to some tests * Revert "Temporarily pin datasets upper version to fix CI" This reverts commit `b7672826ca`. * Pass trust_remote_code in librispeech_asr_dummy docstrings * Revert "Pin datasets<2.20.0 for examples" This reverts commit `833fc17a3e`. * Pass trust_remote_code to all examples * Revert "Add trust_remote_dataset_code arg to some tests" to research_projects * Pass trust_remote_code to tests * Pass trust_remote_code to docstrings * Fix flax examples tests requirements * Pass trust_remote_dataset_code arg to tests * Replace trust_remote_dataset_code with trust_remote_code in one example * Fix duplicate trust_remote_code * Replace args.trust_remote_dataset_code with args.trust_remote_code * Replace trust_remote_dataset_code with trust_remote_code in parser * Replace trust_remote_dataset_code with trust_remote_code in dataclasses * Replace trust_remote_dataset_code with trust_remote_code arg	2024-06-17 17:29:13 +01:00
Bastien Le Chenadec	485fd81471	Support multiple validation datasets when `dataloader_persistent_workers=True` (#30627 ) * Support multiple validation datasets when dataloader_persistent_workers=True * Test support of multiple validation datasets	2024-06-17 16:58:39 +01:00
dependabot[bot]	147c404fb1	Bump idna from 2.8 to 3.7 in /examples/research_projects/visual_bert (#30201 ) Bumps [idna](https://github.com/kjd/idna) from 2.8 to 3.7. - [Release notes](https://github.com/kjd/idna/releases) - [Changelog](https://github.com/kjd/idna/blob/master/HISTORY.rst) - [Commits](https://github.com/kjd/idna/compare/v2.8...v3.7) --- updated-dependencies: - dependency-name: idna dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-17 16:39:42 +01:00
Fanli Lin	9454f437b0	[tests] make `TestDeepSpeedModelZoo` device-agnostic (#31402 ) * fix * use accelerator device count * ci fix	2024-06-17 16:42:57 +02:00
dependabot[bot]	7977f206dc	Bump idna from 2.8 to 3.7 in /examples/research_projects/lxmert (#30200 ) Bumps [idna](https://github.com/kjd/idna) from 2.8 to 3.7. - [Release notes](https://github.com/kjd/idna/releases) - [Changelog](https://github.com/kjd/idna/blob/master/HISTORY.rst) - [Commits](https://github.com/kjd/idna/compare/v2.8...v3.7) --- updated-dependencies: - dependency-name: idna dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-17 15:13:33 +01:00
dependabot[bot]	ee197e2b9e	Bump idna from 3.3 to 3.7 in /examples/research_projects/decision_transformer (#30203 ) Bump idna in /examples/research_projects/decision_transformer Bumps [idna](https://github.com/kjd/idna) from 3.3 to 3.7. - [Release notes](https://github.com/kjd/idna/releases) - [Changelog](https://github.com/kjd/idna/blob/master/HISTORY.rst) - [Commits](https://github.com/kjd/idna/compare/v3.3...v3.7) --- updated-dependencies: - dependency-name: idna dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-17 11:13:16 +01:00
Joao Gante	377e903928	Generate: fix `tokenizer` being popped twice (#31427 )	2024-06-17 10:36:10 +01:00
amyeroberts	02c525d226	Rename misnamed image processor test files (#31430 )	2024-06-17 10:21:28 +01:00
Yoach Lacombe	7ae4fc271d	Fix Bark logits processors device misplacement (#31416 ) Fix Logits Processors device misplacement	2024-06-17 09:54:06 +02:00
Raushan Turganbay	9af1b6a80a	Musicgen special tokens in tensors (#31420 ) fix	2024-06-17 10:09:27 +05:00
Dmitry Rogozhkin	eed9ed6798	xpu: support xpu backend from stock pytorch (>=2.4) (#31238 ) * xpu: support xpu backend from stock pytorch (>=2.4) Fixes: https://github.com/huggingface/transformers/issues/31237 XPU backend is available in the stock PyTorch starting from version 2.4, see [1]. This commit extends huggingface transformers to support XPU from both IPEX and the stock pytorch. IPEX is being tried first. See: https://github.com/pytorch/pytorch/issues/114842 Requires: https://github.com/huggingface/accelerate/pull/2825 Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com> * xpu: enable gpt2 and decision_transformer tests for xpu pytorch backend Note that running xpu tests requires TRANSFORMERS_TEST_DEVICE_SPEC=spec.py passed to the test runner: import torch DEVICE_NAME = 'xpu' MANUAL_SEED_FN = torch.xpu.manual_seed EMPTY_CACHE_FN = torch.xpu.empty_cache DEVICE_COUNT_FN = torch.xpu.device_count Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com> --------- Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>	2024-06-14 21:31:35 +02:00
amyeroberts	20812237ce	Remove empty create_and_test_config_common_properties tests (#31359 ) Remove empty tests	2024-06-14 20:15:48 +01:00
amyeroberts	3d0bd86915	Install the tensorflow example requirements in docker (#31428 )	2024-06-14 19:35:43 +01:00
amyeroberts	11f43c15d3	Remove duplicate image processor in auto map (#31383 )	2024-06-14 18:23:55 +01:00
Ian McKenzie	c212ac9a02	Change potential `inputs_embeds` padding `logger.warning` to `logger.warning_once` (#31411 ) change embeddings padding warning to warning_once	2024-06-14 17:36:15 +01:00
Yoach Lacombe	7e1c7dc8b6	Fix SpeechT5 `decoder_attention_mask` shape (#28071 ) * Fix SpeechT5 * add test foward with labels and attention mask * make style	2024-06-14 15:20:11 +02:00
Yoach Lacombe	d9daeff297	Set seed for M4T retain grad test (#31419 )	2024-06-14 14:48:04 +02:00
Yoach Lacombe	43ee58588b	Fix MusicGen SDPA (#31208 ) * fix sdpa musicgen * make style * remove copied from statement from Musicgen SDPA	2024-06-14 13:30:44 +02:00
amyeroberts	833fc17a3e	Pin datasets<2.20.0 for examples (#31417 )	2024-06-14 12:06:56 +01:00
Mansu Kim	cfb22e035e	Support Clip QKV for MPT (#31307 )	2024-06-14 11:47:06 +01:00
Albert Villanova del Moral	b7672826ca	Temporarily pin datasets upper version to fix CI (#31407 ) Temporarily pin datasets upper version	2024-06-13 18:01:18 +01:00
Jade Choghari	67a4ef89d4	Add missing French translation of tutoriel_pipeline.md (#31396 ) * Update french translation of tutoriel_pipeline.md * Update docs/source/fr/tutoriel_pipeline.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/fr/tutoriel_pipeline.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/fr/tutoriel_pipeline.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/fr/tutoriel_pipeline.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/fr/tutoriel_pipeline.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/fr/tutoriel_pipeline.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/fr/tutoriel_pipeline.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update docs/source/fr/tutoriel_pipeline.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> --------- Co-authored-by: Jade Choghari <chogharijade@icloud.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>	2024-06-13 17:48:54 +02:00
Pablo Montalvo	c624d5ba0b	add initial design for uniform processors + align model (#31197 ) * add initial design for uniform processors + align model * fix mutable default 👀 * add configuration test * handle structured kwargs w defaults + add test * protect torch-specific test * fix style * fix * fix assertEqual * move kwargs merging to processing common * rework kwargs for type hinting * just get Unpack from extensions * run-slow[align] * handle kwargs passed as nested dict * add from_pretrained test for nested kwargs handling * [run-slow]align * update documentation + imports * update audio inputs * protect audio types, silly * try removing imports * make things simpler * simplerer * move out kwargs test to common mixin * [run-slow]align * skip tests for old processors * [run-slow]align, clip * !$#@!! protect imports, darn it * [run-slow]align, clip * [run-slow]align, clip * update doc * improve documentation for default values * add model_max_length testing This parameter depends on tokenizers received. * Raise if kwargs are specified in two places * fix * expand VideoInput * fix * fix style * remove defaults values * add comment to indicate documentation on adding kwargs * protect imports * [run-slow]align * fix * remove set() that breaks ordering * test more * removed unused func * [run-slow]align	2024-06-13 16:27:16 +02:00
Matt	15b3923d65	Make chat templates part of ProcessorMixin (#30744 ) * Let's try moving chat templates out of IDEFICS and into the generic ProcessorMixin * Chat templates should not be mandatory * Chat templates should not be mandatory * Not all classes will have default chat templates * stash commit * Add chat template docstring * Clean up docstring * Add chat templates to LLaVA/LLaVA-next * Docstring fixup * Quick IDEFICS2 fixup * Remove some old references to the Conversation class * make fixup	2024-06-13 14:35:30 +01:00
Billy Cao	3c4a8dca0c	[QoL fix] [Image processing] Add warning on assumption of channel dim and avoid infering when inputs are PIL.Image (#31364 ) * Add warning on assumption of channel dim Use PIL info whenever possible to decide channel axis * Fix ruff format * Remove type checking Improve warning message * Update src/transformers/models/siglip/image_processing_siglip.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/image_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/image_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-06-13 10:34:58 +01:00
Luc Georges	348e2294ac	feat(ci): add trufflehog secrets detection (#31344 )	2024-06-12 18:00:43 +02:00
조준래	17896f6783	Change JSON serialization to custom json.dumps (#31100 ) * Change JSON serialization to custom json.dumps to prevent escaping of "<", ">", "&", "'" * caller has control over the order, remove sort_key=True * Move tojson into a proper function and expose a couple of other args --------- Co-authored-by: jun.4 <jun.4@kakaobrain.com> Co-authored-by: Matt <rocketknight1@gmail.com>	2024-06-12 14:59:35 +01:00
dependabot[bot]	1c77b3d9cf	Bump jupyter-core from 4.6.3 to 4.11.2 in /examples/research_projects/visual_bert (#31386 ) Bump jupyter-core in /examples/research_projects/visual_bert Bumps [jupyter-core](https://github.com/jupyter/jupyter_core) from 4.6.3 to 4.11.2. - [Release notes](https://github.com/jupyter/jupyter_core/releases) - [Changelog](https://github.com/jupyter/jupyter_core/blob/main/CHANGELOG.md) - [Commits](https://github.com/jupyter/jupyter_core/compare/4.6.3...4.11.2) --- updated-dependencies: - dependency-name: jupyter-core dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-12 14:12:53 +01:00
Marc Sun	254b25abd9	Use huggingface_hub helper function to split state dict (#31091 ) * shard saving from hf hub * index = None * fix tests * indent	2024-06-12 14:10:32 +02:00
Aaron V	1c73d85b86	Update comment in modeling_utils.py (#31299 )	2024-06-12 12:01:42 +01:00
Andrew Hong	9f863d9a5b	README underline between badges fix (#31376 ) Badge underline fix	2024-06-12 11:49:50 +01:00
amyeroberts	d218a2e51f	backbone_utils - fix relative import (#31382 ) Fix relative import	2024-06-12 11:42:20 +01:00
谭九鼎	84351d57eb	docs: fix broken link (#31370 ) * docs: fix broken link * fix link	2024-06-12 11:33:00 +01:00
her0e1c1	20fac1f249	[Bug Fix] Renamed loss to losses to suppress UnboundLocalError (#31365 ) Renamed loss to losses to suppress UnboundLocalError Co-authored-by: Your Name <you@example.com>	2024-06-12 11:29:25 +01:00
Raushan Turganbay	08ad34b19e	Fix idefics cache (#31377 ) * fix idefics cache * fix tests	2024-06-12 15:24:32 +05:00
Jason (Siyu) Zhu	a2ede66674	Add support to declare imports for code agent (#31355 ) * Support import declaration in Code Agent	2024-06-12 09:32:28 +02:00
Jade Choghari	35a6d9d648	Add french translation of AutoBackbone (#31300 )	2024-06-11 18:28:52 +01:00
amyeroberts	f53fe35b29	Fast image processor (#28847 ) * Draft fast image processors * Draft working fast version * py3.8 compatible cache * Enable loading fast image processors through auto * Tidy up; rescale behaviour based on input type * Enable tests for fast image processors * Smarter rescaling * Don't default to Fast * Safer imports * Add necessary Pillow requirement * Woops * Add AutoImageProcessor test * Fix up * Fix test for imagegpt * Fix test * Review comments * Add warning for TF and JAX input types * Rearrange * Return transforms * NumpyToTensor transformation * Rebase - include changes from upstream in ImageProcessingMixin * Safe typing * Fix up * convert mean/std to tesnor to rescale * Don't store transforms in state * Fix up * Update src/transformers/image_processing_utils_fast.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/auto/image_processing_auto.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/auto/image_processing_auto.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/auto/image_processing_auto.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Warn if fast image processor available * Update src/transformers/models/vit/image_processing_vit_fast.py * Transpose incoming numpy images to be in CHW format * Update mapping names based on packages, auto set fast to None * Fix up * Fix * Add AutoImageProcessor.from_pretrained(checkpoint, use_fast=True) test * Update src/transformers/models/vit/image_processing_vit_fast.py Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com> * Add equivalence and speed tests * Fix up --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>	2024-06-11 15:47:38 +01:00
Matt	edc1dffd00	Chat Template support for function calling and RAG (#30621 ) * First draft, still missing automatic function conversion * First draft of the automatic schema generator * Lots of small fixes * the walrus has betrayed me * please stop committing your debug breakpoints * Lots of cleanup and edge cases, looking better now * Comments and bugfixes for the type hint parser * More cleanup * Add tests, update schema generator * Update tests, proper handling of return values * Small docstring change * More doc updates * More doc updates * Add json_schema decorator * Clean up the TODOs and finish the docs * self.maxDiff = None to see the whole diff for the nested list test * add import for add_json_schema * Quick test fix * Fix something that was bugging me in the chat template docstring * Less "anyOf" when unnecessary * Support return types for the templates that need them * Proper return type tests * Switch to Google format docstrings * Update chat templating docs to match new format * Stop putting the return type in with the other parameters * Add Tuple support * No more decorator - we just do it implicitly! * Add enum support to get_json_schema * Update docstring * Add copyright header * Update src/transformers/tokenization_utils_base.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/chat_templating.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/utils/chat_template_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/utils/chat_template_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Add copyright header * make fixup * Fix indentation * Reformat chat_template_utils * Correct return value * Make regexes module-level * Support more complex, multi-line arg docstrings * Update error message for ... * Update ruff * Add document type validation * Refactor docs * Refactor docs * Refactor docs * Clean up Tuple error * Add an extra test for very complex defs and docstrings and clean everything up for it * Document enum block * Quick test fixes * Stop supporting type hints in docstring to fix bugs and simplify the regex * Update docs for the regex change * Clean up enum regex * Wrap functions in {"type": "function", "function": ...} * Update src/transformers/utils/chat_template_utils.py Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com> * Temporary tool calling commit * Add type hints to chat template utils, partially update docs (incomplete!) * Code cleanup based on @molbap's suggestion * Add comments to explain regexes * Fix up type parsing for unions and lists * Add custom exception types and adjust tests to look for them * Update docs with a demo! * Docs cleanup * Pass content as string * Update tool call formatting * Update docs with new function format * Update docs * Update docs with a second tool to show the model choosing correctly --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>	2024-06-11 15:46:38 +01:00
dependabot[bot]	ce3647ad2d	Bump jupyter-core from 4.6.3 to 4.11.2 in /examples/research_projects/lxmert (#31360 ) Bump jupyter-core in /examples/research_projects/lxmert Bumps [jupyter-core](https://github.com/jupyter/jupyter_core) from 4.6.3 to 4.11.2. - [Release notes](https://github.com/jupyter/jupyter_core/releases) - [Changelog](https://github.com/jupyter/jupyter_core/blob/main/CHANGELOG.md) - [Commits](https://github.com/jupyter/jupyter_core/compare/4.6.3...4.11.2) --- updated-dependencies: - dependency-name: jupyter-core dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-11 12:11:10 +01:00
Aymeric Roucher	12ae6d3573	Fix gradio tool demos (#31230 ) * Fix gradio tool demos	2024-06-11 11:35:27 +02:00

1 2 3 4 5 ...

16171 Commits