transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
Yih-Dar	d4564df1d4	Revive Nightly/Past CI (#31159 ) * build * build * build * build --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-06-20 18:57:24 +02:00
Yih-Dar	ec905f3a76	unskip 2 tests in cohere (#31517 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-06-20 17:21:08 +02:00
Joao Gante	1fd60fec75	RWKV: enable generation tests (#31490 ) * add rwkv tests * has_attentions set in individual tests	2024-06-20 14:15:01 +01:00
Jiahui Wei	d28e647f28	Fix mismatched ` in doc & other common typos (#31516 ) fix common doc typos Co-authored-by: Jiahui Wei <jiahui.wei@tusen.ai>	2024-06-20 14:03:07 +01:00
Younes Belkada	6d4306160a	GGUF: Fix llama 3 GGUF (#31358 ) * Create push-important-models.yml * llama3 support for GGUF * fixup * Update src/transformers/integrations/ggml.py * fix pre-tokenizer * fix * fix * fix * fix * fix * fix * address final comment * handle special tokens + add tests	2024-06-20 14:29:58 +02:00
Sadra Barikbin	35b112d344	Fix a teeny-tiny typo in `tokenization_utils_base.py`'s docstring (#31510 ) Update tokenization_utils_base.py	2024-06-20 10:35:52 +01:00
arthasking123	0ed3ffcb44	Add valid columns check in _remove_unused_columns method (#31466 ) * Add valid columns checking in _remove_unused_columns method https://github.com/huggingface/datasets/issues/6973#issue-2355517362 https://github.com/huggingface/datasets/issues/6535 https://discuss.huggingface.co/t/indexerror-invalid-key-16-is-out-of-bounds-for-size-0/14298/25 * Update modeling_mixtral.py * Update modeling_mixtral.py * Update modeling_mixtral.py	2024-06-19 13:26:37 +01:00
Daemyung Jang	547b5582ec	Consider inheritance in type checking for tensors (#31378 ) * Consider inheritance in type checking for tensors Add an additional check to bypass type assertion when both tensors are torch.Tensor instances. * Fix the quality issue	2024-06-19 14:05:20 +02:00
Timothé Pearce	3b5fa14fb8	Fix `wandb` integration with `SetFit` model (#30021 ) Fix W&B integration with SetFit model Co-authored-by: PEARCE Timothe <timothe_pearce@ext.connect-tech.sncf>	2024-06-19 13:23:05 +02:00
nikkie	f4d189441d	Fix typo: pas_token_id (#30894 ) Fix typo	2024-06-19 11:23:08 +01:00
Fanli Lin	4144c354e9	auto-detect device when no device is passed to pipeline (#31398 ) * fix device * Update src/transformers/pipelines/base.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * bug fix * add warning --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-06-19 11:12:39 +01:00
Billy Cao	cd5f7c1790	Add docs on zeroshot image classification prompt templates (#31343 ) * Add docs on pipeline templates * Fix example and comments Update usage tips * Update docs/source/en/tasks/zero_shot_image_classification.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/model_doc/siglip.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Trigger CI --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-06-19 11:11:44 +01:00
linlin	1c1aec2ef1	Update object_detection.md (#31488 ) Define MAX_SIZE before it is used.	2024-06-19 10:36:44 +01:00
Joao Gante	83259e406d	Mamba: add generative tests (#31478 )	2024-06-19 10:27:23 +01:00
Younes Belkada	7d683f7bae	Docs / AQLM: Clarify `torch.compile` support for AQLM (#31473 ) Update overview.md	2024-06-19 11:26:25 +02:00
Fanli Lin	077c139f57	[tests] rename `test_config_object` to `test_ds_config_object` (#31403 ) fix name	2024-06-19 11:19:15 +02:00
amyeroberts	609e662243	Use self.config_tester.run_common_tests() (#31431 ) * First testing updating config tests * Use run_common_tests	2024-06-19 10:18:08 +01:00
Phillip Rust	7c71b61dae	Fix autocast incompatibility in RecurrentGemma (#30832 )	2024-06-19 09:59:34 +02:00
Anton Vlasjuk	b275a41005	[`GPT2`] Add SDPA support (#31172 ) * `gpt2` sdpa support * fix (at least) one test, style, repo consistency * fix sdpa mask in forward --> fixes generation * test * test2 * test3 * test4 * simplify shapes for attn mask creation and small comments * hub fail test * benchmarks * flash attn 2 mask should not be inverted on enc-dec setup * fix comment * apply some suggestion from code review - only save _attn_implentation once - remove unnecessary comment * change elif logic * [run-slow] gpt2 * modify `test_gpt2_sample_max_time` to follow previous assertion patterns	2024-06-19 09:40:57 +02:00
Rémy Léone	22b41b3f8a	Update perf_train_gpu_many.md (#31451 ) * Update perf_train_gpu_many.md * Update docs/source/en/perf_train_gpu_many.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/perf_train_gpu_many.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-06-18 11:00:26 -07:00
Tom Aarsen	280cef51b3	Give more useful `metric_for_best_model` errors (#31450 ) Give more useful metric_for_best_model errors	2024-06-18 16:56:30 +01:00
Quentin Gallouédec	2505357e4f	Fix documentation typos (#31476 ) Fix doc typo	2024-06-18 16:09:50 +01:00
dependabot[bot]	4691ffbd41	Bump urllib3 from 1.26.18 to 1.26.19 in /examples/research_projects/visual_bert (#31472 ) Bump urllib3 in /examples/research_projects/visual_bert Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.18 to 1.26.19. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/1.26.19/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.18...1.26.19) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-18 16:08:15 +01:00
Yih-Dar	1c7c34bc64	Improve `PreTrainedTokenizerFast` loading time when there are many added tokens (#31404 ) * use hash * use hash * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2024-06-18 15:20:14 +02:00
Matt	6e56b83453	Update chat template docs and bump Jinja version (#31455 ) * Update chat template docs * Minor bug in the version check * Update docs/source/en/chat_templating.md Co-authored-by: Joshua Lochner <admin@xenova.com> * Update docs/source/en/chat_templating.md Co-authored-by: Joshua Lochner <admin@xenova.com> * Update docs/source/en/chat_templating.md Co-authored-by: Joshua Lochner <admin@xenova.com> * Replace backticks with bolding because the doc builder was trying to parse them * Replace backticks with bolding because the doc builder was trying to parse them * Replace backticks with bolding because the doc builder was trying to parse them * More cleanups to avoid upsetting the doc builder * Add one more tip at the end --------- Co-authored-by: Joshua Lochner <admin@xenova.com>	2024-06-18 14:16:30 +01:00
Matt	28316d0e8b	Fix single letter stop strings (#31448 ) * Fix single letter stop strings * Change the 0 to a 1 to avoid potential empty vector headaches later * Restructure for clarity * Update tests/generation/test_stopping_criteria.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Add the unsqueeze --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-06-18 14:07:16 +01:00
Matt	dabf01973a	Make "tool_use" the default chat template key when tools are passed (#31429 ) * Make "tool_use" the default when tools are passed * Add some opinionated text to the docs * Add some opinionated text to the docs	2024-06-18 13:54:42 +01:00
Joao Gante	cd71f9381b	Donut: fix `generate` call from local path (#31470 ) * local donut path fix * engrish * Update src/transformers/generation/utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2024-06-18 13:28:06 +01:00
dependabot[bot]	76289fbc7c	Bump urllib3 from 1.26.18 to 1.26.19 in /examples/research_projects/decision_transformer (#31459 ) Bump urllib3 in /examples/research_projects/decision_transformer Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.26.18 to 1.26.19. - [Release notes](https://github.com/urllib3/urllib3/releases) - [Changelog](https://github.com/urllib3/urllib3/blob/1.26.19/CHANGES.rst) - [Commits](https://github.com/urllib3/urllib3/compare/1.26.18...1.26.19) --- updated-dependencies: - dependency-name: urllib3 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-18 12:22:25 +01:00
Aymeric Roucher	b38612d312	Agents: Improve python interpreter (#31409 ) * Improve Python interpreter * Add with and assert statements * Prevent overwriting existing tools * Check interpreter errors are well logged in code agent * Add lazy evaluation for and and or * Improve variable assignment * Fix early return statements in functions * Add small import fix on interpreter tool	2024-06-18 11:55:36 +02:00
Kevin Hu	1f9387d33d	Fix typing errors in `Qwen2ForTokenClassification` (#31440 ) * Update modeling_qwen2.py * Fix llama * More fixes	2024-06-18 10:27:18 +01:00
Kerim	9ba9369a25	simple fix (#31456 )	2024-06-17 22:30:37 +01:00
Ella Charlaix	02300273e2	🚨 Remove dataset with restrictive license (#31452 ) remove dataset with restrictive license	2024-06-17 17:56:51 +01:00
Albert Villanova del Moral	a14b055b65	Pass datasets trust_remote_code (#31406 ) * Pass datasets trust_remote_code * Pass trust_remote_code in more tests * Add trust_remote_dataset_code arg to some tests * Revert "Temporarily pin datasets upper version to fix CI" This reverts commit `b7672826ca`. * Pass trust_remote_code in librispeech_asr_dummy docstrings * Revert "Pin datasets<2.20.0 for examples" This reverts commit `833fc17a3e`. * Pass trust_remote_code to all examples * Revert "Add trust_remote_dataset_code arg to some tests" to research_projects * Pass trust_remote_code to tests * Pass trust_remote_code to docstrings * Fix flax examples tests requirements * Pass trust_remote_dataset_code arg to tests * Replace trust_remote_dataset_code with trust_remote_code in one example * Fix duplicate trust_remote_code * Replace args.trust_remote_dataset_code with args.trust_remote_code * Replace trust_remote_dataset_code with trust_remote_code in parser * Replace trust_remote_dataset_code with trust_remote_code in dataclasses * Replace trust_remote_dataset_code with trust_remote_code arg	2024-06-17 17:29:13 +01:00
Bastien Le Chenadec	485fd81471	Support multiple validation datasets when `dataloader_persistent_workers=True` (#30627 ) * Support multiple validation datasets when dataloader_persistent_workers=True * Test support of multiple validation datasets	2024-06-17 16:58:39 +01:00
dependabot[bot]	147c404fb1	Bump idna from 2.8 to 3.7 in /examples/research_projects/visual_bert (#30201 ) Bumps [idna](https://github.com/kjd/idna) from 2.8 to 3.7. - [Release notes](https://github.com/kjd/idna/releases) - [Changelog](https://github.com/kjd/idna/blob/master/HISTORY.rst) - [Commits](https://github.com/kjd/idna/compare/v2.8...v3.7) --- updated-dependencies: - dependency-name: idna dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-17 16:39:42 +01:00
Fanli Lin	9454f437b0	[tests] make `TestDeepSpeedModelZoo` device-agnostic (#31402 ) * fix * use accelerator device count * ci fix	2024-06-17 16:42:57 +02:00
dependabot[bot]	7977f206dc	Bump idna from 2.8 to 3.7 in /examples/research_projects/lxmert (#30200 ) Bumps [idna](https://github.com/kjd/idna) from 2.8 to 3.7. - [Release notes](https://github.com/kjd/idna/releases) - [Changelog](https://github.com/kjd/idna/blob/master/HISTORY.rst) - [Commits](https://github.com/kjd/idna/compare/v2.8...v3.7) --- updated-dependencies: - dependency-name: idna dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-17 15:13:33 +01:00
dependabot[bot]	ee197e2b9e	Bump idna from 3.3 to 3.7 in /examples/research_projects/decision_transformer (#30203 ) Bump idna in /examples/research_projects/decision_transformer Bumps [idna](https://github.com/kjd/idna) from 3.3 to 3.7. - [Release notes](https://github.com/kjd/idna/releases) - [Changelog](https://github.com/kjd/idna/blob/master/HISTORY.rst) - [Commits](https://github.com/kjd/idna/compare/v3.3...v3.7) --- updated-dependencies: - dependency-name: idna dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-06-17 11:13:16 +01:00
Joao Gante	377e903928	Generate: fix `tokenizer` being popped twice (#31427 )	2024-06-17 10:36:10 +01:00
amyeroberts	02c525d226	Rename misnamed image processor test files (#31430 )	2024-06-17 10:21:28 +01:00
Yoach Lacombe	7ae4fc271d	Fix Bark logits processors device misplacement (#31416 ) Fix Logits Processors device misplacement	2024-06-17 09:54:06 +02:00
Raushan Turganbay	9af1b6a80a	Musicgen special tokens in tensors (#31420 ) fix	2024-06-17 10:09:27 +05:00
Dmitry Rogozhkin	eed9ed6798	xpu: support xpu backend from stock pytorch (>=2.4) (#31238 ) * xpu: support xpu backend from stock pytorch (>=2.4) Fixes: https://github.com/huggingface/transformers/issues/31237 XPU backend is available in the stock PyTorch starting from version 2.4, see [1]. This commit extends huggingface transformers to support XPU from both IPEX and the stock pytorch. IPEX is being tried first. See: https://github.com/pytorch/pytorch/issues/114842 Requires: https://github.com/huggingface/accelerate/pull/2825 Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com> * xpu: enable gpt2 and decision_transformer tests for xpu pytorch backend Note that running xpu tests requires TRANSFORMERS_TEST_DEVICE_SPEC=spec.py passed to the test runner: import torch DEVICE_NAME = 'xpu' MANUAL_SEED_FN = torch.xpu.manual_seed EMPTY_CACHE_FN = torch.xpu.empty_cache DEVICE_COUNT_FN = torch.xpu.device_count Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com> --------- Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>	2024-06-14 21:31:35 +02:00
amyeroberts	20812237ce	Remove empty create_and_test_config_common_properties tests (#31359 ) Remove empty tests	2024-06-14 20:15:48 +01:00
amyeroberts	3d0bd86915	Install the tensorflow example requirements in docker (#31428 )	2024-06-14 19:35:43 +01:00
amyeroberts	11f43c15d3	Remove duplicate image processor in auto map (#31383 )	2024-06-14 18:23:55 +01:00
Ian McKenzie	c212ac9a02	Change potential `inputs_embeds` padding `logger.warning` to `logger.warning_once` (#31411 ) change embeddings padding warning to warning_once	2024-06-14 17:36:15 +01:00
Yoach Lacombe	7e1c7dc8b6	Fix SpeechT5 `decoder_attention_mask` shape (#28071 ) * Fix SpeechT5 * add test foward with labels and attention mask * make style	2024-06-14 15:20:11 +02:00
Yoach Lacombe	d9daeff297	Set seed for M4T retain grad test (#31419 )	2024-06-14 14:48:04 +02:00

1 2 3 4 5 ...

16195 Commits