transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-14 10:08:29 +06:00

Author	SHA1	Message	Date
Jess	c6a84b7202	[DOCS] Add example for HammingDiversityLogitsProcessor (#25481 ) * updated logits processor text * Update logits_process.py * fixed formatting with black * fixed formatting with black * fixed formatting with Make Fixup * more formatting fixes * Update src/transformers/generation/logits_process.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/generation/logits_process.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Revert "fixed formatting with Make Fixup" This reverts commit `47643083` * Revert "fixed formatting with black" This reverts commit `bfb1536736`. * Revert "fixed formatting with Make Fixup" This reverts commit `47643083` * Revert "fixed formatting with Make Fixup" This reverts commit `47643083` * Revert "fixed formatting with black" This reverts commit `ad6ceb64` * Revert "fixed formatting with black" This reverts commit `ad6ceb64b7`. * Update src/transformers/generation/logits_process.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Revert "fixed formatting with Make Fixup" This reverts commit `47643083` * formatted logits_process with make fixup --------- Co-authored-by: jesspeck <jess@localseoguide.com> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-08-25 12:35:40 +01:00
Joao Gante	85cf90a1c9	Generate: add missing logits processors docs (#25653 )	2023-08-25 11:56:17 +01:00
Pedro Cuenca	cb8e3ee25f	Add FlaxCLIPTextModelWithProjection (#25254 ) * Add FlaxClipTextModelWithProjection This is necessary to support the Flax port of Stable Diffusion XL: `fb6d705fb5/text_encoder_2/config.json (L3)` Co-authored-by: Martin Müller <martin.muller.me@gmail.com> Co-authored-by: Juan Acevedo <juancevedo@gmail.com> * Use FlaxCLIPTextModelOutput * make fix-copies again * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Use `return_dict` for consistency with other uses. Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Fix docstring example. * Add new model to FlaxCLIPTextModelTest * Add to IGNORE_NON_AUTO_CONFIGURED list * Fix naming convention. --------- Co-authored-by: Martin Müller <martin.muller.me@gmail.com> Co-authored-by: Juan Acevedo <juancevedo@gmail.com> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>	2023-08-25 10:58:14 +02:00
Anthony Susevski	8968fface4	fixed typo in speech encoder decoder doc (#25745 ) fixed typo in speech encoder decoder blog	2023-08-25 09:20:37 +02:00
Younes Belkada	ae320fa53f	[`PEFT`] Fix PeftConfig save pretrained when calling `add_adapter` (#25738 ) fix save_pretrained issue + add test	2023-08-25 08:19:11 +02:00
Wonhyeong Seo	f26099e7b5	🌐 [i18n-KO] Translated `visual_question_answering.md` to Korean (#25679 ) * docs: ko: visual_question_answering.md * feat: chatgpt draft tosquash: add code blocks * fix: manual edits ~L34 14:25 ~L126 16:52 ~L224 17:00 ~L335 17:11 ~EOF 17:18 * fix: self-correction * amend grammar, phrasing * docs: add new entry to _toctree.yml * fix: use terms from glossary Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com> --------- Co-authored-by: SeongWooChoi <46990061+nuatmochoi@users.noreply.github.com>	2023-08-24 11:14:58 -07:00
Sanchit Gandhi	0218876822	[ASR Pipe Test] Fix CTC timestamps error message (#25727 )	2023-08-24 17:58:37 +01:00
Younes Belkada	fd0b94fd7b	[`from_pretrained`] Fix failing PEFT tests (#25733 ) fix failing PEFT tests	2023-08-24 18:48:41 +02:00
amyeroberts	1b2381c46b	ImageProcessor - check if input pixel values between 0-255 (#25688 ) * Check if pixel values between 0-255 and add doc clarification * Add missing docstrings * _is_scale_image -> is_scaled_image * Spelling is hard * Tidy up	2023-08-24 17:24:36 +01:00
Stas Bekman	7a6efe1e9f	[idefics] idefics-9b test use 4bit quant (#25734 )	2023-08-24 08:33:14 -07:00
Arthur	fecf08560c	[`from_pretrained`] Simpler code for peft (#25726 ) * refactor complicated from pretrained for peft * nits * more nits * Update src/transformers/modeling_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * make tests happy * fixup after merge --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-08-24 16:18:39 +02:00
Joao Gante	0a365c3e6a	Generate: nudge towards `do_sample=False` when `temperature=0.0` (#25722 )	2023-08-24 14:15:43 +01:00
Younes Belkada	584eeb5387	[`AutoGPTQ`] Add correct installation of GPTQ library + fix slow tests (#25713 ) * add correct installation of GPTQ library * update tests values	2023-08-24 14:57:16 +02:00
Sylvain Gugger	2febd50614	Fix number of minimal calls to the Hub with peft integration (#25715 ) * Fix number of minimal calls to the Hub with peft integration * Alternate design * And this way? * Revert * Address comments	2023-08-24 14:56:11 +02:00
Younes Belkada	70b49f023c	[`PEFT`] Fix peft version (#25710 ) * fix peft version * address comments * adapt suggestion	2023-08-24 12:09:12 +02:00
Yih-Dar	8fff61b9db	Fix failing `test_batch_generation` for bloom (#25718 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-24 11:15:29 +02:00
Tom Aarsen	f01459c75d	docs: Resolve typos in warning text (#25711 ) Resolve typos in warning text	2023-08-24 11:14:27 +02:00
Sylvain Gugger	c2123626aa	Update list of persons to tag (#25708 )	2023-08-24 10:13:30 +02:00
Arthur	6e6da5e4b8	[`LlamaTokenizer`] make unk_token_length a property (#25689 ) make unk_token_length a property	2023-08-24 08:03:34 +02:00
Sourab Mangrulkar	b85b88069a	fix ram efficient fsdp init (#25686 )	2023-08-24 11:30:42 +05:30
Sylvain Gugger	68fa9a5937	Skip broken tests	2023-08-24 01:48:53 -04:00
Susnato Dhar	4d40109c3a	Fix typo in `configuration_gpt2.py` (#25676 ) Update configuration_gpt2.py	2023-08-23 11:40:03 -07:00
Joao Gante	3c2383b1c6	Generate: general test for decoder-only generation from `inputs_embeds` (#25687 ) Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-08-23 19:17:01 +01:00
Phuc Van Phan	656e17f6f7	correct resume training steps number in progress bar (#25691 ) feat: correct update resume update with steps	2023-08-23 20:09:14 +02:00
sanjeevk-os	6add3b313d	[DOCS] Added docstring example for EpsilonLogitsWarper #24783 (#25378 ) * [DOCS] Added docstring example for EpsilonLogitsWarper #24783 * minor code changes based on review comments * set seed for both generate calls, reduced the example length * fixed line length under 120 chars	2023-08-23 17:25:28 +01:00
Yih-Dar	2189a7f54a	Fix `pad_token` check condition (#25685 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-23 16:39:28 +02:00
Lysandre Debut	8657ec68fc	Sets the stalebot to 10 AM CEST (#25678 ) This sets the stale bot trigger time at 10 AM CEST rather than 5 PM CEST as all core maintainers on watch duty are now in the European timezone	2023-08-23 14:21:07 +02:00
Sanchit Gandhi	77cb2ab792	⚠️ [CLAP] Fix dtype of logit scales in init (#25682 ) [CLAP] Fix dtype of logit scales	2023-08-23 13:17:37 +01:00
Nora Belrose	2cf87e2bbb	Prevent Dynamo graph fragmentation in GPTNeoX with torch.baddbmm fix (#24941 ) * Pass a Python scalar for alpha in torch.baddbmm * fixup --------- Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>	2023-08-23 14:07:46 +02:00
Yih-Dar	b413e0610b	Remove `utils/documentation_tests.txt` (#25680 ) * fix * fix * fix * fix * fix * fix * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>	2023-08-23 11:14:45 +02:00
Yih-Dar	3d1edb6c5d	fix wrong path in some doc (#25658 ) * update * check --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-23 08:34:30 +02:00
Arthur	db58722084	[`GPTNeo`] Add input_embeds functionality to gpt_neo Causal LM (#25664 ) nit	2023-08-23 07:49:19 +02:00
Arthur	51794bf21e	[`SPM`] Patch `spm` Llama and T5 (#25656 ) * hot fix * only encode with string prefix if starts with prefix * styling * add a new test * fixup	2023-08-23 07:16:43 +02:00
Wonhyeong Seo	57943630e2	Add Llama2 resources (#25531 ) * docs: feat: model resources for llama2 Co-authored-by: Woojun Jung <hello_984@naver.com> * fix: add description for dpo and rearrange posts * docs: feat: add llama2 notebook resources * style: one liners for each resource Co-Authored-By: Woojun Jung <46880056+jungnerd@users.noreply.github.com> Co-Authored-By: Kihoon Son <75935546+kihoon71@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Fix typo Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Woojun Jung <hello_984@naver.com> Co-authored-by: Woojun Jung <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-08-22 17:14:54 -07:00
Yih-Dar	40a0cabd93	Update doc toctree (#25661 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-08-22 22:58:55 +02:00
Gabriel Asher	977b2f05d5	Add input_embeds functionality to gpt_neo Causal LM (#25659 ) * Updated gpt_neo causalLM to support using input embeddings for generation * added indentation * Did make fixup	2023-08-22 20:28:38 +02:00
AleksanderWWW	908f853688	stringify config (#25637 ) * stringify config * apply code formatting	2023-08-22 17:21:01 +02:00
Alex McKinney	5eeaef921f	Adds `TRANSFORMERS_TEST_BACKEND` (#25655 ) * Adds `TRANSFORMERS_TEST_BACKEND` Allows specifying arbitrary additional import following first `import torch`. This is useful for some custom backends, that will require additional imports to trigger backend registration with upstream torch. See https://github.com/pytorch/benchmark/pull/1805 for a similar change in `torchbench`. * Update src/transformers/testing_utils.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> * Adds real backend example to documentation --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>	2023-08-22 17:08:13 +02:00
Rafael Padilla	fd56f7f081	removing unnecesssary extra parameter (#25643 )	2023-08-22 10:10:30 -04:00
Arthur	e20fab0bbe	Fix bloom add prefix space (#25652 ) * properly support Sequence of pretokenizers * actual fix * make sure the fix works. Tests are not working for sure! * hacky way * add TODO * update * add a todo * nits * rename test * nits * rename test	2023-08-22 14:50:12 +02:00
Matt	62396cff46	TF 2.14 compatibility (#25630 ) * Update the TF pin and see if anything breaks * make fixup * make fixup * make fixup	2023-08-22 13:13:38 +01:00
Sylvain Gugger	3629190689	Put IDEFICS in the right section of the doc (#25650 )	2023-08-22 10:39:10 +02:00
Sylvain Gugger	edb28722c2	Pass the proper token to PEFT integration in auto classes (#25649 )	2023-08-22 10:13:56 +02:00
Christopher Akiki	88e51ba306	[MINOR:TYPO] (#25646 ) [MINOR:TYPO] Update tokenization_auto.py	2023-08-22 09:54:44 +02:00
Blake Wyatt	6a314ea7cd	[DOCS] MusicGen Docs Update (#25510 ) * docs: note token limitations for MusicGen * docs: note token limitations for MusicGen * docs: fix token count with token limitations for MusicGen	2023-08-22 08:22:45 +02:00
Tanay Mehta	182b83749a	Add Number Normalisation for SpeechT5 (#25447 ) * add: NumberNormalizer works for integers, floats, common currencies, negative numbers and percentages * fix: renamed number normalizer class and added normalization to SpeechT5Processor * fix: restyled with black and ruff, should pass code quality tests * fix: moved normalization to tokenizer and other small changes to normalizer * add: test for normalization and changed the existing full tokenizer test * fix: tokenization tests now pass, made changes to existing tokenization where normalization is covered; added normalize arg to func signature * fix: changed default normalize setting to False, modified the tests a bit * fix: added support for comma separated numbers, tokenization on the fly with kwargs and normalizer getter setter funcs	2023-08-22 08:12:57 +02:00
Joe Mifsud	58c36bea74	Support specifying revision in push_to_hub (#25578 ) Support revision in push_to_hub	2023-08-22 07:55:35 +02:00
Susnato Dhar	450a181d8b	Add Pop2Piano (#21785 ) * init commit * config updated also some modeling * Processor and Model config combined * extraction pipeline(upto before spectogram & mel_conditioner) added but not properly tested * model loading successful! * feature extractor done! * FE can now be called from HF * postprocessing added in fe file * same as prev commit * Pop2PianoConfig doc done * cfg docs slightly changed * fe docs done * batched * batched working! * temp * v1 * checking * trying to go with generate * with generate and model tests passed * before rebasing * . * tests done docs done remaining others & nits * nits * LogMelSpectogram shifted to FeatureExtractor * is_tf rmeoved from pop2piano/init * import solved * tokenization tests added * minor fixed regarding modeling_pop2piano * tokenizer changed to only return midi_object and other changes * Updated paper abstract(Camera-ready version) (#2) * more comments and nits * ruff changes * code quality fix * sg comments * t5 change added and rebased * comments except batching * batching done * comments * small doc fix * example removed from modeling * ckpt * forward it compatible with fe and generation done * comments * comments * code-quality fix(maybe) * ckpts changed * doc file changed from mdx to md * test fixes * tokenizer test fix * changes * nits done main changes remaining * code modified * Pop2PianoProcessor added with tests * other comments * added Pop2PianoProcessor to dummy_objects * added require_onnx to modeling file * changes * update .md file * remove extra line in index.md * back to the main index * added pop2piano to index * Added tokenizer.__call__ with valid args and batch_decode and aligned the processor part too * changes * added return types to 2 tokenizer methods * the PR build test might work now * added backends * PR build fix * vocab added * comments * refactored vocab into 1 file * added conversion script * comments * essentia version changed in .md * comments * more tokenizer tests added * minor fix * tests extended for outputs acc check * small fix --------- Co-authored-by: Jongho Choi <sweetcocoa@snu.ac.kr>	2023-08-21 16:35:00 +01:00
mchau	6f041fcbb8	fix documentation for CustomTrainer (#25635 ) fix doc	2023-08-21 17:23:17 +02:00
Rafael Padilla	8608bf2049	🚨🚨🚨 changing default threshold and applying threshold before the rescale (#25608 ) changing position of score threshold and its default value	2023-08-21 10:20:05 -04:00

... 24 25 26 27 28 ...

15053 Commits