transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-31 02:02:21 +06:00

Author	SHA1	Message	Date
amyeroberts	90e8263d91	Add methods to update and verify out_features out_indices (#23031 ) * Add methods to update and verify out_features out_indices * Safe update for config attributes * Fix function names * Save config correctly * PR comments - use property setters * PR comment - directly set attributes * Update test * Add updates to recently merged focalnet backbone	2023-05-04 10:15:06 +01:00
peter-sk	78b7debf56	GPTNeoForQuestionAnswering (#23057 ) * first draft - gives index error in question_answering.py * maturing * no labels * pipeline should know about QA * fixing checks * formatting * fixed docstring * initial commit * formatting * adding the class to many places * towards less unhappy checks * nearly there * Update src/transformers/models/gpt_neo/modeling_gpt_neo.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * avoid error * moving to device of star/end_logits --------- Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-05-03 15:59:19 -04:00
Robert Stone	b6933d76d2	Tidy Pytorch GLUE benchmark example (#23134 ) Migration to Evaluate for metric is not quite complete	2023-05-03 15:50:41 -04:00
Alara Dirik	b0a78091a5	Remove redundant print statements (#23133 ) remove redundant print statements	2023-05-03 18:04:48 +01:00
regisss	e3ee45aa54	Enable to use custom tracer in FX `symbolic_trace` (#23105 ) * Enable to use custom tracer in FX `symbolic_trace` * Integrate feedback from review * Formatting Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-05-03 12:47:36 -04:00
Alara Dirik	441658dd6c	Add focalnet backbone (#23104 ) Adds FocalNet backbone to return features from all stages	2023-05-03 19:32:42 +03:00
Julien Chaumond	ca7eb27ed5	[doc] Try a few ≠ ways of linking to Papers, users, and org profiles (#22611 ) * [doc] Try a few ≠ ways of linking to Papers, users, and org profiles * Empty commit * Empty commit now that the backend is fixed --------- Co-authored-by: Lysandre <lysandre@huggingface.co>	2023-05-03 18:23:09 +02:00
Nayeon Han	fbe0178f08	docs: ko: update `_toctree.yml` (#23112 ) * docs: ko: update `_toctree.yml` * fix: ko: update toc * fix: resolve suggestions * fix: resolve build issue --------- Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-05-03 11:04:58 -04:00
Mayank Agarwal	c4e32e206f	Add support for beam search's num_return_sequencs flag in flax (#23082 ) * add code for numReturnSeq * add flax support for num return sequences * Make Fix up for changes * add test for num return sequences * lint	2023-05-03 10:50:34 -04:00
Xuehai Pan	ee4bc07474	Support union types `X \| Y` syntax for `HfArgumentParser` for Python 3.10+ (#23126 ) * Support union types `X \| Y` syntax for `HfArgumentParser` for Python 3.10+ * Add tests for PEP 604 for `HfArgumentParser` * Reorganize tests	2023-05-03 10:49:54 -04:00
Alara Dirik	56b8d49ddf	Fix ConvNext V2 paramater naming issue (#23122 ) Fixes the parameter naming issue in ConvNextV2GRN module	2023-05-03 17:21:27 +03:00
Samin Yasar	b53004fdce	Add resources for LayoutLmV2 and reformat documentation resources (#23115 ) * add resources for layoutlmv2 * remove 🌎 from some resources	2023-05-03 09:53:00 -04:00
Joao Gante	3a08dc63fd	Generate: better warnings with pipelines (#23128 )	2023-05-03 14:43:17 +01:00
Manuel	2a16d8b275	improve unclear documentation (#23123 )	2023-05-03 09:36:30 -04:00
Joao Gante	a0bd464776	Generate: correct beam search length on score calculation for multi batch generation (#23127 )	2023-05-03 14:29:55 +01:00
Joao Gante	ce31e3c8bf	Generate: slow assisted generation test (#23125 )	2023-05-03 14:24:50 +01:00
Younes Belkada	b61d5b47f6	[`Doctest`] Fix pix2struct doctest (#23121 ) fix pix2struct doctest	2023-05-03 11:21:59 +02:00
Sylvain Gugger	4b6aecb48e	Pin numba for now (#23118 )	2023-05-02 22:02:39 -04:00
Gregory (Gabriel) Barello	3ff89f29f5	Fixed default config for `Pix2Struct` model to set `Pix2StructTextModel` to `is_decoder=True` (#23051 ) added as default keyword arg. to in order to correctly configure the decoder	2023-05-02 13:40:41 -04:00
Alex Punnen	805db1fe13	num_noise_spans should be <= num_items #22246 (#22938 )	2023-05-02 13:07:30 -04:00
Michael Benayoun	9ade58f055	[ONNX] Sam fix (#23110 ) * [WIP] Fix for the ONNX export * Apply changes * Remove commented code * Resolve todo * empty -> zeros * fix slow tests --------- Co-authored-by: younesbelkada <younesbelkada@gmail.com>	2023-05-02 17:20:02 +02:00
Younes Belkada	4baa34c18f	[`Flava`] Fix flava `torch.distributed.nn.functional import all_gather` issue (#23108 ) * fix flava `torch.distributed.nn.functional import all_gather` issue * more comments	2023-05-02 15:35:57 +02:00
Wing Lian	c6c6658499	Fix check for backword_pos (#23075 )	2023-05-02 09:32:42 -04:00
Sohyun Sim	f31a510bb3	🌐 [i18n-KO] Translated `torchscript.mdx` to Korean (#23060 ) * docs: ko: torchscript.mdx * feat: gpt and deepl draft * fix: manual edits * fix: edit anchor link * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> * fix: resolve suggestions --------- Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-05-02 09:27:59 -04:00
peter-sk	2b0c924568	GPT2ForQuestionAnswering (#23030 ) * first draft - gives index error in question_answering.py * maturing * no labels * pipeline should know about QA * fixing checks * formatting * fixed docstring * make sure legacy code executes * comment * like this --------- Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>	2023-05-02 09:25:46 -04:00
regisss	bcedd0a471	Save the tokenizer and image preprocessor after training a model with the contrastive image-text example (#23035 ) Save tokenizer and image preprocessor	2023-05-02 09:23:16 -04:00
Arun Brahma	85e3d7b6a0	added type hints for blip_text pytorch model (#23071 ) * added type hints for blip_text pytorch model * updated type hints for blip_text pytorch model	2023-05-02 13:22:31 +01:00
dependabot[bot]	b8648290d2	Bump flask from 2.0.3 to 2.3.2 in /examples/research_projects/decision_transformer (#23094 ) Bump flask in /examples/research_projects/decision_transformer Bumps [flask](https://github.com/pallets/flask) from 2.0.3 to 2.3.2. - [Release notes](https://github.com/pallets/flask/releases) - [Changelog](https://github.com/pallets/flask/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/flask/compare/2.0.3...2.3.2) --- updated-dependencies: - dependency-name: flask dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-05-01 20:15:11 -04:00
Nayeon Han	f9426eeb94	🌐 [i18n-KO] Translated `tasks/zero_shot_image_classification.mdx` to Korean (#23065 ) docs: ko: `tasks/zero_shot_image_classification` Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>	2023-05-01 20:11:56 -04:00
Jungnerd	92601d2eb1	🌐 [i18n-KO] Translated `tasks/question_answering.mdx` to Korean (#23012 ) docs: ko: `tasks/question_answering.mdx` to Korean Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>	2023-05-01 11:05:40 -04:00
Hyeonseo Yun	78941b9fe5	🌐 [i18n-KO] Translated `tasks/image_classification.mdx` to Korean (#23048 ) * ko: init: tasks/image_classification.mdx * docs: ko: trans: tasks/image_classification.mdx * docs: ko: revise: sync glossary and spell check tasks/image_classification.mdx * docs: ko: revise: sync glossary tasks/image_classification.mdx * fix: resolve suggestions (github) image_classification.mdx Only github code review suggestion Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * fix: resolve suggestions image_classification.mdx Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com> --------- Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>	2023-05-01 09:50:05 -04:00
Zachary Mueller	9884862383	Depricate xpu_backend for ddp_backend (#23085 ) * Depricate xpu_backend for ddp_backend * Typo * Only do a minor deprecation, no need for major Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-05-01 09:44:47 -04:00
IMvision12	95cf3725b4	Fix `convnext` __init__ (#23078 ) fix	2023-05-01 09:36:42 -04:00
Ashwin Mathur	487f132a6f	Add `BioGPTForSequenceClassification` (#22253 ) * added BioGptForSequenceClassification * added source of copied code * typo * Format code with black * Update comments for copied code * Remove code copy comment * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Fix failing tests * Update code copied from comments * Fix code quality * Update src/transformers/models/biogpt/modeling_biogpt.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Fix lint error * Update src/transformers/models/biogpt/modeling_biogpt.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Rename model to biogpt for consistency * Add PipelineTesterMixin to test_modeling_biogpt.py * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Resolve merge confict --------- Co-authored-by: Guillem García Subies <37592763+GuillemGSubies@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-05-01 09:17:27 -04:00
Xin Wen	549e5f9f23	Fix string syntax error in logger warning message (additional comma) (#23083 )	2023-05-01 09:14:16 -04:00
Stephen Kaplan	9062d1bab2	Fix grammar error in summarization pipeline (#23080 ) Fix minor grammar issue	2023-05-01 08:54:57 -04:00
Joao Gante	849367ccf7	Generate: prepare assisted generation for release (#23052 )	2023-04-29 10:53:30 +01:00
Yih-Dar	dfeb5aa6a9	extend the test files (#23043 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-28 22:25:34 +02:00
Yih-Dar	b6865b9bef	Fix model parallelism for `BridgeTower` (#23039 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-28 21:53:58 +02:00
Younes Belkada	d337631b91	🚨🚨🚨 [`Blip`] remove labels masking (#23024 ) * remove labels masking * add fix on blip tf	2023-04-28 18:24:51 +02:00
s-JoL	c2c99dc7ef	add open-llama model with ckpt (#22795 ) * update Open-Llama model * update * update format * update doc * update * update stable embedding test * update test case * update format * update readme * fix typo * update name * remove tokenizer and update format * remove convert_open_llama_weights_to_hf * update warning and doc_string --------- Co-authored-by: songliang.bayesian <songliang.bayesian@bytedance.com>	2023-04-28 11:01:32 -04:00
Yih-Dar	0bf34b1c9f	Skip pt/flax equivalence tests in pytorch `bigbird` test file (#23040 ) skip Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-28 17:00:13 +02:00
Shivam Shrirao	4d0ea3d269	Cuda rng_state_all is used when saving in distributed mode so same should also be used when loading (#23045 ) cuda rng state should be all for distributed bc all were saved	2023-04-28 09:28:01 -04:00
Maria Khalusova	521a8ffa53	[docs] Doc TOC updates (#23049 ) * first draft of toc restructure * polishing based on feedback	2023-04-28 09:24:28 -04:00
Hyeonseo Yun	4893d919f1	🌐 [i18n-KO] Translated `model_sharing.mdx` to Korean (#22991 ) * docs: ko: init: model_sharing.mdx * docs: ko: trans: model_sharing.mdx Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com> Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com> Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com> Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com> Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com> * docs: ko: revised: apply code reviews model_sharing.mdx Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * docs: ko: revised: apply aditional reviews model_sharing.mdx 1. Natural Expression 2. `파인 튜닝` to `미세 조정` 3. Glossary Sync Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com> Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com> * docs: ko: revised: apply aditional reviews in model_sharing.mdx 1. Spell check 2. Natural Expression 3. Sync Glossary Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com> * docs: ko: revised: `프로그래밍 방식` to `API` in model_sharing.mdx Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com> --------- Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com> Co-authored-by: Nayeon Han <nayeon2.han@gmail.com> Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com> Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>	2023-04-28 09:20:33 -04:00
Maxime Méloux	9b435204b1	Add Trainer support for ReduceLROnPlateau (#23010 ) * Add Trainer support for ReduceLROnPlateau Fixes #16503 * Remove training argument and add default instance --------- Co-authored-by: mmeloux <maxime.meloux@loria.fr>	2023-04-28 09:17:30 -04:00
Yih-Dar	cf7baf4060	Make `_test_xla_generate` less flaky (#22996 ) * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-28 13:27:28 +02:00
Ehsan M. Kermani	a0e7332839	Fix CLAP link across all READMEs (#23032 ) * Fix CLAP link across all READMEs * Fix copy only for en	2023-04-27 18:07:02 -04:00
Bartosz Szmelczynski	88399476c3	Fix bigbird random attention (#21023 ) * switch np.random.permutation to jax.random.permuation * remove comments * remove leftover comment * skip similarity tests * modify indices_prng_key usage, add deterministic behaviour * update style * remove unused import * remove copy statement since classes are not identical * remove numpy import * revert removing copied from statements * make style from copied * remove copied from statement * update copied from statement to include only np.ndarry * add deterministic args, unittestskip equivalence tests	2023-04-27 13:52:28 -04:00
Yih-Dar	27b66bea01	Update `BridgeTowerModelTester` (#23029 ) * update --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-04-27 18:26:17 +02:00

1 2 3 4 5 ...

12771 Commits