amyeroberts
90e8263d91
Add methods to update and verify out_features out_indices ( #23031 )
...
* Add methods to update and verify out_features out_indices
* Safe update for config attributes
* Fix function names
* Save config correctly
* PR comments - use property setters
* PR comment - directly set attributes
* Update test
* Add updates to recently merged focalnet backbone
2023-05-04 10:15:06 +01:00
peter-sk
78b7debf56
GPTNeoForQuestionAnswering ( #23057 )
...
* first draft - gives index error in question_answering.py
* maturing
* no labels
* pipeline should know about QA
* fixing checks
* formatting
* fixed docstring
* initial commit
* formatting
* adding the class to many places
* towards less unhappy checks
* nearly there
* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* avoid error
* moving to device of star/end_logits
---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-05-03 15:59:19 -04:00
Robert Stone
b6933d76d2
Tidy Pytorch GLUE benchmark example ( #23134 )
...
Migration to Evaluate for metric is not quite complete
2023-05-03 15:50:41 -04:00
Alara Dirik
b0a78091a5
Remove redundant print statements ( #23133 )
...
remove redundant print statements
2023-05-03 18:04:48 +01:00
regisss
e3ee45aa54
Enable to use custom tracer in FX symbolic_trace
( #23105 )
...
* Enable to use custom tracer in FX `symbolic_trace`
* Integrate feedback from review
* Formatting
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-05-03 12:47:36 -04:00
Alara Dirik
441658dd6c
Add focalnet backbone ( #23104 )
...
Adds FocalNet backbone to return features from all stages
2023-05-03 19:32:42 +03:00
Julien Chaumond
ca7eb27ed5
[doc] Try a few ≠ ways of linking to Papers, users, and org profiles ( #22611 )
...
* [doc] Try a few ≠ ways of linking to Papers, users, and org profiles
* Empty commit
* Empty commit now that the backend is fixed
---------
Co-authored-by: Lysandre <lysandre@huggingface.co>
2023-05-03 18:23:09 +02:00
Nayeon Han
fbe0178f08
docs: ko: update _toctree.yml
( #23112 )
...
* docs: ko: update `_toctree.yml`
* fix: ko: update toc
* fix: resolve suggestions
* fix: resolve build issue
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
2023-05-03 11:04:58 -04:00
Mayank Agarwal
c4e32e206f
Add support for beam search's num_return_sequencs flag in flax ( #23082 )
...
* add code for numReturnSeq
* add flax support for num return sequences
* Make Fix up for changes
* add test for num return sequences
* lint
2023-05-03 10:50:34 -04:00
Xuehai Pan
ee4bc07474
Support union types X | Y
syntax for HfArgumentParser
for Python 3.10+ ( #23126 )
...
* Support union types `X | Y` syntax for `HfArgumentParser` for Python 3.10+
* Add tests for PEP 604 for `HfArgumentParser`
* Reorganize tests
2023-05-03 10:49:54 -04:00
Alara Dirik
56b8d49ddf
Fix ConvNext V2 paramater naming issue ( #23122 )
...
Fixes the parameter naming issue in ConvNextV2GRN module
2023-05-03 17:21:27 +03:00
Samin Yasar
b53004fdce
Add resources for LayoutLmV2 and reformat documentation resources ( #23115 )
...
* add resources for layoutlmv2
* remove 🌎 from some resources
2023-05-03 09:53:00 -04:00
Joao Gante
3a08dc63fd
Generate: better warnings with pipelines ( #23128 )
2023-05-03 14:43:17 +01:00
Manuel
2a16d8b275
improve unclear documentation ( #23123 )
2023-05-03 09:36:30 -04:00
Joao Gante
a0bd464776
Generate: correct beam search length on score calculation for multi batch generation ( #23127 )
2023-05-03 14:29:55 +01:00
Joao Gante
ce31e3c8bf
Generate: slow assisted generation test ( #23125 )
2023-05-03 14:24:50 +01:00
Younes Belkada
b61d5b47f6
[Doctest
] Fix pix2struct doctest ( #23121 )
...
fix pix2struct doctest
2023-05-03 11:21:59 +02:00
Sylvain Gugger
4b6aecb48e
Pin numba for now ( #23118 )
2023-05-02 22:02:39 -04:00
Gregory (Gabriel) Barello
3ff89f29f5
Fixed default config for Pix2Struct
model to set Pix2StructTextModel
to is_decoder=True
( #23051 )
...
added as default keyword arg. to in order to correctly configure the decoder
2023-05-02 13:40:41 -04:00
Alex Punnen
805db1fe13
num_noise_spans should be <= num_items #22246 ( #22938 )
2023-05-02 13:07:30 -04:00
Michael Benayoun
9ade58f055
[ONNX] Sam fix ( #23110 )
...
* [WIP] Fix for the ONNX export
* Apply changes
* Remove commented code
* Resolve todo
* empty -> zeros
* fix slow tests
---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
2023-05-02 17:20:02 +02:00
Younes Belkada
4baa34c18f
[Flava
] Fix flava torch.distributed.nn.functional import all_gather
issue ( #23108 )
...
* fix flava `torch.distributed.nn.functional import all_gather` issue
* more comments
2023-05-02 15:35:57 +02:00
Wing Lian
c6c6658499
Fix check for backword_pos ( #23075 )
2023-05-02 09:32:42 -04:00
Sohyun Sim
f31a510bb3
🌐 [i18n-KO] Translated torchscript.mdx
to Korean ( #23060 )
...
* docs: ko: torchscript.mdx
* feat: gpt and deepl draft
* fix: manual edits
* fix: edit anchor link
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
2023-05-02 09:27:59 -04:00
peter-sk
2b0c924568
GPT2ForQuestionAnswering ( #23030 )
...
* first draft - gives index error in question_answering.py
* maturing
* no labels
* pipeline should know about QA
* fixing checks
* formatting
* fixed docstring
* make sure legacy code executes
* comment
* like this
---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
2023-05-02 09:25:46 -04:00
regisss
bcedd0a471
Save the tokenizer and image preprocessor after training a model with the contrastive image-text example ( #23035 )
...
Save tokenizer and image preprocessor
2023-05-02 09:23:16 -04:00
Arun Brahma
85e3d7b6a0
added type hints for blip_text pytorch model ( #23071 )
...
* added type hints for blip_text pytorch model
* updated type hints for blip_text pytorch model
2023-05-02 13:22:31 +01:00
dependabot[bot]
b8648290d2
Bump flask from 2.0.3 to 2.3.2 in /examples/research_projects/decision_transformer ( #23094 )
...
Bump flask in /examples/research_projects/decision_transformer
Bumps [flask](https://github.com/pallets/flask ) from 2.0.3 to 2.3.2.
- [Release notes](https://github.com/pallets/flask/releases )
- [Changelog](https://github.com/pallets/flask/blob/main/CHANGES.rst )
- [Commits](https://github.com/pallets/flask/compare/2.0.3...2.3.2 )
---
updated-dependencies:
- dependency-name: flask
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-01 20:15:11 -04:00
Nayeon Han
f9426eeb94
🌐 [i18n-KO] Translated tasks/zero_shot_image_classification.mdx
to Korean ( #23065 )
...
docs: ko: `tasks/zero_shot_image_classification`
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
2023-05-01 20:11:56 -04:00
Jungnerd
92601d2eb1
🌐 [i18n-KO] Translated tasks/question_answering.mdx
to Korean ( #23012 )
...
docs: ko: `tasks/question_answering.mdx` to Korean
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
2023-05-01 11:05:40 -04:00
Hyeonseo Yun
78941b9fe5
🌐 [i18n-KO] Translated tasks/image_classification.mdx
to Korean ( #23048 )
...
* ko: init: tasks/image_classification.mdx
* docs: ko: trans: tasks/image_classification.mdx
* docs: ko: revise: sync glossary and spell check tasks/image_classification.mdx
* docs: ko: revise: sync glossary tasks/image_classification.mdx
* fix: resolve suggestions (github) image_classification.mdx
Only github code review suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestions image_classification.mdx
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
2023-05-01 09:50:05 -04:00
Zachary Mueller
9884862383
Depricate xpu_backend for ddp_backend ( #23085 )
...
* Depricate xpu_backend for ddp_backend
* Typo
* Only do a minor deprecation, no need for major
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-05-01 09:44:47 -04:00
IMvision12
95cf3725b4
Fix convnext
__init__ ( #23078 )
...
fix
2023-05-01 09:36:42 -04:00
Ashwin Mathur
487f132a6f
Add BioGPTForSequenceClassification
( #22253 )
...
* added BioGptForSequenceClassification
* added source of copied code
* typo
* Format code with black
* Update comments for copied code
* Remove code copy comment
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Fix failing tests
* Update code copied from comments
* Fix code quality
* Update src/transformers/models/biogpt/modeling_biogpt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Fix lint error
* Update src/transformers/models/biogpt/modeling_biogpt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Rename model to biogpt for consistency
* Add PipelineTesterMixin to test_modeling_biogpt.py
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Resolve merge confict
---------
Co-authored-by: Guillem García Subies <37592763+GuillemGSubies@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-05-01 09:17:27 -04:00
Xin Wen
549e5f9f23
Fix string syntax error in logger warning message (additional comma) ( #23083 )
2023-05-01 09:14:16 -04:00
Stephen Kaplan
9062d1bab2
Fix grammar error in summarization pipeline ( #23080 )
...
Fix minor grammar issue
2023-05-01 08:54:57 -04:00
Joao Gante
849367ccf7
Generate: prepare assisted generation for release ( #23052 )
2023-04-29 10:53:30 +01:00
Yih-Dar
dfeb5aa6a9
extend the test files ( #23043 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-28 22:25:34 +02:00
Yih-Dar
b6865b9bef
Fix model parallelism for BridgeTower
( #23039 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-28 21:53:58 +02:00
Younes Belkada
d337631b91
🚨 🚨 🚨 [Blip
] remove labels masking ( #23024 )
...
* remove labels masking
* add fix on blip tf
2023-04-28 18:24:51 +02:00
s-JoL
c2c99dc7ef
add open-llama model with ckpt ( #22795 )
...
* update Open-Llama model
* update
* update format
* update doc
* update
* update stable embedding test
* update test case
* update format
* update readme
* fix typo
* update name
* remove tokenizer and update format
* remove convert_open_llama_weights_to_hf
* update warning and doc_string
---------
Co-authored-by: songliang.bayesian <songliang.bayesian@bytedance.com>
2023-04-28 11:01:32 -04:00
Yih-Dar
0bf34b1c9f
Skip pt/flax equivalence tests in pytorch bigbird
test file ( #23040 )
...
skip
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-28 17:00:13 +02:00
Shivam Shrirao
4d0ea3d269
Cuda rng_state_all is used when saving in distributed mode so same should also be used when loading ( #23045 )
...
cuda rng state should be all for distributed bc all were saved
2023-04-28 09:28:01 -04:00
Maria Khalusova
521a8ffa53
[docs] Doc TOC updates ( #23049 )
...
* first draft of toc restructure
* polishing based on feedback
2023-04-28 09:24:28 -04:00
Hyeonseo Yun
4893d919f1
🌐 [i18n-KO] Translated model_sharing.mdx
to Korean ( #22991 )
...
* docs: ko: init: model_sharing.mdx
* docs: ko: trans: model_sharing.mdx
Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* docs: ko: revised: apply code reviews model_sharing.mdx
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* docs: ko: revised: apply aditional reviews model_sharing.mdx
1. Natural Expression
2. `파인 튜닝` to `미세 조정`
3. Glossary Sync
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
* docs: ko: revised: apply aditional reviews in model_sharing.mdx
1. Spell check
2. Natural Expression
3. Sync Glossary
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
* docs: ko: revised: `프로그래밍 방식` to `API` in model_sharing.mdx
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
---------
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
2023-04-28 09:20:33 -04:00
Maxime Méloux
9b435204b1
Add Trainer support for ReduceLROnPlateau ( #23010 )
...
* Add Trainer support for ReduceLROnPlateau
Fixes #16503
* Remove training argument and add default instance
---------
Co-authored-by: mmeloux <maxime.meloux@loria.fr>
2023-04-28 09:17:30 -04:00
Yih-Dar
cf7baf4060
Make _test_xla_generate
less flaky ( #22996 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-28 13:27:28 +02:00
Ehsan M. Kermani
a0e7332839
Fix CLAP link across all READMEs ( #23032 )
...
* Fix CLAP link across all READMEs
* Fix copy only for en
2023-04-27 18:07:02 -04:00
Bartosz Szmelczynski
88399476c3
Fix bigbird random attention ( #21023 )
...
* switch np.random.permutation to jax.random.permuation
* remove comments
* remove leftover comment
* skip similarity tests
* modify indices_prng_key usage, add deterministic behaviour
* update style
* remove unused import
* remove copy statement since classes are not identical
* remove numpy import
* revert removing copied from statements
* make style from copied
* remove copied from statement
* update copied from statement to include only np.ndarry
* add deterministic args, unittestskip equivalence tests
2023-04-27 13:52:28 -04:00
Yih-Dar
27b66bea01
Update BridgeTowerModelTester
( #23029 )
...
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-27 18:26:17 +02:00