Gregory (Gabriel) Barello
3ff89f29f5
Fixed default config for Pix2Struct model to set Pix2StructTextModel to is_decoder=True ( #23051 )
...
added as default keyword arg. to in order to correctly configure the decoder
2023-05-02 13:40:41 -04:00
Alex Punnen
805db1fe13
num_noise_spans should be <= num_items #22246 ( #22938 )
2023-05-02 13:07:30 -04:00
Michael Benayoun
9ade58f055
[ONNX] Sam fix ( #23110 )
...
* [WIP] Fix for the ONNX export
* Apply changes
* Remove commented code
* Resolve todo
* empty -> zeros
* fix slow tests
---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
2023-05-02 17:20:02 +02:00
Younes Belkada
4baa34c18f
[Flava] Fix flava `torch.distributed.nn.functional import all_gather` issue ( #23108 )
...
* fix flava `torch.distributed.nn.functional import all_gather` issue
* more comments
2023-05-02 15:35:57 +02:00
Wing Lian
c6c6658499
Fix check for backword_pos ( #23075 )
2023-05-02 09:32:42 -04:00
Sohyun Sim
f31a510bb3
🌐 [i18n-KO] Translated torchscript.mdx to Korean ( #23060 )
...
* docs: ko: torchscript.mdx
* feat: gpt and deepl draft
* fix: manual edits
* fix: edit anchor link
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
* fix: resolve suggestions
---------
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
2023-05-02 09:27:59 -04:00
peter-sk
2b0c924568
GPT2ForQuestionAnswering ( #23030 )
...
* first draft - gives index error in question_answering.py
* maturing
* no labels
* pipeline should know about QA
* fixing checks
* formatting
* fixed docstring
* make sure legacy code executes
* comment
* like this
---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
2023-05-02 09:25:46 -04:00
regisss
bcedd0a471
Save the tokenizer and image preprocessor after training a model with the contrastive image-text example ( #23035 )
...
Save tokenizer and image preprocessor
2023-05-02 09:23:16 -04:00
Arun Brahma
85e3d7b6a0
added type hints for blip_text pytorch model ( #23071 )
...
* added type hints for blip_text pytorch model
* updated type hints for blip_text pytorch model
2023-05-02 13:22:31 +01:00
dependabot[bot]
b8648290d2
Bump flask from 2.0.3 to 2.3.2 in /examples/research_projects/decision_transformer ( #23094 )
...
Bump flask in /examples/research_projects/decision_transformer
Bumps [flask](https://github.com/pallets/flask ) from 2.0.3 to 2.3.2.
- [Release notes](https://github.com/pallets/flask/releases )
- [Changelog](https://github.com/pallets/flask/blob/main/CHANGES.rst )
- [Commits](https://github.com/pallets/flask/compare/2.0.3...2.3.2 )
---
updated-dependencies:
- dependency-name: flask
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-01 20:15:11 -04:00
Nayeon Han
f9426eeb94
🌐 [i18n-KO] Translated tasks/zero_shot_image_classification.mdx to Korean ( #23065 )
...
docs: ko: `tasks/zero_shot_image_classification`
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
2023-05-01 20:11:56 -04:00
Jungnerd
92601d2eb1
🌐 [i18n-KO] Translated tasks/question_answering.mdx to Korean ( #23012 )
...
docs: ko: `tasks/question_answering.mdx` to Korean
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
2023-05-01 11:05:40 -04:00
Hyeonseo Yun
78941b9fe5
🌐 [i18n-KO] Translated tasks/image_classification.mdx to Korean ( #23048 )
...
* ko: init: tasks/image_classification.mdx
* docs: ko: trans: tasks/image_classification.mdx
* docs: ko: revise: sync glossary and spell check tasks/image_classification.mdx
* docs: ko: revise: sync glossary tasks/image_classification.mdx
* fix: resolve suggestions (github) image_classification.mdx
Only github code review suggestion
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* fix: resolve suggestions image_classification.mdx
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
2023-05-01 09:50:05 -04:00
Zachary Mueller
9884862383
Deprecate xpu_backend for ddp_backend ( #23085 )
...
* Deprecate xpu_backend for ddp_backend
* Typo
* Only do a minor deprecation, no need for major
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-05-01 09:44:47 -04:00
IMvision12
95cf3725b4
Fix convnext __init__ ( #23078 )
...
fix
2023-05-01 09:36:42 -04:00
Ashwin Mathur
487f132a6f
Add BioGPTForSequenceClassification ( #22253 )
...
* added BioGptForSequenceClassification
* added source of copied code
* typo
* Format code with black
* Update comments for copied code
* Remove code copy comment
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Fix failing tests
* Update code copied from comments
* Fix code quality
* Update src/transformers/models/biogpt/modeling_biogpt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Fix lint error
* Update src/transformers/models/biogpt/modeling_biogpt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Rename model to biogpt for consistency
* Add PipelineTesterMixin to test_modeling_biogpt.py
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Resolve merge conflict
---------
Co-authored-by: Guillem García Subies <37592763+GuillemGSubies@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2023-05-01 09:17:27 -04:00
Xin Wen
549e5f9f23
Fix string syntax error in logger warning message (additional comma) ( #23083 )
2023-05-01 09:14:16 -04:00
Stephen Kaplan
9062d1bab2
Fix grammar error in summarization pipeline ( #23080 )
...
Fix minor grammar issue
2023-05-01 08:54:57 -04:00
Joao Gante
849367ccf7
Generate: prepare assisted generation for release ( #23052 )
2023-04-29 10:53:30 +01:00
Yih-Dar
dfeb5aa6a9
extend the test files ( #23043 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-28 22:25:34 +02:00
Yih-Dar
b6865b9bef
Fix model parallelism for BridgeTower ( #23039 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-28 21:53:58 +02:00
Younes Belkada
d337631b91
🚨 🚨 🚨 [Blip] remove labels masking ( #23024 )
...
* remove labels masking
* add fix on blip tf
2023-04-28 18:24:51 +02:00
s-JoL
c2c99dc7ef
add open-llama model with ckpt ( #22795 )
...
* update Open-Llama model
* update
* update format
* update doc
* update
* update stable embedding test
* update test case
* update format
* update readme
* fix typo
* update name
* remove tokenizer and update format
* remove convert_open_llama_weights_to_hf
* update warning and doc_string
---------
Co-authored-by: songliang.bayesian <songliang.bayesian@bytedance.com>
2023-04-28 11:01:32 -04:00
Yih-Dar
0bf34b1c9f
Skip pt/flax equivalence tests in pytorch bigbird test file ( #23040 )
...
skip
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-28 17:00:13 +02:00
Shivam Shrirao
4d0ea3d269
Cuda rng_state_all is used when saving in distributed mode so same should also be used when loading ( #23045 )
...
cuda rng state should be all for distributed bc all were saved
2023-04-28 09:28:01 -04:00
Maria Khalusova
521a8ffa53
[docs] Doc TOC updates ( #23049 )
...
* first draft of toc restructure
* polishing based on feedback
2023-04-28 09:24:28 -04:00
Hyeonseo Yun
4893d919f1
🌐 [i18n-KO] Translated model_sharing.mdx to Korean ( #22991 )
...
* docs: ko: init: model_sharing.mdx
* docs: ko: trans: model_sharing.mdx
Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* docs: ko: revised: apply code reviews model_sharing.mdx
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* docs: ko: revised: apply additional reviews model_sharing.mdx
1. Natural Expression
2. `파인 튜닝` to `미세 조정`
3. Glossary Sync
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
* docs: ko: revised: apply additional reviews in model_sharing.mdx
1. Spell check
2. Natural Expression
3. Sync Glossary
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
* docs: ko: revised: `프로그래밍 방식` to `API` in model_sharing.mdx
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
---------
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
2023-04-28 09:20:33 -04:00
Maxime Méloux
9b435204b1
Add Trainer support for ReduceLROnPlateau ( #23010 )
...
* Add Trainer support for ReduceLROnPlateau
Fixes #16503
* Remove training argument and add default instance
---------
Co-authored-by: mmeloux <maxime.meloux@loria.fr>
2023-04-28 09:17:30 -04:00
Yih-Dar
cf7baf4060
Make _test_xla_generate less flaky ( #22996 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-28 13:27:28 +02:00
Ehsan M. Kermani
a0e7332839
Fix CLAP link across all READMEs ( #23032 )
...
* Fix CLAP link across all READMEs
* Fix copy only for en
2023-04-27 18:07:02 -04:00
Bartosz Szmelczynski
88399476c3
Fix bigbird random attention ( #21023 )
...
* switch np.random.permutation to jax.random.permuation
* remove comments
* remove leftover comment
* skip similarity tests
* modify indices_prng_key usage, add deterministic behaviour
* update style
* remove unused import
* remove copy statement since classes are not identical
* remove numpy import
* revert removing copied from statements
* make style from copied
* remove copied from statement
* update copied from statement to include only np.ndarry
* add deterministic args, unittestskip equivalence tests
2023-04-27 13:52:28 -04:00
Yih-Dar
27b66bea01
Update BridgeTowerModelTester ( #23029 )
...
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-27 18:26:17 +02:00
peter-sk
d65b14ed67
added GPTNeoForTokenClassification ( #22908 )
...
* added GPTNeoForTokenClassification
* add to top-level init
* fixup
* test
* more fixup
* add to gpt_neo.mdx
* repo consistency
* dummy copy
* fix copies
* optax >= 0.1.5 assumes jax.Array exists - which it doesn't for jax <= 0.3.6
* merge with main made this superfluous
* added classifier_dropout
* remove legacy code
* removed fmt:on/off
removed expected_outputs
* doc style fix
* classifier_dropout is always in config
---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
2023-04-27 12:10:03 -04:00
peter-sk
614e191c4d
added GPTNeoXForTokenClassification ( #23002 )
...
* initial commit
* added GPTNeoXForTokenClassification
* typo
* doc
fixed extra comma that turned into a tuple
* unifying variable names
fixing forward call
* classifier_dropout is in config
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-04-27 11:08:26 -04:00
Arthur
1933231a0a
[MEGA] nit size test ( #23028 )
...
* add fast not use warning
* properly check sequence_length vs chunk_size
* fixup
2023-04-27 16:21:00 +02:00
Yih-Dar
a4908da04e
Fix the expected error in test_offline_mode_pipeline_exception ( #23022 )
...
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-27 14:22:05 +02:00
Nayeon Han
e28fff18b8
🌐 [i18n-KO] Translated multilingual.mdx to Korean ( #23008 )
...
docs: ko: `multilingual.mdx`
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
2023-04-27 08:06:12 -04:00
Younes Belkada
9435cc6670
[Pix2Struct] Fix pix2struct doctest ( #23023 )
...
fix pix2struct doctest
2023-04-27 11:48:02 +02:00
fxmarty
3042c63a95
Add methods to PreTrainedModel to use PyTorch's BetterTransformer ( #21259 )
...
* fix mess
* better documentation
* typo
* fix doc
* update
* add test
* fix test
* more tests
* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* move to utils
* Apply suggestions from code review
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>
* nit
---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>
2023-04-27 11:03:42 +02:00
Sylvain Gugger
0083b149e9
🚨 🚨 🚨 Use default ignore index in Luke ( #23014 )
...
Use default ignore index in Luke
2023-04-26 17:55:01 -04:00
Zachary Mueller
8b129030cb
Bring back PartialState DeepSpeed ( #22921 )
...
* Bring back deepspeed integration
* Branchname
* Self-scheduled
* newline
* Use deepspeed env var
* Remove comment
* Del env var after partialstate
2023-04-26 15:35:59 -04:00
Sylvain Gugger
4331923b97
Fix None value when adding info to auto_map ( #22990 )
2023-04-26 14:39:36 -04:00
Arthur
d0b5002378
[Llama Tokenizer] Fast llama template ( #22959 )
...
* update template processing for llama fast to add eos
* style
* update
* address training from new issue
* fix
* update
* special tokens can be given even if not used
2023-04-26 19:13:20 +02:00
Younes Belkada
00bc6e2067
[PEFT] Add HFTracer support for PEFT ( #23006 )
...
* add hack fx
* continue hacking
* final changes
* Test
* Add a keys method
* Fix keys method
* revert unneeded changes
* small nit
---------
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>
2023-04-26 18:45:05 +02:00
Younes Belkada
304aacac90
🚨 🚨 🚨 [Pix2Struct] Attempts to fix training issues 🚨 🚨 🚨 ( #23004 )
...
* multiple fixes
- add `add_special_tokens` to `True` by default
- remove label smoothing and labels masking
* fix test
2023-04-26 18:29:25 +02:00
Javier de la Rosa
ba0dc54576
Add gradient checkpointing to Whisper Flax ( #22954 )
...
* Add gradient checkpointing to Whisper Flax
* self.gradient_checkpointing only needed in nn.Module, removing unnecessary comments
2023-04-26 12:19:16 -04:00
Yih-Dar
a72b82ebe6
Remove a failing ONNX test ( #23011 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-04-26 17:44:12 +02:00
Ritik Nandwal
20ac86c6f1
Add TensorFlow Wav2Vec2 for sequence classification ( #22073 )
...
* Add initial changes for TF wav2vec2 for sequence classification
* Add suggested changes
* Add serving and serving output methods
* Add serving_output implementation and fix layer_weights
* Add fixes
* Fixed test cases
* Fixing test and adding suggested changes
2023-04-26 13:35:30 +01:00
Hyeonseo Yun
4c2b4c4c3c
🌐 [i18n-KO] Translated token_classification.mdx to Korean ( #22945 )
...
* docs: ko: init: token_classification.mdx
* docs: ko: trans: tasks/token_classification.mdx
* docs: ko: revise: apply suggestions tasks/token_classification.mdx
right vocabulary, spell check, natural expression
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
* docs: ko: revise: `Hub` to `허브` in tasks/token_classification.mdx
* docs: ko: revise: `example` in tasks/token_classification.mdx
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
Co-Authored-By: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* docs: ko: revise: ko expression in tasks/token_classification.mdx
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
* Revert "docs: ko: revise: ko expression in tasks/token_classification.mdx"
This reverts commit 8efe28059b.
* docs: ko: revise: `quick tour` in tasks/token_classification.mdx
Co-Authored-By: Gabriel Yang <gabrielwithhappy@gmail.com>
---------
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
2023-04-26 07:56:14 -04:00
Sohyun Sim
6dc2474727
🌐 [i18n-KO] Translated tasks/image_captioning.mdx to Korean ( #22943 )
...
docs: ko: tasks/image_captioning.mdx
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
2023-04-26 07:54:58 -04:00