Sylvain Gugger
03af4c42a6
Docstring check ( #26052 )
...
* Fix number of minimal calls to the Hub with peft integration
* Alternate design
* And this way?
* Revert
* Nits to fix
* Add util
* Print when changes are made
* Add list to ignore
* Add more rules
* Manual fixes
* deal with kwargs
* deal with enum defaults
* avoid many digits for floats
* Manual fixes
* Fix regex
* Fix regex
* Auto fix
* Style
* Apply script
* Add ignored list
* Add check that templates are filled
* Adding to CI checks
* Add back semi-fix
* Ignore more objects
* More auto-fixes
* Ignore missing objects
* Remove temp semi-fix
* Fixes
* Update src/transformers/models/pvt/configuration_pvt.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update utils/check_docstrings.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update src/transformers/utils/quantization_config.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Deal with float defaults
* Fix small defaults
* Address review comment
* Treat
* Post-rebase cleanup
* Address review comment
* Update src/transformers/models/deprecated/mctct/configuration_mctct.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
* Address review comment
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2023-10-04 15:13:37 +02:00
Bharat Ramanathan
122b2657f8
feat: add trainer label to wandb run upon initialization ( #26466 )
2023-10-04 14:57:41 +02:00
statelesshz
4fdf47cd3c
Extend Trainer to enable Ascend NPU to use the fused Adamw optimizer when training ( #26194 )
2023-10-04 14:57:11 +02:00
dependabot[bot]
fc296f419e
Bump pillow from 9.3.0 to 10.0.1 in /examples/research_projects/decision_transformer ( #26580 )
...
Bump pillow in /examples/research_projects/decision_transformer
Bumps [pillow](https://github.com/python-pillow/Pillow ) from 9.3.0 to 10.0.1.
- [Release notes](https://github.com/python-pillow/Pillow/releases )
- [Changelog](https://github.com/python-pillow/Pillow/blob/main/CHANGES.rst )
- [Commits](https://github.com/python-pillow/Pillow/compare/9.3.0...10.0.1 )
---
updated-dependencies:
- dependency-name: pillow
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-04 11:52:46 +02:00
김준재_T3056
2f3ea08a07
docs: feat: add clip notebook resources from OSSCA community ( #26505 )
2023-10-03 11:20:22 -07:00
Lysandre Debut
5c66378cea
[Tokenizers] Skip tests temporarily ( #26574 )
...
* Skip tests temporarily
* style
* Add additional test
2023-10-03 19:43:42 +02:00
Jungnerd
2c7b26f508
🌐 [i18n-KO] Translated semantic_segmentation.md
to Korean ( #26515 )
...
* docs: ko: sementic_segmentation.md
* feat: manual draft
* fix: manual edits
* fix: resolve suggestions
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* fix: resolve suggestions
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* fix: edit the title
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-10-03 10:25:50 -07:00
Sanchit Gandhi
57f44dc428
[Whisper] Allow basic text normalization ( #26149 )
...
* [Whisper] Allow basic text normalization
* up
* style copies
2023-10-03 17:57:16 +01:00
Lysandre
bd6205919a
v4.35.0.dev0
2023-10-03 16:54:37 +02:00
Arthur
c26b2a29e5
[Nougat
] from transformers import * ( #26562 )
...
* remove unprotected import to PIL
* cleanup
---------
Co-authored-by: Lysandre <lysandre@huggingface.co>
2023-10-03 16:32:12 +02:00
Younes Belkada
2aef9a9601
[PEFT
] Final fixes ( #26559 )
...
* fix issues with PEFT
* logger warning futurewarning issues
* fixup
* adapt from suggestions
* oops
* rm test
2023-10-03 14:53:09 +02:00
Younes Belkada
ae9a344cce
[Mistral
] Add Flash Attention-2 support for mistral
( #26464 )
...
* add FA-2 support for mistral
* fixup
* add sliding windows
* fixing few nits
* v1 slicing cache - logits do not match
* add comment
* fix bugs
* more mem efficient
* add warning once
* add warning once
* oops
* fixup
* more comments
* copy
* add safety checker
* fixup
* Update src/transformers/models/mistral/modeling_mistral.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* copied from
* up
* raise when padding side is right
* fixup
* add doc + few minor changes
* fixup
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-10-03 13:44:46 +02:00
Arthur
1a2e966cfe
Nit-added-tokens ( #26538 )
...
* fix stripping
* nits
* fix another test
* styling
* fix?
* update
* revert bad merge
* found the bug
* YES SIR
* is that change really required?
* make fast even faster
* re order functions
2023-10-03 12:23:46 +02:00
Srijan Sahay Srivastava
245da7ed38
[Doctest] Add configuration_encoder_decoder.py
( #26519 )
...
* [Doctest] Add configuration_encoder_decoder.py
Added configuration_encoder_decoder.py to utils/documentation_tests.txt for doctest
* Revert "[Doctest] Add configuration_encoder_decoder.py"
This reverts commit bd653535a4
.
* [Doctest] Add configuration_encoder_decoder.py
add configuration_encoder_decoder.py to utils/documentation_tests.txt
* [Doctest] Add configuration_encoder_decoder.py
add configuration_encoder_decoder.py to utils/documentation_tests.txt
* [Doctest] Add configuration_encoder_decoder.py
add configuration_encoder_decoder.py to utils/documentation_tests.txt
* changed as per request
* fixed line 46
2023-10-03 11:21:24 +02:00
Funtowicz Morgan
3632fb3c25
[AMD] Add initial version for run_tests_multi_gpu ( #26346 )
...
* Add initial version for run_tests_multi_gpu
* Trigger change in BERT
* fix typo setup -> setup_gpu
* Add tag mi210
* Enable multi-gpu jobs
* One more
* Use dynamic device allocation
* Attempt to fix syntax for docker create
* fix script path
* fix
* temp machine type
* fix label
* Enable multi-gpu tests
* Rename multi-amd-gpu to multi-gpu
* Let's not be lazy dude
* Update rocm-smi output
* Add gpu_flavour in the matrix
* Fix typos
* merge single/multi dispatch into the matrix
* Format.
* Revert BERT's change
---------
Co-authored-by: Guillaume LEGENDRE <glegendre01@gmail.com>
2023-10-03 11:13:45 +02:00
Sanchit Gandhi
768aa3d9cd
[Wav2Vec2 and Co] Update init tests for PT 2.1 ( #26494 )
2023-10-03 10:52:34 +02:00
Nathan Cahill
b5ca8fcd20
Add tokenizer kwargs to fill mask pipeline. ( #26234 )
...
* add tokenizer kwarg inputs
* Adding tokenizer_kwargs to _sanitize_parameters
* Add truncation=True example to tests
* Update test_pipelines_fill_mask.py
* Update test_pipelines_fill_mask.py
* make fix-copies and make style
* Update fill_mask.py
Replace single tick with double
* make fix-copies
* Style
---------
Co-authored-by: Lysandre <lysandre@huggingface.co>
2023-10-03 10:25:10 +02:00
Patrick von Platen
df6a855e7b
[RFC, Logging] Change warning to info ( #26545 )
...
[Logging] Change warning to info
2023-10-03 08:55:39 +02:00
dependabot[bot]
cf345d5f38
Bump urllib3 from 1.26.9 to 1.26.17 in /examples/research_projects/decision_transformer ( #26554 )
...
Bump urllib3 in /examples/research_projects/decision_transformer
Bumps [urllib3](https://github.com/urllib3/urllib3 ) from 1.26.9 to 1.26.17.
- [Release notes](https://github.com/urllib3/urllib3/releases )
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst )
- [Commits](https://github.com/urllib3/urllib3/compare/1.26.9...1.26.17 )
---
updated-dependencies:
- dependency-name: urllib3
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-03 08:55:12 +02:00
dependabot[bot]
6de6fdd06d
Bump urllib3 from 1.26.5 to 1.26.17 in /examples/research_projects/visual_bert ( #26552 )
...
Bump urllib3 in /examples/research_projects/visual_bert
Bumps [urllib3](https://github.com/urllib3/urllib3 ) from 1.26.5 to 1.26.17.
- [Release notes](https://github.com/urllib3/urllib3/releases )
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst )
- [Commits](https://github.com/urllib3/urllib3/compare/1.26.5...1.26.17 )
---
updated-dependencies:
- dependency-name: urllib3
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-03 08:55:01 +02:00
dependabot[bot]
e092b4ad68
Bump urllib3 from 1.26.5 to 1.26.17 in /examples/research_projects/lxmert ( #26551 )
...
Bump urllib3 in /examples/research_projects/lxmert
Bumps [urllib3](https://github.com/urllib3/urllib3 ) from 1.26.5 to 1.26.17.
- [Release notes](https://github.com/urllib3/urllib3/releases )
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst )
- [Commits](https://github.com/urllib3/urllib3/compare/1.26.5...1.26.17 )
---
updated-dependencies:
- dependency-name: urllib3
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-10-03 08:54:50 +02:00
Florian Zimmermeister
9ed538f2e6
[i18n-DE] contribute chapter ( #26481 )
...
* start working on next chapter
* finish testing
* Update docs/source/de/testing.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/de/testing.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/de/testing.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-10-02 09:56:40 -07:00
Wonhyeong Seo
1470f731b6
🌐 [i18n-KO] Translated tokenizer_summary.md
to Korean ( #26243 )
...
* docs: ko: toknenizer_summary.md
Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-Authored-By: Juntae <79131091+sronger@users.noreply.github.com>
Co-Authored-By: Injin Paek <71638597+eenzeenee@users.noreply.github.com>
* update review
* fix: resolve suggestions
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>
Co-Authored-By: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* fix: resolve suggestions
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
---------
Co-authored-by: HanNayeoniee <nayeon2.han@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Juntae <79131091+sronger@users.noreply.github.com>
Co-authored-by: Injin Paek <71638597+eenzeenee@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>
2023-10-02 09:55:33 -07:00
Arthur
c20d90d577
add build_inputs_with_special_tokens to LlamaFast ( #26297 )
...
* add build_inputs_with_special_tokens to LlamaFast
* fixup
* Update src/transformers/models/llama/tokenization_llama_fast.py
2023-10-02 18:30:44 +02:00
Arthur
bab3331906
Code-llama-nit ( #26300 )
...
* fix encoding when the fill token is None
* add tests and edge cases
* fiuxp
* Update tests/models/code_llama/test_tokenization_code_llama.py
2023-10-02 18:29:27 +02:00
Adithya Hegde Kota
4b4c6aabfb
[Doctest] Add configuration_roformer.py ( #26530 )
...
* [Doctest] Add configuration_roformer.py
* [Doctest] Add configuration_roformer.py
* [Doctest] Add configuration_roformer.py
* [Doctest] Add configuration_roformer.py
* Removed documentation_test.txt
* Removed configuration_roformer.py
* Update not_doctested.txt
2023-10-02 17:19:13 +02:00
Arthur
e4dad4fe32
Remove-warns ( #26483 )
...
* fix stripping
* remove some warnings and update some warnings
* revert changes for other PR
2023-10-02 16:52:00 +02:00
Younes Belkada
1b8decb04c
[PEFT
] Protect adapter_kwargs
check ( #26537 )
...
Update modeling_utils.py
2023-10-02 14:59:24 +02:00
Arthur
63864e057f
Fix model integration ci ( #26322 )
...
* fix wav2vec2
* nit
* stash
* one more file to update
* fix byt5
* vocab size is 256, don't change that!
* use other revision
* test persimon in smaller size
* style
* tests
* nits
* update add tokens from pretrained
* test tokenization
* nits
* potential fnet fix?
* more nits
* nits
* correct test
* assert close
* udpate
* ouch
* fix it
* some more nits
* FINALLU
* use `adept` checkpoints
* more adept checkpoints
* that was invlved!
2023-10-02 13:55:46 +02:00
Younes Belkada
6824461f2a
[core
/ auto
] Fix bnb test with code revision + bug with code revision ( #26431 )
...
* fix bnb test with code revision
* fix test
* Apply suggestions from code review
* Update src/transformers/models/auto/auto_factory.py
* Update src/transformers/models/auto/auto_factory.py
* Update src/transformers/models/auto/auto_factory.py
2023-10-02 11:35:07 +02:00
Younes Belkada
24178c2461
[PEFT
] Pass token when calling find_adapter_config
( #26488 )
...
* try
* nit
* nits
2023-10-02 11:23:03 +02:00
HelgeS
7d6627d0d9
Fix broken link to video classification task ( #26487 )
2023-10-02 11:19:11 +02:00
marcmk6
6d02ca4bb9
Fix issue of canine forward requiring input_ids anyway ( #26290 )
...
* fix issue of canine forward requires input_ids anyway
The `forward` requires `input_ids` for deriving other variables in all cases. Change this to use the given one between `input_ids` and `inputs_embeds`
* fix canine forward
The current `forward` requires (the shape of) `input_ids` for deriving other variables whenever `input_ids` or `inputs_embeds` is provided. Change this to use the given one instead of `input_ids` all the time.
* fix format
* fix format
2023-10-02 11:06:40 +02:00
Jan Philipp Harries
7d77d7f79c
Fix requests connection error during modelcard creation ( #26518 )
...
fix requests connection error
Co-authored-by: Jan Philipp Harries <jphme@users.noreply.github.com>
2023-10-02 10:52:51 +02:00
Florian Seiler
ca0379b8c8
Fix num_heads in _upad_input ( #26490 )
...
* Fix num_heads in _upad_input
The variable num_key_value_heads has falsely been named num_heads, which led to reshaping the query_layer using the wrong attention head count. (It would have been enough to use the correct variable self.num_heads instead of num_heads, but I renamed num_heads to num_key_value_heads for clarity)
* fixed copies using make fix-copies and ran make fixup
---------
Co-authored-by: fseiler <f.seiler@jerocom.de>
2023-10-02 10:10:19 +02:00
Lysandre Debut
67239f7360
Revert falcon exception ( #26472 )
...
* Revert "Falcon: fix revision propagation (#26006 )"
This reverts commit 118c676ef3
.
* Revert "Put Falcon back (#25960 )"
This reverts commit 22a69f1d7d
.
2023-10-02 09:13:19 +02:00
Sanchit Gandhi
0b192de1f3
[ASR Pipe] Improve docs and error messages ( #26476 )
...
* improve docs/errors
* why whisper
* Update docs/source/en/pipeline_tutorial.md
Co-authored-by: Lysandre Debut <hi@lysand.re>
* specify pt only
---------
Co-authored-by: Lysandre Debut <hi@lysand.re>
2023-09-29 18:32:37 +01:00
Sanchit Gandhi
68e85fc822
[Flax Examples] Seq2Seq ASR Fine-Tuning Script ( #21764 )
...
* from seq2seq speech
* [Flax] Example script for speech seq2seq
* tests and fixes
* make style
* fix: label padding tokens
* fix: label padding tokens over list
* update ln names for Whisper
* try datasets iter loader
* create readme and append results
* style
* make style
* adjust lr
* use pt dataloader
* make fast
* pin gen max len
* finish
* add pt to requirements for test
* fix pt -> torch
* add accelerate
2023-09-29 16:42:58 +01:00
Yih-Dar
391177441b
Avoid all-zeor attnetion mask used in testing ( #26469 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-29 11:06:06 +02:00
Yih-Dar
9b23d0de0e
Skip 2 failing persimmon pipeline tests for now ( #26485 )
...
skip
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-09-29 10:52:18 +02:00
Maria Khalusova
14170b784b
[docs] navigation improvement between text gen pipelines and text gen params ( #26477 )
...
* navigation improvement between text generation pipelines and text generation docs
* make style
2023-09-29 09:43:39 +02:00
Steven Liu
7bb1c0c147
[docs] Update offline mode docs ( #26478 )
...
update
2023-09-29 09:42:21 +02:00
Sanchit Gandhi
211f93aab9
[Whisper Tokenizer] Make decoding faster after adding timestamps ( #26299 )
...
make decoding faster
2023-09-28 19:02:27 +01:00
Amelie Schreiber
4e931a8eb3
Esm checkpointing ( #26454 )
...
* Fixed in-place operation error in EsmEmbeddings
* Fixed in-place operation error in EsmEmbeddings again
---------
Co-authored-by: Schreiber-Finance <amelie.schreiber.finance@gmail.com>
2023-09-28 18:49:39 +01:00
Marc Sun
5e11d72d4d
fix_mbart_tied_weights ( #26422 )
...
* fix_mbart_tied_weights
* add test
2023-09-28 15:08:35 +02:00
fleance
216dff7549
Do not warn about unexpected decoder weights when loading T5EncoderModel and LongT5EncoderModel ( #26211 )
...
Ignore decoder weights when using T5EncoderModel and LongT5EncoderModel
Both T5EncoderModel and LongT5EncoderModel do not have any decoder layers, so
loading a pretrained model checkpoint such as t5-small will give warnings about
keys found in the model checkpoint that are not in the model itself.
To prevent this log warning, r"decoder" has been added to _keys_to_ignore_on_load_unexpected for
both T5EncoderModel and LongT5EncoderModel
2023-09-28 11:27:43 +02:00
Younes Belkada
38e96324ef
[PEFT
] introducing adapter_kwargs
for loading adapters from different Hub location (subfolder
, revision
) than the base model ( #26270 )
...
* make use of adapter_revision
* v1 adapter kwargs
* fix CI
* fix CI
* fix CI
* fixup
* add BC
* Update src/transformers/integrations/peft.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fixup
* change it to error
* Update src/transformers/modeling_utils.py
* Update src/transformers/modeling_utils.py
* fixup
* change
* Update src/transformers/integrations/peft.py
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-09-28 11:13:03 +02:00
Fakhir Ali
52e2c13da3
[VITS] Fix speaker_embed device mismatch ( #26115 )
...
* [VITS] Fix speaker_embed device mismatch
- pass device arg to speaker_id tensor
* [VITS] put speaker_embed on device when int
* [VITS] device=self.device
instead of self.embed_speaker.weight.device
* [VITS] make tensor directly on device
using torch.full()
2023-09-28 10:56:36 +02:00
Tanishq Abraham
098c3f400c
change mention of decoder_input_ids to input_ids and same with decode_inputs_embeds ( #26406 )
...
* change mention of decoder_input_ids to input_ids and same with decoder_input_embeds
* Style
---------
Co-authored-by: Lysandre <lysandre@huggingface.co>
2023-09-28 10:15:48 +02:00
Phuc Van Phan
ba47efbfe4
docs: change assert to raise and some small docs ( #26232 )
...
* docs: change assert to raise and some small docs
* docs: add rule and some document
* fix: fix bug
* fix: fix bug
* chorse: revert logging
* chorse: revert
2023-09-28 10:14:17 +02:00