Sylvain Gugger
22888d3082
Remove Niels from templates ( #21564 )
2023-02-14 09:47:43 -05:00
Sylvain Gugger
68b21b37ea
Final cleanup of TOKENIZER_FOR_DOC ( #21565 )
...
FInal cleanup of TOKENIZER_FOR_DOC
2023-02-14 09:47:32 -05:00
Sylvain Gugger
c6f163c786
Skip failing test
2023-02-14 09:20:47 -05:00
Joao Gante
a81fe4e1df
Generate: input expansion for any model input ( #21624 )
2023-02-14 14:16:22 +00:00
Joao Gante
13e03e619d
Generate: filter encoder inputs when its signature does not accept wildcards ( #21603 )
2023-02-14 10:46:46 +00:00
Younes Belkada
41fa672df1
Enable requires_grad
on input embedding to train on top of frozen layers ( #21598 )
...
* v1
* make fixup
* add more methods
2023-02-14 09:43:06 +01:00
Zachary Mueller
8c5026628a
Add in big model inference to issue template ( #21611 )
...
* Add in big model inference to issue template
* Trigger
* Untrigger
* empty test commit
2023-02-13 16:40:34 -05:00
Joao Gante
56b03c96b8
Fix TF CTC tests ( #21606 )
2023-02-13 21:23:00 +00:00
Yih-Dar
cbecf121cd
Fix env. variable type issue in testing ( #21609 )
...
* fix env issue
* fix env issue
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-13 20:53:26 +01:00
Steven Liu
5987e0ab69
Clarify available pipelines in quicktour ( #21607 )
...
clarify available pipelines
2023-02-13 11:37:48 -08:00
Stas Bekman
101b9a7eb1
[deepspeed] performance docs ( #21573 )
...
* [deepspeed] performance docs
* fix
* re-org
* update
* update
* a new NCCL Collectives section
* inference
* Update docs/source/en/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* suggestion
* Update docs/source/en/main_classes/deepspeed.mdx
* suggestion
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-13 10:29:12 -08:00
Stas Bekman
68eff4036d
Update setup.py ( #21584 )
...
* Update setup.py
* suggestions
2023-02-13 10:12:14 -08:00
Nolwenn Bernard
a27074abb5
[i18n-fr] Translate quicktour page to French ( #21589 )
...
* Translate quicktour to French
* Traduction missing task
2023-02-13 13:05:31 -05:00
Joao Gante
fa4bdb0a40
Generate: correct default model input creation for decoder-only models ( #21580 )
2023-02-13 17:04:49 +00:00
Yih-Dar
edc1e734bf
Fix Blip-2 CI ( #21595 )
...
* use fp16
* use fp16
* use fp16
* use fp16
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-13 16:44:27 +01:00
Warren Green
fd5320bb57
Add missing arguemtn to run_clip.py ( #21588 )
2023-02-13 10:27:23 -05:00
Yi Wang
1210c72e82
Correct Markdown bullets indentation ( #21583 )
2023-02-13 10:22:29 -05:00
dependabot[bot]
92487f5d0b
Bump ipython from 8.1.1 to 8.10.0 in /examples/research_projects/decision_transformer ( #21577 )
...
Bump ipython in /examples/research_projects/decision_transformer
Bumps [ipython](https://github.com/ipython/ipython ) from 8.1.1 to 8.10.0.
- [Release notes](https://github.com/ipython/ipython/releases )
- [Commits](https://github.com/ipython/ipython/compare/8.1.1...8.10.0 )
---
updated-dependencies:
- dependency-name: ipython
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-02-13 10:21:50 -05:00
Billy Lee
dee4d72e72
annotated TFvisionEncoderDecoder input type hints ( #21432 )
...
* annotated TFvisionEncoderDecoder input type hints
Co-authored-by: JuheonChu <chuj@dickinson.edu>
Co-authored-by: AdiaWu <wua@dickinson.edu>
* fixed failing tests
* make fix-copies
* failed test fix
* style fix
* revert
---------
Co-authored-by: JuheonChu <chuj@dickinson.edu>
Co-authored-by: AdiaWu <wua@dickinson.edu>
Co-authored-by: Matt <rocketknight1@gmail.com>
2023-02-13 15:20:18 +00:00
Younes Belkada
1666c42f0b
[bnb
] Let's make the daily CI green 🍏 ( #21597 )
...
* fix bnb slow test
* make fixup
2023-02-13 16:18:50 +01:00
Joao Gante
24273268b7
Generate: Fix flaky indexing error in test_constrained_beam_search_generate_dict_output
( #21561 )
2023-02-13 15:12:07 +00:00
Dzmitry Pletnikau
93ed89bf40
Add inputs_embeds
support when generating with GPT-J ( #21575 )
2023-02-13 15:11:40 +00:00
Christopher Akiki
dcb5e01197
[MINOR] Fix link in timeseries transformer docs ( #21602 )
...
[MINOR] Fix link
I'm not sure this will also fix the currently broken link in the docs (Specifically here: https://huggingface.co/docs/transformers/model_doc/time_series_transformer ) whereby clicking on `kashif` attempts to link to the following non-existent URL: https://huggingface.co/docs/transformers/model_doc/%3Chttps://huggingface.co/kashif
2023-02-13 10:11:16 -05:00
Thomas Paviot
dd7429d645
Remove trailing 'extractive' word from en documentation ( #21594 )
...
remove trailing word
2023-02-13 10:09:00 -05:00
Joao Gante
4be75e9728
CI: skip failing TF hubert test ( #21601 )
...
skip test
2023-02-13 09:34:23 -05:00
Maria Khalusova
3baa407f92
Add: document question answering task guide ( #21518 )
...
* document question answering guide
* Added the list of supported models
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* switched to AutoProcessor
* feedback addressed
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* Update docs/source/en/tasks/document_question_answering.mdx
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* more feedback addressed
* addressed comments about evaluation loss
* added appropriate image link
* make style
* typo fix
* resolving toc conflict
* fixed the image link
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2023-02-13 09:24:56 -05:00
Joao Gante
eb6c59bc78
Generate: TF supports multiple eos tokens ( #21571 )
2023-02-13 12:24:22 +00:00
Sylvain Gugger
c836f77266
Fix quality on main (ruff release)
2023-02-11 20:09:16 -05:00
Younes Belkada
75a208ef66
[Blip2
] Add int8 support for blip2-flan-t5-xxl
( #21574 )
...
add int8 support
2023-02-10 23:28:24 +01:00
Yih-Dar
b47a16743b
Remove more unused attributes in config classes ( #21543 )
...
* Remove unused decoder_layerdrop
* Update SPECIAL_CASES_TO_ALLOW for MT5Config
* Remove unused position_embedding_init_scale
* Remove unused decoder_max_relative_position
* Use unused decoder_max_relative_position
* Remove unused init_std
* Remove unused forgotten attributes
* Remove unused patch_norm
* Remove unused max_seq_len
* Update SPECIAL_CASES_TO_ALLOW for OneFormerConfig
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-10 22:57:28 +01:00
Han Wu
862e8e4f4a
Added timesformer configuration ( #21446 )
...
* Added timesformer configuration
Co-authored-by: JuheonChu <chuj@dickinson.edu>
* Create documentation_tests.txt
* Update documentation_tests.txt
Co-authored-by: JuheonChu <chuj@dickinson.edu>
* Delete documentation_tests.txt
Updates, Deleting "src/transformers/utils/documentation_tests.txt" file.
Co-authored-by: JuheonChu <chuj@dickinson.edu>
* Create documentation_tests.txt
Co-authored-by: JuheonChu <chuj@dickinson.edu>
* Delete documentation_tests.txt
Co-authored-by: JuheonChu <chuj@dickinson.edu>
---------
Co-authored-by: JuheonChu <chuj@dickinson.edu>
2023-02-10 22:54:40 +01:00
amyeroberts
cb56590111
Replace input_values_processing with unpack_inputs ( #21502 )
...
* Replace input_values_prrocessing with unpack_inputs
* Skip test failing with OOM
* Update tests
2023-02-10 18:19:39 +00:00
Shubhamai
557125637d
improving contributing tests section ( #21569 )
...
* improving tests section
* documenting other env variables
2023-02-10 13:17:01 -05:00
Stas Bekman
3b7ed25da9
[deepspeed] deal with models w/o config.hidden_size
( #21504 )
...
* [deepspeed] deal with models w/o config.hidden_size
* typo
* typo
2023-02-10 09:44:19 -08:00
Yih-Dar
4f831e661b
Goodbye to Blip-2 doctests ( #21566 )
...
Byebye Blip-2 doctest
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2023-02-10 18:37:06 +01:00
Sayak Paul
e2ec3089ce
[Tasks] Adds image captioning ( #21512 )
...
* add: task guide on image cpationing.
* Empty commit to trigger CI
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* address additional comments from the PR.
* fix: wording.
* Update docs/source/en/tasks/image_captioning.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-10 22:52:12 +05:30
Stas Bekman
2f5507580b
[from_pretrained] extend torch_dtype="auto"
to look up config.torch_dtype
first, expand docs ( #21524 )
...
* [from_pretrained] expand on torch_dtype entry
* fold 4 into 1
* style
* support torch_dtype='config' plus tests
* style
* oops
* fold config into auto, fix bug
* fix check
* better log
* better log
* clean up
2023-02-10 09:09:21 -08:00
Shubhamai
9e40bba6ba
[Tests] Improve flax test_attention_outputs ( #21486 )
...
improving flax tests
2023-02-10 11:31:49 -05:00
steventk-g
c88b11c591
Add _mp_fn to run_mae.py for XLA testing ( #21551 )
...
Update run_mae.py
2023-02-10 09:53:55 -05:00
Patrick von Platen
b20147a3c8
[Variant] Make sure variant files are not incorrectly deleted ( #21562 )
...
* [Variant] Make sure variant files are not incorrectly deleted
* Apply suggestions from code review
* fix
2023-02-10 15:44:51 +01:00
Yueming Hao
51c3f42d8e
Replace inefficient torch.sqrt taking scalar input with numpy.sqrt ( #21496 )
...
* fix rsqrt
* fix typo
2023-02-10 09:44:14 -05:00
Jannis Vamvas
b0d539ccad
Add X-MOD ( #20939 )
...
* Add X-MOD to Readme
* Add documentation for X-MOD
* Implement X-MOD
* Fix formatting of X-MOD docs
* Change signature of X-MOD forward methods to use lang_ids
* Minor changes
* Rebase with main and run make fix-copies
* Make suggested changes to docstrings
* Improve code readability
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* Fix code style
* Conversion script: Remove asserts and type annotations
* Remove _TOKENIZER_FOR_DOC
* XMOD -> Xmod
* Update copyright note
* Fix doctests
* Fix docstring
* Add integration test for FillMaskPipeline
* Revert "Add integration test for FillMaskPipeline"
This reverts commit 4381eb3b1d0f5d85785f89caba83928e6efa6d1f.
* Add end-to-end integration test for mask fill
* make style
* Rebase with main and make fix-copies
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
2023-02-10 15:32:06 +01:00
GeneZC
adb2503ea3
Fix stuff related to the causal_mask in CodeGen. ( #21527 )
...
* Fix stuff related to the causal_mask in CodeGen.
1. Line 613, `_keys_to_ignore_on_load_missing = [r"h\.\d+\.attn\.masked_bias", r"h\.\d+\.attn\.bias"]` => `_keys_to_ignore_on_load_missing = [r"h\.\d+\.attn\.causal_mask"]` to load correctly from CodeGen checkpoint without `causal_mask`.
2. Line 152, `causal_mask = self.causal_mask[:, :, key_length - query_length : key_length, :key_length]
` => `causal_mask = self.causal_mask[:, :, key_length - query_length : key_length, :key_length].bool()
` to alleviate potential user warning saying like `UserWarning: where received a uint8 condition tensor. This behavior is deprecated and will be removed in a future version of PyTorch. Use a boolean condition instead.`.
* Revert the .bool()
Revert the .bool() and leave it to the future PR.
2023-02-10 09:16:23 -05:00
Quentin Meeus
5b72b3412b
Remove CLI spams with Whisper FeatureExtractor ( #21267 )
...
* Remove CLI spams with Whisper FeatureExtractor
Whisper feature extractor representation includes the MEL filters, a list of list that is represented as ~16,000 lines. This needlessly spams the command line. I added a `__repr__` method that replaces this list with a string "<array of shape (80, 201)>"
* Remove mel_filters from to_dict output
Credits to @ArthurZucker
* remove unused import
* update feature extraction tests for the changes in to_dict
2023-02-10 09:15:16 -05:00
Eugene Zapolsky
129011c20b
adding a tip for deepspeed integration in multi-node environment ( #21459 )
...
* adding note concerning use_node_local_storage
* overriding checkpoint.use_node_local_storage if save_on_each_node == True
* add more content
* add more content
* improve
* style
---------
Co-authored-by: Stas Bekman <stas@stason.org>
2023-02-10 09:12:56 -05:00
Katie Le
21a2d900ec
Added with torch.no_grad() to Camembert integration test ( #21544 )
...
add with torch.no_grad() to Camembert integration test
Co-authored-by: Bibi <Bibi@katies-mac.local>
2023-02-10 10:58:29 +01:00
Younes Belkada
f83942684d
[pipeline
] A simple fix for half-precision & 8bit models ( #21479 )
...
* v1 fix
* adapt from suggestions
* make style
* fix tests
* add gpu tests
* update docs
* fix other tests
* Apply suggestions from code review
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
* better fix
* make fixup
* better example
* revert changes
* proposal
* more elegant solution
* Update src/transformers/pipelines/automatic_speech_recognition.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2023-02-10 10:26:17 +01:00
Sylvain Gugger
97d3390fc8
Skip failing test for now
2023-02-09 20:11:26 -05:00
Katie Le
23c146c38b
Added with torch.no_grad() to XLM-Roberta integration test ( #21547 )
...
* added with torch.no_grad() to the integration tests and applied make style
* added with torch.no_grad() to xlm roberta forward pass
---------
Co-authored-by: Bibi <Bibi@katies-mac.local>
2023-02-09 21:49:54 +01:00
Sylvain Gugger
04b2f13c37
🚨 🚨 🚨 Enforce single model initialization ( #21431 )
...
* Enforce single model initialization
* Add OneFormer example for problem 3
* Do it the Stas way
* Actually rename the uses...
* Rewrite test
* Try to change the test this way
* Fix all init slow/fast tests
* Break connection
* Fix more tests
* Fix test for initialization
* Remove custom test
* Quality
* Fix last failing tests
* The end?
2023-02-09 15:46:26 -05:00