Martina Fumanelli
f6ad0e0556
Add installation.mdx Italian translation ( #17530 )
...
* Add the Italian translation of the file installation.mdx and edit _toctree
* Add the Italian translation of the file installation.mdx and edit _toctree
2022-06-06 07:48:08 -04:00
Jonatas Grosman
4aed1dc81b
Adding the Portuguese version of the tasks/token_classification.mdx documentation ( #17492 )
...
* add tasks/token_classification pt doc structure
* add tasks/token_classification pt doc translation
* add tasks/token_classification pt doc translation
2022-06-06 07:47:34 -04:00
Anugunj Naman
da71df1afc
fix integration test levit ( #17555 )
2022-06-06 13:47:32 +02:00
Stas Bekman
26e5e129b4
[deepspeed] fix load_best_model test ( #17550 )
2022-06-03 11:19:03 -07:00
Britney Muller
72f5b94984
Update index.mdx ( #17547 )
...
This PR updates our Expert Acceleration Program image with a new image featuring our experts.
This is similar to our Transformers/README.md image update that has proven to be successful.
2022-06-03 12:56:37 -05:00
Sylvain Gugger
c4e58cd8ba
Clean imports to fix test_fetcher ( #17531 )
...
* Clean imports to fix test_fetcher
* Add dependencies printer
* Update utils/tests_fetcher.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* Fix Perceiver import
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
2022-06-03 12:34:41 -04:00
bhuang
254d9c068e
Update run_glue_no_trainer.py ( #17546 )
2022-06-03 12:29:37 -04:00
Sylvain Gugger
8343901263
Fix all offload and MP tests ( #17533 )
2022-06-03 09:59:13 -04:00
amyeroberts
1c57242d7b
Fix bug - layer names and activation from previous refactor ( #17524 )
...
* Fix activation and layers in MLP head
* Remove unused import
2022-06-03 09:31:10 -04:00
Patrick Deutschmann
babeff5524
Add support for Perceiver ONNX export ( #17213 )
...
* Start adding perceiver support for ONNX
* Fix pad token bug for fast tokenizers
* Fix formatting
* Make get_preprocesor more opinionated (processor priority, otherwise tokenizer/feature extractor)
* Clean docs format
* Minor cleanup following @sgugger's comments
* Fix typo in docs
* Fix another docs typo
* Fix one more typo in docs
* Update src/transformers/onnx/utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/onnx/utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/onnx/utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-06-03 07:40:22 -04:00
Robert Dargavel Smith
5c17918fe4
Allow from transformers import TypicalLogitsWarper ( #17477 )
...
* Allow from transformers import TypicalLogitsWarper
* Added TypicalLogitsWarper
* Allow from transformers import TypicalLogitsWarper
* Allow from transformers import TypicalLogitsWarper
* Allow from transformers import TypicalLogitsWarper
* Allow from transformers import TypicalLogitsWarper
Added TypicalLogitsWarper
Allow from transformers import TypicalLogitsWarper
Allow from transformers import TypicalLogitsWarper
Allow from transformers import TypicalLogitsWarper
2022-06-03 11:08:35 +02:00
DanielHesslow
607acd4fbd
Add Gated-SiLU to T5 ( #17420 )
...
* Add gated-silu to t5 architecture to support UL2
* Fix error message
* formatting
* formatting again
* refactor
* fix classnames in _init_weights
* remove is_gated
* add test
* fix test
* Try without the test?
* Add back the test.
* Improve error message.
Co-authored-by: Daniel Hesslow <daniel@lighton.ai>
2022-06-03 10:56:37 +02:00
lewtun
1c220ced8e
Update URL for Hub PR docs ( #17532 )
2022-06-02 21:52:30 +02:00
Arthur
013462c57b
fix OPT-Flax CI tests ( #17512 )
2022-06-02 18:52:46 +02:00
Stas Bekman
2f59ad1609
[trainer/deepspeed] load_best_model (reimplement re-init) ( #17151 )
...
* [trainer/deepspeed] load_best_model
* to sync with DS PR #1947
* simplify
* rework load_best_model test
* cleanup
* bump deepspeed>=0.6.5
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
2022-06-02 09:14:21 -07:00
Moreno La Quatra
046c5ea906
Implemented loss for training AudioFrameClassification ( #17513 )
...
* Implemented loss for training AudioFrameClassification
* reported changes in wav2vec2 main class and used make copies to propagate
* running black for code formatting
2022-06-02 17:40:02 +02:00
Kamal Raj
085321c9a1
Update configuration_auto.py ( #17527 )
2022-06-02 10:37:00 -04:00
Sylvain Gugger
048dd73bba
Check list of models in the main README and sort it ( #17517 )
...
* Script for README
* Fix copies
* Complete error message
2022-06-02 08:10:08 -04:00
Sylvain Gugger
588d8f1f26
Fix when Accelerate is not installed ( #17518 )
2022-06-02 07:45:41 -04:00
Sylvain Gugger
f128ccb997
Clean README in post release job as well. ( #17519 )
2022-06-02 07:44:03 -04:00
Yih-Dar
216499bfcc
Fix CI tests hang forever ( #17471 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-06-02 10:30:54 +02:00
Yih-Dar
659b27fd26
Print more library versions in CI ( #17384 )
...
* print more lib. versions and just befor test runs
* update print_env_pt.py
* rename to print_env
* Disable warning + better job name
* print python version
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-06-02 10:24:16 +02:00
Yih-Dar
0932adb3e8
Split push CI into 2 workflows ( #17369 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-06-02 10:19:26 +02:00
Yih-Dar
58fb3c9f98
Fix Tapas tests ( #17510 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-06-01 21:01:32 +02:00
Joao Gante
ca1f1c8685
CLI: tool to convert PT into TF weights and open hub PR ( #17497 )
2022-06-01 18:52:07 +01:00
Zachary Mueller
3766df4fe1
Fix flakey no-trainer test ( #17515 )
2022-06-01 13:40:49 -04:00
fireindark707
028d4b7c8b
Deal with the error when task is regression ( #16330 )
2022-06-01 11:15:53 -04:00
Anugunj Naman
84aaadd8c5
Adding LeViT Model by Facebook ( #17466 )
...
* levit files
* levit tests
* weights script
* weights script
* update
* style fixes
* few minor corrections
* Added teacher model
* edit docs
* fix-copies
* style fixes
* pr error resolved
* Update README.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/en/index.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/en/model_doc/levit.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/en/model_doc/levit.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/en/model_doc/levit.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update docs/source/en/model_doc/levit.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/__init__.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/__init__.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/configuration_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/configuration_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/feature_extraction_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* suggested pr changes
* style fixes
* minor bug
* update
* minor doc edit
* style
* Update src/transformers/models/levit/feature_extraction_levit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/levit/feature_extraction_levit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update tests/models/levit/test_modeling_levit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/levit/modeling_levit.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/levit/feature_extraction_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* residual layer readable
* style
* Update docs/source/en/model_doc/levit.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/feature_extraction_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/feature_extraction_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/feature_extraction_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/feature_extraction_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/modeling_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/modeling_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/modeling_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update tests/models/levit/test_feature_extraction_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* change checkpoints and style
* update
* minor changes
* Update src/transformers/models/levit/modeling_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update src/transformers/models/levit/modeling_levit.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-06-01 17:06:20 +02:00
Yih-Dar
1d2b57b8a2
Fix CTRL tests ( #17508 )
...
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-06-01 16:27:23 +02:00
Yih-Dar
693720e567
Fix LayoutXLMProcessorTest ( #17506 )
...
* fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-06-01 16:26:37 +02:00
Ryokan RI
4d1ce39683
Debug LukeForMaskedLM ( #17499 )
...
* add a test for a word only input
* make LukeForMaskedLM work without entity inputs
* update test
* add LukeForMaskedLM to MODEL_FOR_MASKED_LM_MAPPING_NAMES
* restore pyproject.toml
* empty line at the end of pyproject.toml
2022-06-01 10:03:06 -04:00
Sylvain Gugger
4390151ba2
Fix MP and CPU offload tests for Funnel and GPT-Neo ( #17503 )
2022-06-01 09:59:40 -04:00
Sylvain Gugger
6813439fdc
Exclude Databricks from notebook env ( #17496 )
2022-06-01 09:00:11 -04:00
Will Frey
3042ea4f6f
Fix tokenizer
type annotation in pipeline(...)
( #17500 )
...
I think you mean to accept either an instance of `PreTrainedTokenizer` or `PreTrainedTokenizerFast` inside of the `pipeline(...)` factory function, if the `tokenizer` argument isn't a `str`.
2022-06-01 08:43:28 -04:00
amyeroberts
bdc01711d6
Refactor classes to inherit from nn.Module instead of nn.Sequential ( #17493 )
...
* Adapt Maskformer, VAN, ResNet and RegNet modules to inherit from nn.Module
2022-06-01 13:36:19 +01:00
nilboy
b1160c0b56
Fix wav2vec2 export onnx model with attention_mask error ( #16004 )
...
* Fix wav2vec2 export onnx model with attention_mask error
* fix repository_consistency
2022-06-01 13:30:58 +02:00
Xing Han Lu
d91da4c6df
Add warning when using older version of torch for ViltFeatureExtractor ( #16756 )
...
* Update feature_extraction_vilt.py
* apply black
* Update imports
* Change warning to logging
* Use logger instead of logging.logging
* make fixup
* Move error message
* Update src/transformers/models/vilt/feature_extraction_vilt.py
Co-authored-by: Xing Han Lu <xhlperso@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2022-06-01 07:15:38 -04:00
Kyeongpil Kang
24092b1464
Fix typo of variable names for key and query projection layer ( #17155 )
...
self.pos_proj and self.pos_q_proj should be changed to self.pos_key_proj and self.pos_query_proj as same as PyTorch implements.
2022-06-01 11:38:44 +01:00
Jimin Park
811da2b8c2
Fixed wrong error message for missing weight file ( #17216 )
2022-06-01 06:24:20 -04:00
Ruihua Fang
4f38808e9e
Add OnnxConfig for SqueezeBert iss17314 ( #17315 )
...
* add onnx config for SqueezeBert
* add test for onnx config for SqueezeBert
* add automatically updated doc for onnx config for SqueezeBert
* Update src/transformers/onnx/features.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* Update src/transformers/models/squeezebert/configuration_squeezebert.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
2022-06-01 06:16:15 -04:00
Patrick von Platen
ba286fe7d5
[GPT2Tokenizer] Fix GPT2 with bos token ( #17498 )
2022-05-31 20:06:48 +02:00
Arthur
7822a9b7a7
Opt in flax and tf ( #17388 )
...
* initial commit
* add init file
* update globakl init
* update index and dummy objects
* style
* update modelling auto
* fix initi typo in src/transformers
* fix typo in modeling tf auto, opt was in wrong mapping name
* fixed a slow test : saved_model
* style
* fix positionnal embedding if no position id is provided
* update tf test
* update test flax requirements
* fixed serialization
* update
* update tf name to allow smooth convertion
* update flax tests
* style
* fix test typo
* fix tf typo test
* add xla for generate support in causal LM
* fixed bug
* cleaned tf tests
* style
* removed from PT for slow tests
* fix typp
* opt test as slow
* trying to fix GPT2 undefined
* correct documentation and add to test doc
* update tf doc
* fix doc
* fake commit
* Apply suggestions from code review
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* update test based on review
* merged main layer for functionning test
* fixup + quality
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* update long comment
* make fix copies
Co-authored-by: Arthur <arthur@huggingface.co>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-05-31 18:41:22 +02:00
Patrick von Platen
f394a2a50d
[Json configs] Make json prettier for all saved tokenizer files & ensure same json format for all processors (tok + feat_extract) ( #17457 )
...
* [Json dump] Make json prettier
* correct more tokenizeirs
* more patterns
* add aggressive test
* the aggressive test was actually useful :-)
* more tests
* Apply suggestions from code review
2022-05-31 17:07:30 +02:00
Vít Novotný
6ee1474b67
Accumulate tokens into batches in PreTrainedTokenizerBase.add_tokens()
( #17119 )
...
* Accumulate tokens into batches in PreTrainedTokenizerBase.add_tokens()
For tokenizers with a small number of special tokens or special tokens
with consecutive token IDs, this reduces the time complexity of creating
the trie from quadratic to linear, see also #16936 .
* Extend explanation of batching added tokens
2022-05-31 16:36:45 +02:00
Patrick von Platen
52e7c92920
Add HF.co for PRs / Issues regarding specific model checkpoints ( #17485 )
...
* Add HF.co for PRs / Issues regarding specific model checkpoints
* Update .github/ISSUE_TEMPLATE/config.yml
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
2022-05-31 15:58:39 +02:00
Martina Fumanelli
dfc38463b8
Setup for Italian translation and add quicktour.mdx translation ( #17472 )
...
* Setup for Italian translation and add first document
- Add 'it' folder for files translated into Italian
- Add _config.py and _toctree.yml files
- Add translation of quicktour.mdx
* Fix style issue of italian documentation files
* Add 'it' to the languages section in the .github/workflows
* Remove - installation from _toctree for Italian
* Translation for index file
- Add index to _toctree.yml
- Add translation of index.mdx
* Fix typo in docs/source/it/index.mdx
* Translate code comments in docs/source/it/_config.py
Co-authored-by: Martina Fumanelli <martinafumanelli@Martinas-MBP.homenet.telecomitalia.it>
2022-05-31 09:57:43 -04:00
Yih-Dar
8f8b3cbce4
Fix checkpoint name ( #17484 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-05-31 15:40:48 +02:00
Yih-Dar
400b30936a
Docker image build in parallel ( #17434 )
...
* docker image build in parallel
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-05-31 15:39:03 +02:00
Ritik Nandwal
5af38953bb
Added XLM onnx config ( #17030 )
...
* Add onnx configuration for xlm
* Add supported features for xlm
* Add xlm to models exportable with onnx
* Add xlm architecture to test file
* Modify docs
* Make code quality fixes
2022-05-31 09:26:06 -04:00
Sylvain Gugger
567d9c061d
Disk offload fix ( #17428 )
...
* Fix offload to disk for big models
* Add test
* Fix test for other models
2022-05-31 09:16:18 -04:00