Sylvain Gugger
d76d2a2af7
Make sure telemetry arguments are not returned as unused kwargs ( #17063 )
...
* Make sure telemetry arguments are not returned as unused kwargs
* Fix test
2022-05-04 07:47:57 -04:00
lewtun
675e2d1663
Remove masked image modeling from BEIT ONNX export ( #16980 )
...
* Add masked image modelling to task mapping
* Refactor ONNX features to be listed alphabetically
* Add warning about BEiT masked image modeling
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-05-04 10:05:24 +02:00
lewtun
4bb1d0ec84
Skip RoFormer ONNX test if rjieba not installed ( #16981 )
...
* Skip RoFormer ONNX test if rjieba not installed
* Update deps table
* Skip RoFormer serialization test
* Fix RoFormer vocab
* Add rjieba to CircleCI
2022-05-04 10:04:10 +02:00
Thomas Wang
db034660fb
Fix hashing for deduplication ( #17048 )
2022-05-04 08:40:24 +02:00
Pavel Belevich
39f8eafc1b
Remove device parameter from create_extended_attention_mask_for_decoder ( #16894 )
2022-05-03 11:06:11 -04:00
Sylvain Gugger
dd739f7045
Remove fetch in model templates test
2022-05-03 10:49:12 -04:00
Sylvain Gugger
1c9fcd0e04
Fix RNG reload in resume training from epoch checkpoint ( #17055 )
...
* Fix RNG reload in resume training from epoch checkpoint
* Fix test
2022-05-03 10:31:24 -04:00
Sylvain Gugger
6e17ba6aa5
Remove Python and use v2 action ( #17059 )
2022-05-03 10:12:17 -04:00
Sylvain Gugger
a8fa2f91f4
Make Trainer compatible with sharded checkpoints ( #17053 )
...
* Make Trainer compatible with sharded checkpoints
* Add doc
2022-05-03 09:55:10 -04:00
Yih-Dar
19420fd99e
Move test model folders ( #17034 )
...
* move test model folders (TODO: fix imports and others)
* fix (potentially partially) imports (in model test modules)
* fix (potentially partially) imports (in tokenization test modules)
* fix (potentially partially) imports (in feature extraction test modules)
* fix import utils.test_modeling_tf_core
* fix path ../fixtures/
* fix imports about generation.test_generation_flax_utils
* fix more imports
* fix fixture path
* fix get_test_dir
* update module_to_test_file
* fix get_tests_dir from wrong transformers.utils
* update config.yml (CircleCI)
* fix style
* remove missing imports
* update new model script
* update check_repo
* update SPECIAL_MODULE_TO_TEST_MAP
* fix style
* add __init__
* update self-scheduled
* fix add_new_model scripts
* check one way to get location back
* python setup.py build install
* fix import in test auto
* update self-scheduled.yml
* update slack notification script
* Add comments about artifact names
* fix for yolos
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-05-03 14:42:02 +02:00
Sanchit Gandhi
cd9274d010
[FlaxBert] Add ForCausalLM ( #16995 )
...
* [FlaxBert] Add ForCausalLM
* make style
* fix output attentions
* Add RobertaForCausalLM
* remove comment
* fix fx-to-pt model loading
* remove comment
* add modeling tests
* add enc-dec model tests
* add big_bird
* add electra
* make style
* make repo-consitency
* add to docs
* remove roberta test
* quality
* amend cookiecutter
* fix attention_mask bug in flax bert model tester
* tighten pt-fx thresholds to 1e-5
* add 'copied from' statements
* amend 'copied from' statements
* amend 'copied from' statements
* quality
2022-05-03 11:26:19 +02:00
Patrick von Platen
31616b8d61
[T5 Tokenizer] Model has no fixed position ids - there is no hardcode… ( #16990 )
...
* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length
* [T5 Tokenizer] Model has no fixed position ids - there is no hardcoded max length
* correct t5 tokenizer
* correct t5 tokenizer
* fix test
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* finish
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-05-02 21:27:34 +02:00
Sylvain Gugger
1073f00d4e
Clean up setup.py ( #17045 )
...
* Clean up setup.py
* Trigger CI
* Upgrade Python used
2022-05-02 12:58:17 -04:00
Lysandre Debut
30ca529902
Make the sacremoses dependency optional ( #17049 )
...
* Make sacremoses optional
* Pickle
2022-05-02 12:47:47 -04:00
Lysandre Debut
bb2e088be7
Allow all imports from transformers ( #17050 )
2022-05-02 12:47:39 -04:00
NielsRogge
1ac698744c
Add YOLOS ( #16848 )
...
* First draft
* Add YolosForObjectDetection
* Make forward pass work
* Add mid position embeddings
* Add interpolation of position encodings
* Add expected values
* Add YOLOS to tests
* Add integration test
* Support tiny model as well
* Support all models in conversion script
* Remove mid_pe_size attribute
* Make more tests pass
* Add model to README and fix config
* Add copied from statements
* Rename base_model_prefix to vit
* Add missing YOLOS_PRETRAINED_CONFIG_ARCHIVE_MAP
* Apply suggestions from code review
* Apply more suggestions from code review
* Convert remaining checkpoints
* Improve docstrings
* Add YolosFeatureExtractor
* Add feature extractor to docs
* Add corresponding tests
* Fix style
* Fix docs
* Apply suggestion from code review
* Fix bad rebase
* Fix some more bad rebase
* Fix missing character
* Improve docs and variable names
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-05-02 18:30:55 +02:00
Zachary Mueller
f275e593bf
Fix no_trainer examples to properly calculate the number of samples ( #17046 )
...
* Update all examples to properly calculate progress bar
2022-05-02 11:56:25 -04:00
Zachary Mueller
35d48db881
Update no_trainer examples to use new logger ( #17044 )
...
* Propagate and fix imports
2022-05-02 11:56:15 -04:00
calpt
daecae1f1c
[Trainer] Move logic for checkpoint loading into separate methods for easy overriding ( #17043 )
2022-05-02 10:40:37 -04:00
NielsRogge
2de2c9ecca
Clean up vision tests ( #17024 )
...
* Clean up tests
* Make fixup
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
2022-05-02 16:28:58 +02:00
Sylvain Gugger
4be8b95a9f
Disable Flax GPU tests on push ( #17042 )
2022-05-02 10:25:53 -04:00
yujun
bdd690a74d
add torch.no_grad when in eval mode ( #17020 )
...
* add torch.no_grad when in eval mode
* make style quality
2022-05-02 07:49:19 -04:00
Martin Pömsl
9586e222af
Fix typo in RetriBERT docstring ( #17018 )
2022-05-02 07:48:20 -04:00
Sanchit Gandhi
93b802c43e
[Flax(Speech)EncoderDecoder] Fix bug in decoder_module
( #17036 )
...
* [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module`
* [FlaxEncoderDecoder] Fix bug in `decoder_module`
2022-05-02 13:06:45 +02:00
Sylvain Gugger
1ae182d9a6
Fix style
2022-05-02 06:19:31 -04:00
Michael Benayoun
2c2a2169b6
Fx with meta ( #16836 )
...
* Add meta proxy
* Uses meta data to trace data dependent control-flow
* Remove commented class
* Handles torch creating functions
* Added type annotation to fix tracing
* Tracing works for everything but T5 and GPT-J
* Almost all previously supported models pass
* All architectures can be traced except T5
* Intermediate commit to have a trace of the comparison operators for HFProxy
* Everything works, except loss computation
* Everything works
* Removed unused import
* Overriden methods do not use underlying ops (linear and torch.matmul), and model attributes are copied to the traced version
* Fix torch_matmul_override
* Change attributes reference to deepcopy
* Remove breakpoint and add torch_index_override
* Small fix
* Fix typo
* Replace asserts by explicit exceptions
2022-05-02 11:46:52 +02:00
Sanchit Gandhi
ff846e9b28
[FlaxGenerate] Fix bug in decoder_start_token_id ( #17035 )
2022-05-02 11:05:27 +02:00
Manan Dey
eb877f1fd0
update docs of length_penalty ( #17022 )
2022-05-02 11:01:18 +02:00
Omar U. Espejel
da47c264f9
Add translating guide ( #17004 )
...
* Add translating guide
2022-04-30 17:43:38 -05:00
Yih-Dar
ede5e04191
Add a check on config classes docstring checkpoints ( #17012 )
...
* Add the check
* add missing ckpts
* add a list to ignore
* call the added check script
* better regex pattern
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-30 10:40:46 +02:00
Sylvain Gugger
7152ed2bae
Result of new doc style with fixes ( #17015 )
...
* Result of new doc style with fixes
* Add last two files
* Bump hf-doc-builder
2022-04-29 17:42:15 -04:00
Sylvain Gugger
18df440709
Replace dict/BatchEncoding instance checks by Mapping ( #17014 )
...
* Replace dict/BatchEncoding instance checks by Mapping
* Typo
2022-04-29 17:20:52 -04:00
Nicolas Patry
b8dffd1f3e
Revert "Updating variable names. ( #16445 )" ( #17011 )
...
This reverts commit 4f3a14e3c2
.
2022-04-29 12:26:45 -04:00
Nicolas Patry
4f3a14e3c2
Updating variable names. ( #16445 )
2022-04-29 17:44:28 +02:00
tarzan
20fb5d51ea
Update README_zh-hans.md ( #16977 )
2022-04-29 11:05:03 -04:00
Pavel Belevich
63fbed5c59
Make create_extended_attention_mask_for_decoder static method ( #16893 )
2022-04-29 10:57:09 -04:00
Joao Gante
fb0ae12947
TF: XLA bad words logits processor and list of processors ( #16974 )
2022-04-29 15:54:58 +01:00
Zachary Mueller
57e6464ac9
Update all require decorators to use skipUnless when possible ( #16999 )
2022-04-29 08:55:38 -04:00
Yih-Dar
e952e049b4
use scale=1.0 in floats_tensor called in speech model testers ( #17007 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-29 14:41:33 +02:00
Sylvain Gugger
e6f00a11d7
Update README to latest release ( #16997 )
2022-04-28 14:17:44 -04:00
Zachary Mueller
3486a92a57
Fix savedir for by epoch ( #16996 )
2022-04-28 13:49:45 -04:00
Yih-Dar
5af5735f62
set eos_token_id to None to generate until max length ( #16989 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-28 19:47:38 +02:00
amyeroberts
01562dac7e
Rename a class to reflect framework pattern AutoModelXxx -> TFAutoModelXxx ( #16993 )
2022-04-28 18:11:54 +01:00
conan1024hao
1be8d56ec6
Add parameter --config_overrides for run_mlm_wwm.py ( #16961 )
...
* dd parameter --config_overrides for run_mlm_wwm.py
* linter
2022-04-28 10:44:55 -04:00
Yih-Dar
1f9e862507
Update check_models_are_tested to deal with Windows path ( #16973 )
...
* fix
* Apply suggestions from code review
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-04-28 15:31:57 +02:00
Dat Quoc Nguyen
dced262409
Update tokenization_bertweet.py ( #16941 )
...
The emoji version must be either 0.5.4 or 0.6.0. Newer emoji versions have been updated to newer versions of the Emoji Charts, thus not consistent with the one used for pre-processing the pre-training Tweet corpus (i.e. not consistent with the vocab).
2022-04-27 16:54:31 -04:00
Yih-Dar
992996e9ca
Add -e flag to some GH workflow yml files ( #16959 )
...
* Add -e flag
* add check
* create new keys
* run python setup.py build install
* add comments
* change to develop
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-27 21:44:21 +02:00
Yih-Dar
596afb4297
Fix check_all_models_are_tested ( #16970 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
2022-04-27 21:18:29 +02:00
Sylvain Gugger
691cdbb7d7
Fix doc notebooks links ( #16969 )
...
* Fix doc notebooks links
* Remove missing section
2022-04-27 14:59:53 -04:00
Zachary Mueller
60e1d883f1
Fixup no_trainer save logic ( #16968 )
...
* Fixup all examples
2022-04-27 14:46:49 -04:00