Julien Chaumond
f39217a5ec
[tests] Light cleanup of tempfile in tests/
2020-04-30 22:30:15 -04:00
Julien Chaumond
f54dc3f4d5
[ci] Load pretrained models into the default (long-lived) cache
...
There's an inconsistency right now where:
- we load some models into CACHE_DIR
- and some models in the default cache
- and often, in both for the same models
When running the RUN_SLOW tests, this takes a lot of disk space, time, and bandwidth.
I'd rather always use the default cache
2020-04-30 22:30:15 -04:00
Scottish_Fold007
6b410bedfc
Model Card: gaochangkuan README.md ( #4033 )
...
* Create README.md
* Update README.md
* tweak
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-04-30 22:26:58 -04:00
husein zolkepli
8829ace4aa
added gpt2 117m bahasa readme
...
(cherry picked from commit a4a673a1d0
)
2020-04-30 22:20:00 -04:00
Benjamin Muller
1851a64b6f
create model_card camembert-base-wikipedia-4gb
2020-04-30 22:16:12 -04:00
Benjamin Muller
443e5e34af
Create README.md
2020-04-30 22:16:00 -04:00
Benjamin Muller
60e1556a44
Create model_card camembert-base-ccnet-4gb
2020-04-30 22:15:47 -04:00
Benjamin Muller
fa9365eca5
Create README.md
2020-04-30 22:15:38 -04:00
Benjamin Muller
afe002b04c
Create README.md
2020-04-30 22:15:23 -04:00
Suraj Parmar
8b5e5ebcf9
Continue training args and tqdm in notebooks ( #3939 )
...
* Continue training args
* Continue training args
* added explaination
* added explaination
* added explaination
* Fixed tqdm auto
* Update src/transformers/training_args.py
Co-Authored-By: Julien Chaumond <chaumond@gmail.com>
* Update src/transformers/training_args.py
* Update src/transformers/training_args.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-04-30 22:14:08 -04:00
Julien Chaumond
ab90353f1a
[cli] {login, upload, s3} display more helpful error messages
2020-04-30 12:51:06 -04:00
Julien Chaumond
452dd0e4d9
[ci] Align test_hf_api.py with API change
2020-04-30 12:06:01 -04:00
Jordan
7f9193ef09
Fixed Style Inconsistency ( #3976 )
2020-04-30 14:33:09 +02:00
Jared T Nielsen
64070cbb88
Fix TF input docstrings to refer to tf.Tensor rather than torch.FloatTensor. ( #4051 )
2020-04-30 14:28:56 +02:00
Lysandre Debut
e73595bd64
Remove jitted method so that our models are pickable. ( #4050 )
2020-04-29 09:53:19 -04:00
Sam Shleifer
2c77842887
[Fix common tests on GPU] send model, ids to torch_device ( #4014 )
2020-04-29 09:47:20 -04:00
Julien Chaumond
6faca88ee0
Align MarianMT with #4030
...
cc @sshleifer
2020-04-28 20:35:20 -04:00
Julien Chaumond
211e130811
[github] Issue templates: populate some labels
...
cc @bramvanroy @stefan-it
2020-04-28 20:34:34 -04:00
Julien Chaumond
455c639093
CDN urls ( #4030 )
...
* [file_utils] use_cdn + documentation
* Move to cdn. urls for weights
* [urls] Hotfix for bert-base-japanese
2020-04-28 20:27:14 -04:00
Thomas Wolf
8ba4c5885f
Allow a more backward compatible behavior of max_len_single_sentence and max_len_sentences_pair ( #3994 )
...
* Allow a more backward compatible behavior of max_len_single_sentence and max_len_sentences_pair and
* The style and quality are now top-notch
2020-04-29 01:13:59 +02:00
Sam Shleifer
847e7f3379
MarianMTModel.from_pretrained('Helsinki-NLP/opus-marian-en-de') ( #3908 )
...
Co-Authored-By: Stefan Schweter <stefan@schweter.it>
2020-04-28 18:22:37 -04:00
Sam Shleifer
d714dfeaa8
[isort] add known 3rd party to setup.cfg ( #4053 )
...
* add known 3rd party to setup.cfg
* comment
* Update CONTRIBUTING.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-04-28 17:12:00 -04:00
MichalMalyska
d52b0e294a
Minor Readme Fixes ( #4056 )
...
Added contact info and fixed typos.
2020-04-28 16:42:15 -04:00
Alex Combessie
55adefe428
Add license information to model cards ( #3864 )
...
Close #3357
2020-04-28 16:40:21 -04:00
ydaigo
0ac6d0bf33
Create README.md
...
I create japanese binary classification.
2020-04-28 15:35:30 -04:00
Louis MARTIN
c73c83b0e6
Small cosmetic changes to CamemBERT model card
2020-04-28 15:32:55 -04:00
Bogdan Kostić
4a94c062a4
Provide model card for roberta-base-squad2-covid
2020-04-28 15:29:30 -04:00
jazzcook15
c7d06b79ae
Fix #3954 - GPT2 is not traceable ( #3955 )
...
* Update sqrt computation so it can survive a torch.jit.trace
* Update modeling_gpt2.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-04-28 21:18:56 +02:00
Patrick von Platen
9a0a8c1c6f
add examples to doc ( #4045 )
2020-04-28 16:33:23 +02:00
Patrick von Platen
fa49b9afea
Clean Encoder-Decoder models with Bart/T5-like API and add generate possibility ( #3383 )
...
* change encoder decoder style to bart & t5 style
* make encoder decoder generation dummy work for bert
* make style
* clean init config in encoder decoder
* add tests for encoder decoder models
* refactor and add last tests
* refactor and add last tests
* fix attn masks for bert encoder decoder
* make style
* refactor prepare inputs for Bert
* refactor
* finish encoder decoder
* correct typo
* add docstring to config
* finish
* add tests
* better naming
* make style
* fix flake8
* clean docstring
* make style
* rename
2020-04-28 15:11:09 +02:00
Patrick von Platen
180585741c
[Generation] Generation should allow to start with empty prompt ( #3993 )
...
* fix empty prompt
* fix length in generation pipeline
2020-04-28 14:33:15 +02:00
Patrick von Platen
52679fbc2e
add dialogpt training tips ( #3996 )
2020-04-28 14:32:31 +02:00
Stefan Schweter
b5c6d3d4c7
notebooks: minor fix for community provided models example ( #4025 )
2020-04-28 09:12:25 +02:00
martindh
2fade302ac
camembert-base-fquad
...
Model card for illuin release of camembert-base-fquad
2020-04-27 18:29:55 -04:00
Manuel Romero
20c3b8cab4
Create model card
2020-04-27 18:27:46 -04:00
Manuel Romero
b3f272ffcb
Create model card
2020-04-27 18:27:04 -04:00
Nick Doiron
518f291eef
add model card for Hindi-BERT
2020-04-27 18:25:16 -04:00
monologg
d7b3bf547c
Model cards for KoELECTRA
2020-04-27 18:21:01 -04:00
Sai Saketh Aluru
db9d56c08a
Add modelcard for Hate-speech-CNERG/dehatebert-mono-arabic model ( #3979 )
...
* Add dehatebert-mono-arabic readme card
* Update dehatebert-mono-arabic model card
2020-04-27 18:18:54 -04:00
sshleifer
41750a6cff
Fix typos
2020-04-27 13:25:53 -04:00
Lorenzo Ampil
12bb7fe770
Fix t5 doc typos ( #3978 )
...
* Fix tpo in into and add line under
* Add missing blank line under
* Correct types under
2020-04-27 18:27:15 +02:00
Julien Chaumond
97a375484c
rm boto3 dependency
2020-04-27 11:17:14 -04:00
Txus
4e817ff418
Create README.md ( #3966 )
2020-04-25 09:16:40 -04:00
Junyi_Li
73d6a2f901
[model_cards] xlnet_chinese_large & roberta_chinese_large
2020-04-24 16:12:42 -04:00
Manuel Romero
623ba0236d
Create README.md ( #3882 )
2020-04-24 15:57:01 -04:00
Leandro von Werra
f4078e0db6
Feat/add model card ( #3923 )
...
* add model card for gpt2-imdb-ctrl
* fix title
* add sentiment control description
2020-04-24 10:24:28 -04:00
YuvalPeleg
03322b4261
Create README.md ( #3917 )
2020-04-24 10:24:00 -04:00
Julien Chaumond
c811526004
[examples] For convenience, also save the tokenizer
...
Close #3921
2020-04-24 09:52:42 -04:00
Cola
b0167632ce
Shuffle train subset for summarization example ( #3909 )
...
* Shuffle train subset
* Cleaner shuffle
2020-04-24 07:55:34 -04:00
Julien Chaumond
c53cc018de
[Trainer] Fix _rotate_checkpoints
...
Close #3920
2020-04-23 23:59:43 +00:00