jaymody
858b1d1e5a
allow an already created tensorboard SummaryWriter be passed to Trainer
2020-05-04 19:58:24 -04:00
Patrick von Platen
8e67573a64
[EncoderDecoder Tests] Improve tests ( #4046 )
...
* Hoist bert model tester for patric
* indent
* make tests work
* Update tests/test_modeling_bert.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: sshleifer <sshleifer@gmail.com>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-05-04 02:18:36 +02:00
Lorenzo Ampil
6af3306a1d
Add decoder specific error message for T5Stack.forward ( #4128 )
2020-05-03 12:40:08 +02:00
Zhiyu Lin
1cdd2ad2af
Fix #2941 ( #4109 )
...
* Fix of issue #2941
Reshaped score array to avoid `numpy` ValueError.
* Update src/transformers/pipelines.py
* Update src/transformers/pipelines.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-05-02 11:20:30 -04:00
Manuel Romero
5f4f6b65b3
distilroberta-base-finetuned-sentiment ( #4115 )
...
* Create model card
Create Model card for distilroberta-base-finetuned-sentiment
* Update model_cards/mrm8488/distilroberta-base-finetuned-sentiment/README.md
* Update model_cards/mrm8488/distilroberta-base-finetuned-sentiment/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-05-02 11:19:31 -04:00
Suraj Parmar
7da051f135
model card for surajp/albert-base-sanskrit ( #4114 )
...
* Create README.md
* Update model_cards/surajp/albert-base-sanskrit/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-05-02 11:15:39 -04:00
Zhen Wang
14911e2e12
Create README.md ( #4112 )
2020-05-02 10:52:12 -04:00
HUSEIN ZOLKEPLI
9e97c87539
Added huseinzol05/gpt2-345M-bahasa-cased ( #4102 )
2020-05-02 10:51:15 -04:00
William Falcon
4c5bd92183
Update run_pl_glue.py ( #4117 )
2020-05-02 10:38:30 -04:00
William Falcon
5282b31df4
Update run_pl_ner.py ( #4118 )
2020-05-02 10:38:21 -04:00
Stefan Schweter
1e616c0af3
NER: parse args from .args file or JSON ( #4110 )
...
* ner: parse args from .args file or JSON
* examples: mention json-based configuration file support for run_ner script
2020-05-02 10:29:17 -04:00
Patrick von Platen
abb1fa3f37
Update README.md
2020-05-02 10:32:00 +02:00
Patrick von Platen
0ccbfd2868
Update Reformer ReadME
2020-05-02 10:31:00 +02:00
Patrick von Platen
2d8340a91f
[Reformer] Move model card to google model ( #4113 )
...
* correct model card
* remove model card from patrick von platen
2020-05-02 10:25:22 +02:00
Julien Chaumond
d713cfc5eb
GePpeTto 🇮🇹 : Fixpath to model card
2020-05-01 11:48:58 -04:00
Lorenzo De Mattei
f3d44301cc
GePpeTto model 🇮🇹 ( #4099 )
...
* Create GePpeTto.md
* Update model_cards/LorenzoDeMattei/GePpeTto.md
* Update model_cards/LorenzoDeMattei/GePpeTto.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-05-01 11:46:42 -04:00
Julien Chaumond
27d55125e6
Configs: saner num_labels in configs. ( #3967 )
2020-05-01 11:28:55 -04:00
Stefan Schweter
e80be7f1d0
docs: add xlm-roberta section to multi-lingual section ( #4101 )
2020-05-01 11:06:58 -04:00
Sam Shleifer
18db92dd9a
[testing] add timeout_decorator ( #3543 )
2020-05-01 09:05:47 -04:00
Julien Chaumond
b8686174be
Merge pull request #3934 from huggingface/examples_args_from_files
...
[qol] example scripts: parse args from .args file or JSON
2020-04-30 22:40:13 -04:00
Julien Chaumond
f39217a5ec
[tests] Light cleanup of tempfile in tests/
2020-04-30 22:30:15 -04:00
Julien Chaumond
f54dc3f4d5
[ci] Load pretrained models into the default (long-lived) cache
...
There's an inconsistency right now where:
- we load some models into CACHE_DIR
- and some models in the default cache
- and often, in both for the same models
When running the RUN_SLOW tests, this takes a lot of disk space, time, and bandwidth.
I'd rather always use the default cache
2020-04-30 22:30:15 -04:00
Scottish_Fold007
6b410bedfc
Model Card: gaochangkuan README.md ( #4033 )
...
* Create README.md
* Update README.md
* tweak
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-04-30 22:26:58 -04:00
husein zolkepli
8829ace4aa
added gpt2 117m bahasa readme
...
(cherry picked from commit a4a673a1d0
)
2020-04-30 22:20:00 -04:00
Benjamin Muller
1851a64b6f
create model_card camembert-base-wikipedia-4gb
2020-04-30 22:16:12 -04:00
Benjamin Muller
443e5e34af
Create README.md
2020-04-30 22:16:00 -04:00
Benjamin Muller
60e1556a44
Create model_card camembert-base-ccnet-4gb
2020-04-30 22:15:47 -04:00
Benjamin Muller
fa9365eca5
Create README.md
2020-04-30 22:15:38 -04:00
Benjamin Muller
afe002b04c
Create README.md
2020-04-30 22:15:23 -04:00
Suraj Parmar
8b5e5ebcf9
Continue training args and tqdm in notebooks ( #3939 )
...
* Continue training args
* Continue training args
* added explaination
* added explaination
* added explaination
* Fixed tqdm auto
* Update src/transformers/training_args.py
Co-Authored-By: Julien Chaumond <chaumond@gmail.com>
* Update src/transformers/training_args.py
* Update src/transformers/training_args.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-04-30 22:14:08 -04:00
Julien Chaumond
ab90353f1a
[cli] {login, upload, s3} display more helpful error messages
2020-04-30 12:51:06 -04:00
Julien Chaumond
452dd0e4d9
[ci] Align test_hf_api.py with API change
2020-04-30 12:06:01 -04:00
Jordan
7f9193ef09
Fixed Style Inconsistency ( #3976 )
2020-04-30 14:33:09 +02:00
Jared T Nielsen
64070cbb88
Fix TF input docstrings to refer to tf.Tensor rather than torch.FloatTensor. ( #4051 )
2020-04-30 14:28:56 +02:00
Lysandre Debut
e73595bd64
Remove jitted method so that our models are pickable. ( #4050 )
2020-04-29 09:53:19 -04:00
Sam Shleifer
2c77842887
[Fix common tests on GPU] send model, ids to torch_device ( #4014 )
2020-04-29 09:47:20 -04:00
Julien Chaumond
6faca88ee0
Align MarianMT with #4030
...
cc @sshleifer
2020-04-28 20:35:20 -04:00
Julien Chaumond
211e130811
[github] Issue templates: populate some labels
...
cc @bramvanroy @stefan-it
2020-04-28 20:34:34 -04:00
Julien Chaumond
455c639093
CDN urls ( #4030 )
...
* [file_utils] use_cdn + documentation
* Move to cdn. urls for weights
* [urls] Hotfix for bert-base-japanese
2020-04-28 20:27:14 -04:00
Thomas Wolf
8ba4c5885f
Allow a more backward compatible behavior of max_len_single_sentence and max_len_sentences_pair ( #3994 )
...
* Allow a more backward compatible behavior of max_len_single_sentence and max_len_sentences_pair and
* The style and quality are now top-notch
2020-04-29 01:13:59 +02:00
Sam Shleifer
847e7f3379
MarianMTModel.from_pretrained('Helsinki-NLP/opus-marian-en-de') ( #3908 )
...
Co-Authored-By: Stefan Schweter <stefan@schweter.it>
2020-04-28 18:22:37 -04:00
Sam Shleifer
d714dfeaa8
[isort] add known 3rd party to setup.cfg ( #4053 )
...
* add known 3rd party to setup.cfg
* comment
* Update CONTRIBUTING.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-04-28 17:12:00 -04:00
MichalMalyska
d52b0e294a
Minor Readme Fixes ( #4056 )
...
Added contact info and fixed typos.
2020-04-28 16:42:15 -04:00
Alex Combessie
55adefe428
Add license information to model cards ( #3864 )
...
Close #3357
2020-04-28 16:40:21 -04:00
ydaigo
0ac6d0bf33
Create README.md
...
I create japanese binary classification.
2020-04-28 15:35:30 -04:00
Louis MARTIN
c73c83b0e6
Small cosmetic changes to CamemBERT model card
2020-04-28 15:32:55 -04:00
Bogdan Kostić
4a94c062a4
Provide model card for roberta-base-squad2-covid
2020-04-28 15:29:30 -04:00
jazzcook15
c7d06b79ae
Fix #3954 - GPT2 is not traceable ( #3955 )
...
* Update sqrt computation so it can survive a torch.jit.trace
* Update modeling_gpt2.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2020-04-28 21:18:56 +02:00
Patrick von Platen
9a0a8c1c6f
add examples to doc ( #4045 )
2020-04-28 16:33:23 +02:00
Patrick von Platen
fa49b9afea
Clean Encoder-Decoder models with Bart/T5-like API and add generate possibility ( #3383 )
...
* change encoder decoder style to bart & t5 style
* make encoder decoder generation dummy work for bert
* make style
* clean init config in encoder decoder
* add tests for encoder decoder models
* refactor and add last tests
* refactor and add last tests
* fix attn masks for bert encoder decoder
* make style
* refactor prepare inputs for Bert
* refactor
* finish encoder decoder
* correct typo
* add docstring to config
* finish
* add tests
* better naming
* make style
* fix flake8
* clean docstring
* make style
* rename
2020-04-28 15:11:09 +02:00