ahotrod
ba498eac38
Create README.md ( #2785 )
...
* Create README.md
* Update README.md
* Update README.md
* Update README.md
* [model_cards] Use code fences for consistency
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-02-10 17:27:59 -05:00
Malte Pietsch
68ccc04ee6
Add model readme for deepset/roberta-base-squad2 ( #2797 )
...
* Add readme for deepset/roberta-base-squad2
* update model readme
2020-02-10 15:21:48 -05:00
Lysandre
539f601be7
intermediate_size > hidden_dim in distilbert config docstrings
2020-02-10 13:45:57 -05:00
Lysandre
cfb7d108bd
FlauBERT lang embeddings only when n_langs > 1
2020-02-10 13:24:04 -05:00
Julien Chaumond
b4691a438d
[model_cards] BERT-of-Theseus: use the visual as thumbnail
...
cc @jetrunner
Co-Authored-By: Kevin Canwen Xu <canwenxu@outlook.com>
2020-02-10 11:27:08 -05:00
Julien Chaumond
fc325e97cd
[model_cards] Showcase model tag syntax
2020-02-10 11:27:08 -05:00
Lysandre
fd639e5be3
Correct quickstart example when using the past
2020-02-10 11:25:56 -05:00
Julien Chaumond
63a5399bc4
[model_cards] Specify language meta + thumbnail
...
cc @tholor
see #2799
2020-02-10 11:20:05 -05:00
Lysandre
125a75a121
Correctly compute tokens when padding on the left
2020-02-10 10:47:42 -05:00
Malte Pietsch
9c64d1da35
Add model readme for bert-base-german-cased ( #2799 )
...
* add readme for bert-base-german-cased
* update readme
2020-02-10 10:27:29 -05:00
Kevin Canwen Xu
bf99014c46
Create BERT-of-Theseus model card
2020-02-10 09:58:40 -05:00
Thomas Wolf
92e974196f
Merge pull request #2765 from huggingface/extract-cached-archives
...
Add option to `cached_path` to automatically extract archives
2020-02-10 14:05:16 +01:00
Morgan Funtowicz
6aa7973aec
Fix circleci cuInit error on Tensorflow >= 2.1.0.
...
Tensorflow 2.1.0 introduce a new dependency model where pip install tensorflow would install tf with GPU support.
Before it would just install with CPU support, thus CircleCI is looking for NVidia driver version at initialization of the
tensorflow related tests but fails as their is no NVidia Driver running.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
2020-02-10 13:24:37 +01:00
Lysandre
520e7f2119
Correct docstring for xlnet
2020-02-07 16:42:35 -05:00
Lysandre
dd28830327
Update RoBERTa tips
2020-02-07 16:42:35 -05:00
Lysandre
db97930122
Update XLM-R tips
2020-02-07 16:42:35 -05:00
Lysandre
7046de2991
E231
2020-02-07 15:28:13 -05:00
VictorSanh
0d3aa3c04c
styling
2020-02-07 15:28:13 -05:00
VictorSanh
d8b43600fd
omission
2020-02-07 15:28:13 -05:00
VictorSanh
ee5a6856ca
distilbert-base-cased weights + Readmes + omissions
2020-02-07 15:28:13 -05:00
monologg
73368963b2
Fix importing unofficial TF models with extra optimizer weights
2020-02-07 10:25:31 -05:00
Ari
d7dabfeff5
Fix documentation in ProjectedAdaptiveLogSoftmax
2020-02-07 10:14:58 -05:00
Julien Chaumond
42f08e596f
[examples] rename run_lm_finetuning to run_language_modeling
2020-02-07 09:15:28 -05:00
Julien Chaumond
4f7bdb0958
[examples] Fix broken markdown
2020-02-07 09:15:28 -05:00
thomwolf
c6c5c3fd4e
style and quality
2020-02-07 08:58:06 +01:00
thomwolf
961c69776f
@julien-c proposal for TF/PT compat in hf_buckets
2020-02-07 08:53:17 +01:00
thomwolf
d311f87bca
cleanup
2020-02-07 00:05:28 +01:00
thomwolf
7d99e05f76
file_cache has options to extract archives
2020-02-07 00:03:12 +01:00
dchurchwell
2c12464a20
Changed vocabulary save function. Variable name was inconsistent, causing an error to be thrown when passing a file name instead of a directory.
2020-02-06 16:40:07 -05:00
Peter Izsak
6fc3d34abd
Fix multi-gpu evaluation in run_glue.py
2020-02-06 16:38:55 -05:00
Julien Chaumond
7748cbbe7d
Oopsie
2020-02-06 15:30:02 -05:00
Julien Chaumond
432c12521e
[docs] Add menu w/ links to other pages on hf.co
2020-02-06 15:30:02 -05:00
Clement
c069932f5d
Add contributors snapshot
...
powered by https://github.com/sourcerer-io/hall-of-fame
2020-02-06 15:25:47 -05:00
Lysandre Debut
33d3072e1c
Arxiv README ( #2747 )
...
* Arxiv README
* ArXiv-NLP readme
2020-02-05 15:26:28 -05:00
Julien Chaumond
eae8ee0389
[doc] model sharing: mention README.md + tweaks
...
cc @lysandrejik @thomwolf
2020-02-05 14:20:03 -05:00
James Betker
6bb6a01765
Fix GPT2 config set to trainable
...
This prevents the model from being saved, and who knows
what else.
2020-02-05 13:55:41 -05:00
Julien Chaumond
ada24def22
[run_lm_finetuning] Tweak fix for non-long tensor, close #2728
...
see 1ebfeb7946
and #2728
Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2020-02-05 12:49:18 -05:00
Lysandre
2184f87003
RoBERTa TensorFlow Tests
2020-02-04 18:05:35 -05:00
Lysandre
e615269cb8
Correct slow test
2020-02-04 18:05:35 -05:00
Lysandre
5f96ebc0be
Style
2020-02-04 18:05:35 -05:00
Lysandre
950c6a4f09
Flaubert PyTorch tests
2020-02-04 18:05:35 -05:00
Lysandre
d28b81dc29
RoBERTa Pytorch tests
2020-02-04 18:05:35 -05:00
Yuval Pinter
d1ab1fab1b
pass langs parameter to certain XLM models ( #2734 )
...
* pass langs parameter to certain XLM models
Adding an argument that specifies the language the SQuAD dataset is in so language-sensitive XLMs (e.g. `xlm-mlm-tlm-xnli15-1024`) don't default to language `0`.
Allows resolution of issue #1799 .
* fixing from `make style`
* fixing style (again)
2020-02-04 17:12:42 -05:00
sshleifer
9e5b549b4d
fix default getattr
2020-02-04 16:38:52 -05:00
sshleifer
25848a6094
double quotes
2020-02-04 16:38:52 -05:00
sshleifer
cbcb83f21d
minor cleanup of test_attention_outputs
2020-02-04 16:38:52 -05:00
Lysandre
3bf5417258
Revert erroneous fix
2020-02-04 16:31:07 -05:00
Lysandre
1ebfeb7946
Cast to long when masking tokens
2020-02-04 15:56:16 -05:00
Lysandre
9c67196b83
Update quickstart
2020-02-04 11:11:37 -05:00
Lysandre
90ab15cb7a
Remove redundant hidden states
2020-02-04 10:59:32 -05:00