Sam Shleifer
20fc18fbda
Skip flaky test_tf_question_answering ( #2845 )
...
* Skip flaky test
* Style
2020-02-18 16:14:50 -05:00
VictorSanh
2ae98336d1
fix vocab size in binarized_data (distil): int16 vs int32
2020-02-18 16:17:35 +00:00
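
Note on the int16 vs int32 fix above: a 16-bit unsigned integer can only hold token ids up to 65535, so a larger vocabulary needs a wider type. The following is a minimal sketch of picking the storage width from the vocabulary size, assuming that is the problem this commit addresses. `token_code` and `pack_ids` are hypothetical helpers written for illustration, not functions from the repository, and the sketch uses the standard-library `struct` module rather than whatever array type the script itself uses.

```python
import struct
from typing import List


def token_code(vocab_size: int) -> str:
    """Return a struct type code wide enough for every id below vocab_size.

    "H" is an unsigned 16-bit integer (max 65535); "i" is a signed 32-bit one.
    """
    return "H" if vocab_size < (1 << 16) else "i"


def pack_ids(ids: List[int], vocab_size: int) -> bytes:
    """Pack token ids as little-endian integers of the chosen width."""
    code = token_code(vocab_size)
    return struct.pack(f"<{len(ids)}{code}", *ids)


# A vocabulary under 65536 entries fits in 2 bytes per token id.
small = pack_ids([0, 30521], vocab_size=30522)

# A larger vocabulary needs 4 bytes per token id.
large = pack_ids([0, 119546], vocab_size=119547)

print(len(small), len(large))
```

Packing an id above 65535 with the 16-bit code raises `struct.error`, which is the kind of failure the width check is meant to avoid.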
VictorSanh
0dbddba6d2
fix typo in hans example call
2020-02-17 20:19:57 +00:00
Manuel Romero
29ab4b7f40
Create README.md
2020-02-17 10:58:43 -05:00
Stefan Schweter
c88ed74ccf
[model_cards] 🇹🇷 Add new (cased) BERTurk model
2020-02-17 09:54:46 -05:00
Thomas Wolf
5b2d4f2657
Merge pull request #2881 from patrickvonplaten/add_vim_swp_to_gitignore
...
update .gitignore to ignore .swp files created when using vim
2020-02-17 14:36:49 +01:00
Patrick von Platen
fb4d8d0832
update .gitignore to ignore .swp files created when using vim
2020-02-17 14:26:32 +01:00
Manuel Romero
6083c1566e
Update README.md
...
I trained the model for more epochs, which improved the results. This commit updates the reported results and adds a GIF showing the model used with **transformers/pipelines**.
2020-02-16 10:09:34 -05:00
Julien Chaumond
73028c5df0
[model_cards] EsperBERTo
2020-02-14 15:16:33 -05:00
Timo Moeller
81fb8d3251
Update model card: new performance chart ( #2864 )
...
* Update model performance for correct German conll03 dataset
* Adjust text
* Adjust line spacing
2020-02-14 13:39:23 -05:00
Julien Chaumond
4e69104a1f
[model_cards] Also use the thumbnail as meta
...
Co-Authored-By: Ilias Chalkidis <ihalk@di.uoa.gr>
2020-02-14 10:27:11 -05:00
Julien Chaumond
73d79d42b4
[model_cards] nlptown/bert-base-multilingual-uncased-sentiment
...
cc @yvespeirsman
Co-Authored-By: Yves Peirsman <yvespeirsman@users.noreply.github.com>
2020-02-14 09:51:11 -05:00
Yves Peirsman
47b735f994
Added model card for bert-base-multilingual-uncased-sentiment ( #2859 )
...
* Created model card for nlptown/bert-base-multilingual-sentiment
* Delete model card
* Created model card for bert-base-multilingual-uncased-sentiment as README
2020-02-14 09:31:15 -05:00
Julien Chaumond
7d22fefd37
[pipeline] Alias NerPipeline as TokenClassificationPipeline
2020-02-14 09:18:10 -05:00
Manuel Romero
61a2b7dc9d
Fix typo
2020-02-14 09:13:07 -05:00
Ilias Chalkidis
6e261d3a22
Fix typos
2020-02-14 09:11:07 -05:00
Manuel Romero
4e597c8e4d
Fix typo
2020-02-14 09:07:42 -05:00
Julien Chaumond
925a13ced1
[model_cards] mv README.md
2020-02-13 23:07:29 -05:00
Manuel Romero
575a3b7aa1
Create distill-bert-base-spanish-wwm-cased-finetuned-spa-squad2-es.md
2020-02-13 23:04:52 -05:00
Julien Chaumond
4d36472b96
[run_ner] Don't crash if fine-tuning local model that doesn't end with digit
2020-02-14 03:25:29 +00:00
Ilias Chalkidis
8514018300
Update with additional information
...
Added a "Pre-training details" section
2020-02-13 21:54:42 -05:00
Ilias Chalkidis
1eec69a900
Create README.md
2020-02-13 19:27:22 -05:00
Felix MIKAELIAN
8744402f1e
add model_card flaubert-base-uncased-squad ( #2833 )
...
* add model_card
* Add tag
cc @fmikaelian
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-02-13 17:19:13 -05:00
Severin Simmler
7f98edd7e3
Model card: Literary German BERT ( #2843 )
...
* feat: create model card
* chore: add description
* feat: stats plot
* Delete prosa-jahre.svg
* feat: years plot (again)
* chore: add more details
* fix: typos
* feat: kfold plot
* feat: kfold plot
* Rename model_cards/severinsimmler/literary-german-bert.md to model_cards/severinsimmler/literary-german-bert/README.md
* Support for linked images + add tags
cc @severinsimmler
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-02-13 15:43:44 -05:00
Joe Davison
f1e8a51f08
Preserve spaces in GPT-2 tokenizers ( #2778 )
...
* Preserve spaces in GPT-2 tokenizers
Preserves spaces after special tokens in GPT-2 and inherited (RoBERTa)
tokenizers, enabling correct BPE encoding. Automatically inserts a space
in front of the first token in the encode function when adding special tokens.
* Add tokenization preprocessing method
* Add framework argument to pipeline factory
Also fixes a pipeline test issue. Each test input is now treated as a
distinct sequence.
2020-02-13 13:29:43 -05:00
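
Note on the space handling described above: in GPT-2 style byte-level BPE, a leading space is part of the token, so "world" and " world" are split and encoded differently. The sketch below illustrates that with a deliberately simplified, ASCII-only pre-tokenization pattern. The pattern and the helper names `pre_tokenize` and `add_prefix_space` are assumptions made for this example, not the tokenizer's actual code.

```python
import re
from typing import List

# Simplified stand-in for a GPT-2 style pre-tokenization pattern.
# An optional leading space stays attached to the word that follows it.
PATTERN = re.compile(r" ?[A-Za-z]+| ?\d+| ?[^\sA-Za-z\d]+|\s+(?!\S)|\s+")


def pre_tokenize(text: str) -> List[str]:
    """Split text into chunks, keeping each leading space with its word."""
    return PATTERN.findall(text)


def add_prefix_space(text: str) -> str:
    """Insert a space before the first token unless one is already there."""
    return text if text.startswith(" ") else " " + text


# Without the prefix space, the first word is split differently from the rest.
print(pre_tokenize("Hello world"))

# With the prefix space, every word carries its leading space.
print(pre_tokenize(add_prefix_space("Hello world")))
```

Under these assumptions, adding the prefix space makes a word at the start of a sequence split the same way as the same word in the middle of a sentence.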
Sam Shleifer
0ed630f139
Attempt to increase timeout for circleci slow tests ( #2844 )
2020-02-13 09:11:03 -05:00
Sam Shleifer
ef74b0f07a
get_activation('relu') provides a simple mapping from strings i… ( #2807 )
...
* activations.py contains a mapping from string to activation function
* resolves some `gelu` vs `gelu_new` ambiguity
2020-02-13 08:28:33 -05:00
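
Note on the string-to-activation mapping described above: the idea can be sketched as a dictionary lookup that fails loudly on unknown names. This is a scalar, pure-Python illustration only. The library version operates on tensors, and the dictionary name `ACT2FN` and the function bodies here are assumptions made for the example. It also shows why `gelu` and `gelu_new` are distinct entries: the first uses the exact Gaussian CDF, the second a tanh approximation.

```python
import math
from typing import Callable, Dict


def relu(x: float) -> float:
    return max(0.0, x)


def gelu(x: float) -> float:
    # Exact GELU: x times the standard normal CDF at x.
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_new(x: float) -> float:
    # Tanh approximation of GELU.
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


# Hypothetical registry mapping names to activation functions.
ACT2FN: Dict[str, Callable[[float], float]] = {
    "relu": relu,
    "gelu": gelu,
    "gelu_new": gelu_new,
}


def get_activation(name: str) -> Callable[[float], float]:
    """Look up an activation by name, raising KeyError for unknown names."""
    if name in ACT2FN:
        return ACT2FN[name]
    raise KeyError(f"function {name} not found in mapping {sorted(ACT2FN)}")


act = get_activation("relu")
print(act(-1.0), act(2.0))
```

Keeping both `gelu` variants under separate keys means a config string selects one of them explicitly, rather than leaving the choice to whichever implementation a model file happens to import.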
Lysandre
f54a5bd37f
Raise error when using an mlm flag for a clm model + correct TextDataset
2020-02-12 13:23:14 -05:00
Lysandre
569897ce2c
Fix a few issues regarding the language modeling script
2020-02-12 13:23:14 -05:00
Julien Chaumond
21da895013
[model_cards] Better image for social sharing
2020-02-11 20:30:08 -05:00
Julien Chaumond
9a70910d47
[model_cards] Tweak @mrm8488's model card
2020-02-11 20:20:39 -05:00
Julien Chaumond
9274734a0d
[model_cards] mv to correct location + tweak tag
2020-02-11 20:13:57 -05:00
Manuel Romero
69f948461f
Create bert-base-spanish-wwm-cased-finetuned-spa-squad2-es.md
2020-02-11 20:07:15 -05:00
Julien Chaumond
e0b6247cf7
[model_cards] Change formatting slightly as we updated our markdown engine
...
cc @tholor @loretoparisi @simonefrancia
2020-02-11 18:25:21 -05:00
sshleifer
5f2dd71d1b
Smaller diff
2020-02-11 17:20:09 -05:00
sshleifer
31158af57c
formatting
2020-02-11 17:20:09 -05:00
sshleifer
5dd61fb9a9
Add more specific testing advice to Contributing.md
2020-02-11 17:20:09 -05:00
Oleksiy Syvokon
ee5de0ba44
BERT decoder: Fix causal mask dtype.
...
PyTorch < 1.3 requires multiplication operands to be of the same type.
This was violated when using the default attention mask (i.e.,
attention_mask=None in the arguments) with BERT in decoder mode.
In particular, this broke Model2Model and caused the tutorial
from the quickstart to fail.
2020-02-11 15:19:22 -05:00
jiyeon
bed38d3afe
Fix typo in src/transformers/data/processors/squad.py
2020-02-11 11:22:24 -05:00
Stefan Schweter
498d06e914
[model_cards] Add new German Europeana BERT models ( #2805 )
...
* [model_cards] New German Europeana BERT models from dbmdz
* [model_cards] Update German Europeana BERT models from dbmdz
2020-02-11 10:49:39 -05:00
Funtowicz Morgan
3e3a9e2c01
Merge pull request #2793 from huggingface/tensorflow-210-circleci-fix
...
Fix circleci cuInit error on Tensorflow >= 2.1.0.
2020-02-11 10:48:42 +00:00
Julien Chaumond
1f5db9a13c
[model_cards] Rm extraneous tag
2020-02-10 17:45:13 -05:00
Julien Chaumond
95bac8dabb
[model_cards] Add language metadata to existing model cards
...
This will enable filtering on language (amongst other tags) on the website
cc @loretoparisi, @stefan-it, @HenrykBorzymowski, @marma
2020-02-10 17:42:42 -05:00
ahotrod
ba498eac38
Create README.md ( #2785 )
...
* Create README.md
* Update README.md
* Update README.md
* Update README.md
* [model_cards] Use code fences for consistency
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-02-10 17:27:59 -05:00
Malte Pietsch
68ccc04ee6
Add model readme for deepset/roberta-base-squad2 ( #2797 )
...
* Add readme for deepset/roberta-base-squad2
* update model readme
2020-02-10 15:21:48 -05:00
Lysandre
539f601be7
intermediate_size > hidden_dim in distilbert config docstrings
2020-02-10 13:45:57 -05:00
Lysandre
cfb7d108bd
FlauBERT lang embeddings only when n_langs > 1
2020-02-10 13:24:04 -05:00
Julien Chaumond
b4691a438d
[model_cards] BERT-of-Theseus: use the visual as thumbnail
...
cc @jetrunner
Co-Authored-By: Kevin Canwen Xu <canwenxu@outlook.com>
2020-02-10 11:27:08 -05:00
Julien Chaumond
fc325e97cd
[model_cards] Showcase model tag syntax
2020-02-10 11:27:08 -05:00
Lysandre
fd639e5be3
Correct quickstart example when using the past
2020-02-10 11:25:56 -05:00