BramVanroy
9d87eafd11
Streamlining
...
- mostly stylistic streamlining
- removed 'additional context' sections. They seem to be rarely used and might cause confusion. If more details are needed, users can add them to the 'details' section
2020-01-28 10:41:10 -05:00
BramVanroy
a3b3638f6f
phrasing
2020-01-28 10:41:10 -05:00
BramVanroy
c96ca70f25
Update ---new-benchmark.md
2020-01-28 10:41:10 -05:00
BramVanroy
7b5eda32bb
Update --new-model-addition.md
...
Motivate users to @-tag authors of models to increase visibility and expand the community
2020-01-28 10:41:10 -05:00
BramVanroy
c63d91dd1c
Update bug-report.md
...
- change references to pytorch-transformers to transformers
- link to code formatting guidelines
2020-01-28 10:41:10 -05:00
BramVanroy
b2907cd06e
Update feature-request.md
...
- add 'your contribution' section
- add code formatting link to 'additional context'
2020-01-28 10:41:10 -05:00
BramVanroy
2fec88ee02
Update question-help.md
...
Prefer that general questions are asked on Stack Overflow
2020-01-28 10:41:10 -05:00
BramVanroy
7e03d2bd7c
update migration guide
...
Streamlines usages of pytorch-transformers and pytorch-pretrained-bert. Add link to the README for the migration guide.
2020-01-28 10:41:10 -05:00
Lysandre
335dd5e68a
Default save steps 50 to 500 in all scripts
2020-01-28 09:42:11 -05:00
Lysandre
ea2600bd5f
Absolute definitive HeisenDistilBug solve
...
cc @julien-c @thomwolf
2020-01-27 21:58:36 -05:00
Wietse de Vries
5c3d441ee1
Fix formatting
2020-01-27 21:00:34 -05:00
Wietse de Vries
f5a236c3ca
Add Dutch pre-trained BERT model
2020-01-27 21:00:34 -05:00
Julien Chaumond
6b4c3ee234
[run_lm_finetuning] GPT2 tokenizer doesn't have a pad_token
...
ping @lysandrejik
2020-01-27 20:14:02 -05:00
Julien Chaumond
79815bf666
[serving] Fix typo
2020-01-27 19:58:25 -05:00
Julien Chaumond
5004d5af42
[serving] Update dependencies
2020-01-27 19:58:00 -05:00
Lysandre
9ca21c838b
Style
2020-01-27 14:49:12 -05:00
thomwolf
e0849a66ac
adding in the doc
2020-01-27 14:27:07 -05:00
thomwolf
6b081f04e6
style and quality
2020-01-27 14:27:07 -05:00
thomwolf
0e31e06a75
Add AutoModelForPreTraining
2020-01-27 14:27:07 -05:00
Julien Chaumond
ea56d305be
make style
2020-01-27 12:13:32 -05:00
Malte Pietsch
d440e21f5b
add mapping of roberta for QA
2020-01-27 12:12:46 -05:00
Lysandre
875c4ae48f
Definitive HeisenDistilBug fix
...
cc @julien-c @@thomwolf
2020-01-27 12:09:58 -05:00
Lysandre
f09f42d4d3
Input Embeddings should be assigned
...
cc @julien-c
2020-01-27 11:46:00 -05:00
Maksym Del
bac51fba3a
Fix token_type_ids for XLM-R
2020-01-27 11:08:31 -05:00
Lysandre
babd41e7fa
Code quality
2020-01-24 17:06:55 -05:00
Lysandre
974d083c7b
Accurate model for configuration
2020-01-24 16:46:03 -05:00
Lysandre
983fef469c
AutoModels doc
2020-01-24 16:37:30 -05:00
Lysandre
009fcb0ec1
Configuration utils
2020-01-24 16:37:30 -05:00
Julien Chaumond
11b13e94a3
Add type to help my IDE out
2020-01-24 14:00:57 -05:00
VictorSanh
1ce3fb5cc7
update correct eval metrics (distilbert & co)
2020-01-24 11:45:22 -05:00
Nicholas Lourie
62f5804608
Update the doc string for T5WithLMHeadModel
...
T5WithLMHeadModel's doc string claims that indices of -1 are
ignored while computing the cross-entropy loss in the forward
pass; however, indices of -1 throw an error while indices of -100
are ignored. This commit updates the doc string to be consistent
with the class's behavior.
2020-01-24 10:28:20 -05:00
Lysandre
908230d261
Pickle CamemBERT tokenizer
2020-01-24 10:08:59 -05:00
Lysandre
24d5ad1dcc
Run the examples in slow
2020-01-23 09:38:45 -05:00
Lysandre
9ddf60b694
Tips + whitespaces
2020-01-23 09:38:45 -05:00
Lysandre
0e9899f451
Fixes
2020-01-23 09:38:45 -05:00
Lysandre
48ac24020d
TF CTRL
2020-01-23 09:38:45 -05:00
Lysandre
7511f3dd89
PyTorch CTRL + Style
2020-01-23 09:38:45 -05:00
Lysandre
980211a63a
XLM-RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
6bc966793a
TF DistilBERT
2020-01-23 09:38:45 -05:00
Lysandre
db1a7f27a1
PyTorch DistilBERT
2020-01-23 09:38:45 -05:00
Lysandre
b28020f590
TF RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
3e1bc27e1b
Pytorch RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
f44ff574d3
Camembert
2020-01-23 09:38:45 -05:00
Lysandre
264eb23912
TF XLM
2020-01-23 09:38:45 -05:00
Lysandre
ccebcae75f
PyTorch XLM
2020-01-23 09:38:45 -05:00
Lysandre
92b3cb786d
TF XLNet
2020-01-23 09:38:45 -05:00
Lysandre
cd656fb21a
PyTorch XLNet
2020-01-23 09:38:45 -05:00
Lysandre
83fa8d9fb5
TF Transformer-XL
2020-01-23 09:38:45 -05:00
Lysandre
98edad418e
PyTorch Transformer-XL
2020-01-23 09:38:45 -05:00
Lysandre
96d21ad06b
TF OpenAI GPT
2020-01-23 09:38:45 -05:00