Malte Pietsch
d440e21f5b
add mapping of roberta for QA
2020-01-27 12:12:46 -05:00
Lysandre
875c4ae48f
Definitive HeisenDistilBug fix
...
cc @julien-c @thomwolf
2020-01-27 12:09:58 -05:00
Lysandre
f09f42d4d3
Input Embeddings should be assigned
...
cc @julien-c
2020-01-27 11:46:00 -05:00
Maksym Del
bac51fba3a
Fix token_type_ids for XLM-R
2020-01-27 11:08:31 -05:00
Lysandre
babd41e7fa
Code quality
2020-01-24 17:06:55 -05:00
Lysandre
974d083c7b
Accurate model for configuration
2020-01-24 16:46:03 -05:00
Lysandre
983fef469c
AutoModels doc
2020-01-24 16:37:30 -05:00
Lysandre
009fcb0ec1
Configuration utils
2020-01-24 16:37:30 -05:00
Julien Chaumond
11b13e94a3
Add type to help my IDE out
2020-01-24 14:00:57 -05:00
VictorSanh
1ce3fb5cc7
update correct eval metrics (distilbert & co)
2020-01-24 11:45:22 -05:00
Nicholas Lourie
62f5804608
Update the doc string for T5WithLMHeadModel
...
T5WithLMHeadModel's doc string claims that indices of -1 are
ignored while computing the cross-entropy loss in the forward
pass; however, indices of -1 throw an error while indices of -100
are ignored. This commit updates the doc string to be consistent
with the class's behavior.
2020-01-24 10:28:20 -05:00
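The behavior described in the commit above can be illustrated with a small pure-Python sketch. It assumes the loss follows the usual ignore-index convention (PyTorch's `CrossEntropyLoss` defaults to `ignore_index=-100`). The function `cross_entropy` and the sample values are hypothetical and are not taken from the T5 code. The error type raised for an out-of-range label such as -1 is also an assumption; the real one depends on the framework and backend.

```python
import math

# Assumption: mirrors the default ignore_index of torch.nn.CrossEntropyLoss.
IGNORE_INDEX = -100


def cross_entropy(logits, labels, ignore_index=IGNORE_INDEX):
    """Mean cross-entropy over positions whose label is not ignore_index."""
    total, count = 0.0, 0
    for row, label in zip(logits, labels):
        if label == ignore_index:
            # Masked position: contributes nothing to the loss.
            continue
        if not 0 <= label < len(row):
            # Any other negative label (e.g. -1) is not a valid class index.
            raise IndexError(f"label {label} is out of range for {len(row)} classes")
        # Numerically stable log-sum-exp for the softmax normalizer.
        peak = max(row)
        log_z = peak + math.log(sum(math.exp(x - peak) for x in row))
        total += log_z - row[label]
        count += 1
    return total / count if count else float("nan")


logits = [[2.0, 0.5, 0.1], [0.2, 1.5, 0.3]]

# The second position is masked with -100, so only the first one counts.
loss_masked = cross_entropy(logits, [0, -100])
loss_single = cross_entropy(logits[:1], [0])
```

Calling `cross_entropy(logits, [0, -1])` raises instead of silently skipping the second position, which is the mismatch the doc string fix addresses.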
Lysandre
908230d261
Pickle CamemBERT tokenizer
2020-01-24 10:08:59 -05:00
Lysandre
24d5ad1dcc
Run the examples in slow
2020-01-23 09:38:45 -05:00
Lysandre
9ddf60b694
Tips + whitespaces
2020-01-23 09:38:45 -05:00
Lysandre
0e9899f451
Fixes
2020-01-23 09:38:45 -05:00
Lysandre
48ac24020d
TF CTRL
2020-01-23 09:38:45 -05:00
Lysandre
7511f3dd89
PyTorch CTRL + Style
2020-01-23 09:38:45 -05:00
Lysandre
980211a63a
XLM-RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
6bc966793a
TF DistilBERT
2020-01-23 09:38:45 -05:00
Lysandre
db1a7f27a1
PyTorch DistilBERT
2020-01-23 09:38:45 -05:00
Lysandre
b28020f590
TF RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
3e1bc27e1b
PyTorch RoBERTa
2020-01-23 09:38:45 -05:00
Lysandre
f44ff574d3
CamemBERT
2020-01-23 09:38:45 -05:00
Lysandre
264eb23912
TF XLM
2020-01-23 09:38:45 -05:00
Lysandre
ccebcae75f
PyTorch XLM
2020-01-23 09:38:45 -05:00
Lysandre
92b3cb786d
TF XLNet
2020-01-23 09:38:45 -05:00
Lysandre
cd656fb21a
PyTorch XLNet
2020-01-23 09:38:45 -05:00
Lysandre
83fa8d9fb5
TF Transformer-XL
2020-01-23 09:38:45 -05:00
Lysandre
98edad418e
PyTorch Transformer-XL
2020-01-23 09:38:45 -05:00
Lysandre
96d21ad06b
TF OpenAI GPT
2020-01-23 09:38:45 -05:00
Lysandre
850795c487
PyTorch GPT
2020-01-23 09:38:45 -05:00
Lysandre
1487b840d3
TF GPT2
2020-01-23 09:38:45 -05:00
Lysandre
bd0d3fd76e
GPT-2 PyTorch models + better tips for BERT
2020-01-23 09:38:45 -05:00
Lysandre
dbeb7fb4e6
BERT TensorFlow
2020-01-23 09:38:45 -05:00
Lysandre
cd77c750c5
BERT PyTorch models
2020-01-23 09:38:45 -05:00
Lysandre
3922a2497e
TF ALBERT + TF Utilities + Fix warnings
2020-01-23 09:38:45 -05:00
Lysandre
00df3d4de0
ALBERT Modeling + required changes to utilities
2020-01-23 09:38:45 -05:00
Lysandre
f81b6c95f2
Flake8 violation
2020-01-23 09:38:45 -05:00
Lysandre
632675ea88
Can test examples spread over multiple blocks
2020-01-23 09:38:45 -05:00
Lysandre
eaa6b9afc6
Require Torch when testing examples
2020-01-23 09:38:45 -05:00
Lysandre
9bab9b83d2
Glossary
2020-01-23 09:38:45 -05:00
Lysandre
64abd3e0aa
Multi-line examples can be tested + ALBERT patch for CircleCI
...
All tests should now work fine.
2020-01-23 09:38:45 -05:00
Lysandre
837577256b
Automatic testing of examples
...
The CircleCI test should fail.
2020-01-23 09:38:45 -05:00
Julien Chaumond
90b7df444f
Upload CLI: on win32, use slashes, not os.sep
2020-01-22 22:41:21 -05:00
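As a rough illustration of the issue named in the commit above: on Windows `os.sep` is a backslash, so a remote file name built from a local relative path would contain backslashes unless it is normalized. The sketch below is not the upload CLI's actual code. The helper `to_remote_name` and the sample paths are hypothetical, and the approach (going through `pathlib.PureWindowsPath`) is just one way to get forward slashes regardless of the host platform.

```python
from pathlib import PureWindowsPath


def to_remote_name(local_relative_path: str) -> str:
    """Return the path with forward slashes, whatever separator it came with.

    Hypothetical helper: PureWindowsPath accepts both "\\" and "/" as
    separators, and as_posix() always renders the result with "/".
    """
    return PureWindowsPath(local_relative_path).as_posix()


# A path as it might be produced with os.sep on win32.
windows_style = to_remote_name("my-model\\config.json")
# A path that already uses forward slashes is left unchanged.
posix_style = to_remote_name("my-model/config.json")
```

One caveat of this sketch: a POSIX file name that legitimately contains a backslash would also be rewritten, so it only suits names where the backslash can only be a separator.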
Julien Chaumond
119dc50e2a
Doc tweak on model sharing
2020-01-22 22:40:38 -05:00
Julien Chaumond
34a3c25a30
Fix for XLMRobertaConfig inherits from RobertaConfig
...
hat/tip @stefan-it
2020-01-22 17:50:24 -05:00
Julien Chaumond
1a8e87be4e
Line-by-line text dataset (including padding)
2020-01-21 16:57:38 -05:00
Julien Chaumond
b94cf7faac
change order
2020-01-21 16:57:38 -05:00
Julien Chaumond
2eaa8b6e56
Easier to not support this, as it could be confusing
...
cc @lysandrejik
2020-01-21 16:57:38 -05:00
Julien Chaumond
801aaa5508
make style
2020-01-21 16:57:38 -05:00