LysandreJik
39f426be65
Added special tokens <pad> and <mask> to RoBERTa.
2019-08-13 15:19:50 -04:00
Julien Chaumond
baf08ca1d4
[RoBERTa] run_glue: correct pad_token + reorder labels
2019-08-13 12:51:15 -04:00
LysandreJik
3d87991f60
Fixed error with encoding
2019-08-13 12:00:24 -04:00
tuvuumass
ba4bce2581
fix issue #824
2019-08-13 11:26:27 -04:00
LysandreJik
634a3172d8
Added integration tests for sequence builders.
2019-08-12 15:14:15 -04:00
LysandreJik
22ac004a7c
Added documentation and changed parameters for special_tokens_sentences_pair.
2019-08-12 15:13:53 -04:00
Julien Chaumond
912fdff899
[RoBERTa] Update run_glue for RoBERTa
2019-08-12 13:49:50 -04:00
Julien Chaumond
b3d83d68db
Fixup 9d0603148b
2019-08-12 12:28:55 -04:00
carefree0910
a7b4cfe919
Update README.md
I assume that it should test the `re-load` functionality after testing the `save` functionality. However, I'm surprised that nobody has pointed this out after such a long time, so maybe I've misunderstood the purpose. This PR is just in case :)
2019-08-12 09:53:05 -04:00
erenup
b219029c45
Refactoring old run_swag. This script is mainly refactored from run_squad in pytorch_transformers
2019-08-11 15:20:37 +08:00
thomwolf
aaedfc35a8
Merge branch 'master' of https://github.com/huggingface/pytorch-transformers
2019-08-10 20:04:37 +02:00
thomwolf
c683c3d5a5
fix #993
2019-08-10 20:04:35 +02:00
Kevin Trebing
7060766490
Corrected logger.error info
Signed-off-by: Kevin Trebing <Kevin.Trebing@gmx.net>
2019-08-09 19:36:44 -04:00
LysandreJik
75d5f98fd2
Roberta tokenization + fixed tests (py3 + py2).
2019-08-09 15:02:13 -04:00
LysandreJik
14e970c271
Tokenization encode/decode class-based sequence handling
2019-08-09 15:01:38 -04:00
LysandreJik
3566d27919
Clarified PreTrainedModel.from_pretrained warning messages in documentation.
2019-08-08 19:04:34 -04:00
LysandreJik
fbd746bd06
Updated test architecture
2019-08-08 18:21:34 -04:00
LysandreJik
6c41a8f5dc
Encode and Decode are back in the superclass. They now handle special tokens for sentence pairs.
2019-08-08 18:20:32 -04:00
Julien Chaumond
e367ac469c
[RoBERTa] Re-apply 39d72bcc7b
cc @lysandrejik
2019-08-08 11:26:11 -04:00
Julien Chaumond
9d0603148b
[RoBERTa] RobertaForSequenceClassification + conversion
2019-08-08 11:24:54 -04:00
LysandreJik
f2b300df6b
fix #976
2019-08-08 10:38:57 -04:00
LysandreJik
7df303f5ad
fix #971
2019-08-08 10:36:26 -04:00
LysandreJik
d2cc6b101e
Merge branch 'master' into RoBERTa
2019-08-08 09:42:05 -04:00
LysandreJik
39d72bcc7b
Fixed the RoBERTa checkpoint conversion script according to the LM head refactoring.
2019-08-07 14:21:57 -04:00
LysandreJik
770043eea2
Sentence-pair tasks handling. Using common tests on RoBERTa. Forced push to fix indentation.
2019-08-07 12:53:19 -04:00
Thomas Wolf
7729ef7381
Merge pull request #955 from FeiWang96/master
Fix comment typo
2019-08-07 10:11:25 +02:00
Thomas Wolf
5c6ecf37e7
Merge pull request #958 from saket404/typo-fix
Fixed small typo
2019-08-07 10:10:20 +02:00
Thomas Wolf
b4f9464f90
Merge pull request #960 from ethanjperez/patch-1
Fixing unused weight_decay argument
2019-08-07 10:09:55 +02:00
Thomas Wolf
822d6768eb
Merge pull request #962 from guotong1988/patch-1
Update modeling_xlnet.py
2019-08-07 10:09:20 +02:00
Thomas Wolf
7e6102ce74
Merge pull request #963 from guotong1988/patch-2
Update modeling_bert.py
2019-08-07 10:09:04 +02:00
Thomas Wolf
3773ba44f0
Merge pull request #977 from chrisgzf/master
Fixed typo in migration guide
2019-08-07 10:08:45 +02:00
Thomas Wolf
a80aa03bda
Merge pull request #973 from FeiWang96/bert_config
Fix examples of loading pretrained models in docstring
2019-08-07 10:08:22 +02:00
Christopher Goh
a6f412da01
Fixed typo in migration guide
2019-08-07 02:19:14 +08:00
wangfei
6ec1ee9ec2
Fix examples in docstring
2019-08-06 11:32:54 +08:00
wangfei
72622926e5
Fix examples in docstring
2019-08-06 11:32:41 +08:00
wangfei
f889e77b9c
Fix examples of loading pretrained models in docstring
2019-08-06 11:30:35 +08:00
wangfei
beb03ec6c5
Fix examples of loading pretrained models in docstring
2019-08-06 11:24:46 +08:00
Thomas Wolf
4fc9f9ef54
Merge pull request #910 from huggingface/auto_models
Adding AutoTokenizer and AutoModel classes that automatically detect architecture - Clean up tokenizers
2019-08-05 19:17:47 +02:00
Thomas Wolf
d43dc48b34
Merge branch 'master' into auto_models
2019-08-05 19:17:35 +02:00
thomwolf
0b524b0848
remove derived classes for now
2019-08-05 19:08:19 +02:00
thomwolf
13936a9621
update doc and tests
2019-08-05 18:48:16 +02:00
thomwolf
ed4e542260
adding tests
2019-08-05 18:14:07 +02:00
thomwolf
3a126e73dd
fix #950
2019-08-05 17:26:29 +02:00
thomwolf
7223886dc9
fix #944
2019-08-05 17:16:56 +02:00
thomwolf
70c10caa06
add option mentioned in #940
2019-08-05 17:09:37 +02:00
thomwolf
077ad693e9
tweak issue template wording
2019-08-05 16:46:29 +02:00
thomwolf
02d4087cb8
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2019-08-05 16:26:01 +02:00
thomwolf
7c524d631e
add issue templates
2019-08-05 16:25:54 +02:00
Lysandre Debut
6f05ad72b4
Merge pull request #791 from huggingface/doc
RestructuredText table for pretrained models.
2019-08-05 10:18:00 -04:00
thomwolf
b90e29d52c
working on automodels
2019-08-05 16:06:34 +02:00