LysandreJik
39d72bcc7b
Fixed the RoBERTa checkpoint conversion script according to the LM head refactoring.
2019-08-07 14:21:57 -04:00
LysandreJik
770043eea2
Sentence-pair tasks handling. Using common tests on RoBERTa. Forced push to fix indentation.
2019-08-07 12:53:19 -04:00
Julien Chaumond
cb9db101c7
Python 2 must DIE
2019-08-04 22:04:15 -04:00
Julien Chaumond
05c083520a
[RoBERTa] model conversion, inference, tests 🔥
2019-08-04 21:39:21 -04:00
thomwolf
7b6e474c9a
fix #901
2019-07-26 21:26:44 +02:00
thomwolf
632d711411
fix #908
2019-07-26 21:14:37 +02:00
Thomas Wolf
6219ad7216
Merge pull request #888 from rococode/patch-1
...
Update docs for parameter rename
2019-07-25 15:01:22 +02:00
Joel Grus
ae152cec09
make save_pretrained work with added tokens
...
right now it's dumping the *decoder* when it should be dumping the *encoder*. this fixes that.
2019-07-24 16:54:48 -07:00
rococo // Ron
66b15f73f0
Update docs for parameter rename
...
OpenAIGPTLMHeadModel now accepts `labels` instead of `lm_labels`
2019-07-24 11:27:08 -07:00
Thomas Wolf
067923d326
Merge pull request #873 from huggingface/identity_replacement
...
Add nn.Identity replacement for old PyTorch
2019-07-23 18:16:35 +02:00
Thomas Wolf
368670ac31
Merge pull request #866 from xanlsh/master
...
Rework how PreTrainedModel.from_pretrained handles its arguments
2019-07-23 18:05:30 +02:00
thomwolf
1383c7b87a
Fix #869
2019-07-23 17:52:20 +02:00
Anish Moorthy
4fb56c7729
Remove unused *args parameter from PreTrainedConfig.from_pretrained
2019-07-23 10:43:01 -04:00
Anish Moorthy
e179c55490
Add docs for from_pretrained functions, rename return_unused_args
2019-07-23 10:43:01 -04:00
thomwolf
0740e63e49
updating schedules for state_dict saving
2019-07-23 15:57:18 +02:00
Thomas Wolf
c4bc66886d
Merge pull request #860 from Yiqing-Zhou/patch-1
...
read().splitlines() -> readlines()
2019-07-23 15:24:25 +02:00
Yiqing-Zhou
b1019d2a8e
token[-1] -> token.rstrip('\n')
2019-07-23 20:41:26 +08:00
thomwolf
0227b4a940
fix #827
2019-07-23 14:06:43 +02:00
Anish Moorthy
490ebbdcf7
Fix PretrainedModel.from_pretrained not passing cache_dir forward
2019-07-22 18:03:08 -04:00
Anish Moorthy
b8009cb0da
Make PreTrainedModel.from_pretrained pass unused arguments to model
2019-07-22 18:03:08 -04:00
Yiqing-Zhou
bef0c629ca
fix
...
Remove '\n' before adding token into vocab
2019-07-22 22:30:49 +08:00
Yiqing-Zhou
897d0841be
read().splitlines() -> readlines()
...
splitlines() does not work as what we expect here for bert-base-chinese because there is a '\u2028' (unicode line seperator) token in vocab file. Value of '\u2028'.splitlines() is ['', ''].
Perhaps we should use readlines() instead.
2019-07-22 20:49:09 +08:00
Minho Ryu
cd8980e1f4
import sys twice
2019-07-17 18:12:01 +09:00
thomwolf
5fe0b378d8
adding missing docstring fix #793
2019-07-16 21:35:53 +02:00
thomwolf
ed7549bb1a
release version 1.0
2019-07-16 16:10:58 +02:00
thomwolf
4acaa65068
model in evaluation mode by default after from_pretrained
2019-07-16 15:41:57 +02:00
thomwolf
f289e6cfe4
fix docstrings
2019-07-16 15:31:21 +02:00
thomwolf
9726b229cf
model name typo
2019-07-16 15:17:45 +02:00
thomwolf
1849aa7d39
update readme and pretrained model weight files
2019-07-16 15:11:29 +02:00
thomwolf
f31154cb9d
Merge branch 'xlnet'
2019-07-16 11:51:13 +02:00
thomwolf
1b35d05d4b
update conversion scripts and __main__
2019-07-16 09:41:55 +02:00
thomwolf
352e3ff998
added migration guide to readme
2019-07-16 09:03:49 +02:00
thomwolf
3b8b0e01bb
update readme
2019-07-16 00:12:55 +02:00
thomwolf
e691fc0963
update QA models tests + run_generation
2019-07-15 17:45:24 +02:00
thomwolf
15d8b1266c
update tokenizer - update squad example for xlnet
2019-07-15 17:30:42 +02:00
thomwolf
3b469cb422
updating squad for compatibility with XLNet
2019-07-15 15:28:37 +02:00
thomwolf
8ca767f13c
clean up optimization
2019-07-15 13:49:07 +02:00
thomwolf
74a24f0fe9
clean up file_utils
2019-07-15 13:49:01 +02:00
thomwolf
ab49fafc04
update tokenization docstrings for #328
2019-07-15 12:51:23 +02:00
thomwolf
a9ab15174c
fix #328
2019-07-15 12:42:12 +02:00
thomwolf
f7cd7392fd
fixed tests
2019-07-15 12:32:19 +02:00
thomwolf
e28d8bde0d
doc on base classes
2019-07-15 12:08:06 +02:00
thomwolf
44c985facd
update doc for XLM and XLNet
2019-07-15 11:36:50 +02:00
thomwolf
0201d86015
added doc for transformer-xl
2019-07-15 10:11:09 +02:00
thomwolf
4cb489457f
added doc for openai GPT
2019-07-15 09:58:01 +02:00
thomwolf
62b8eb43c1
fix add_start_docstrings on python 2 (removed)
2019-07-15 09:49:02 +02:00
thomwolf
5bc3d0cc5b
added gpt2 doc
2019-07-15 09:40:05 +02:00
thomwolf
183fedfed5
fix doc on python2
2019-07-15 09:00:09 +02:00
thomwolf
2397f958f9
updating examples and doc
2019-07-14 23:20:10 +02:00
thomwolf
7d4b200e40
good quality generation example for GPT, GPT-2, Transfo-XL, XLNet
2019-07-13 15:25:03 +02:00