erenup
46ffc28329
Merge branch 'master' into run_multiple_choice_merge
...
# Please enter a commit message to explain why this merge is necessary,
# especially if it merges an updated upstream into a topic branch.
#
# Lines starting with '#' will be ignored, and an empty message aborts
# the commit.
2019-09-18 21:43:46 +08:00
erenup
3cd6289758
Merge remote-tracking branch 'huggingface/master' into run_multiple_choice_merge
...
# Conflicts:
# examples/contrib/run_swag.py
2019-09-18 21:16:59 +08:00
Julien Chaumond
62760baf46
tiny fixes
2019-09-17 18:29:15 -04:00
thomwolf
45de034bf8
fix #1223
2019-09-17 10:25:06 +02:00
erenup
a9debaca3d
fixed init_weight
2019-09-16 19:55:24 +08:00
erenup
982f181aa7
Merge remote-tracking branch 'origin/master' into run_multiple_choice_add_doc
2019-09-16 19:12:00 +08:00
erenup
84b9d1c423
Merge remote-tracking branch 'huggingface/master'
...
# Conflicts:
# pytorch_transformers/__init__.py
2019-09-16 19:06:12 +08:00
erenup
4812a5a767
add doc string
2019-09-16 11:50:18 +08:00
Thomas Wolf
3f05de6dde
Merge branch 'master' into reorder_arguments
2019-09-09 15:42:25 +03:00
thomwolf
3401980fc4
fix #1208
2019-09-09 10:22:12 +03:00
thomwolf
2a667b1eb9
split configuration and modeling files
2019-09-05 00:27:11 +02:00
thomwolf
e25cba78cf
WIP reodering arguments for torchscript and TF
2019-09-04 22:39:23 +02:00
LysandreJik
b6992b7b47
Applied patch to OpenAI GPT, RoBERTa, TransfoL, XLM and XLNet
2019-08-31 00:33:50 -04:00
Julien Chaumond
1d438f15b3
[XLNet] Use pytorch's layernorm like in BERT
...
See #1089
cc @thomwolf @lysandrejik
Also @dhpollack
2019-08-30 20:20:15 -04:00
erenup
6e1ac34e2b
Merge remote-tracking branch 'huggingface/master'
2019-08-30 15:50:11 +08:00
ziliwang
6060b2f89b
fix: hard coding for max number
...
fp16 max number is 65504, the original 1e30 will cause Nan in fp16
2019-08-30 12:13:47 +08:00
thomwolf
0a74c88ac6
fix #1131
2019-08-28 22:41:42 +02:00
VictorSanh
7f5d85347e
fix small typo
2019-08-28 02:44:51 +00:00
thomwolf
53c8f700f4
fix #808
2019-08-20 11:29:26 +02:00
Lysandre
c589862b78
Doc: loading from config alone does not load the model weights
2019-08-19 10:17:47 -04:00
erenup
b8fde43868
a coding bug
2019-08-19 16:36:43 +08:00
erenup
5582bc4b23
add multiple choice to robreta and xlnet, test on swag, roberta=0.82.28
...
, xlnet=0.80
2019-08-18 16:01:48 +08:00
LysandreJik
f2b300df6b
fix #976
2019-08-08 10:38:57 -04:00
Thomas Wolf
822d6768eb
Merge pull request #962 from guotong1988/patch-1
...
Update modeling_xlnet.py
2019-08-07 10:09:20 +02:00
wangfei
6ec1ee9ec2
Fix examples in docstring
2019-08-06 11:32:54 +08:00
wangfei
beb03ec6c5
Fix examples of loading pretrained models in docstring
2019-08-06 11:24:46 +08:00
雷打不动!
84eb699082
Update modeling_xlnet.py
2019-08-05 08:57:09 +08:00
thomwolf
bfbe52ec39
cleaning up example docstrings
2019-07-27 20:25:39 +02:00
thomwolf
0227b4a940
fix #827
2019-07-23 14:06:43 +02:00
thomwolf
f289e6cfe4
fix docstrings
2019-07-16 15:31:21 +02:00
thomwolf
1b35d05d4b
update conversion scripts and __main__
2019-07-16 09:41:55 +02:00
thomwolf
3b8b0e01bb
update readme
2019-07-16 00:12:55 +02:00
thomwolf
15d8b1266c
update tokenizer - update squad example for xlnet
2019-07-15 17:30:42 +02:00
thomwolf
3b469cb422
updating squad for compatibility with XLNet
2019-07-15 15:28:37 +02:00
thomwolf
f7cd7392fd
fixed tests
2019-07-15 12:32:19 +02:00
thomwolf
44c985facd
update doc for XLM and XLNet
2019-07-15 11:36:50 +02:00
thomwolf
7d4b200e40
good quality generation example for GPT, GPT-2, Transfo-XL, XLNet
2019-07-13 15:25:03 +02:00
thomwolf
2918b7d2a0
updating tests
2019-07-12 10:57:58 +02:00
thomwolf
bd404735a7
embeddings resizing + tie_weights
2019-07-12 00:02:49 +02:00
Thomas Wolf
b87eb82b4f
Merge branch 'xlnet' into doc-sphinx
2019-07-11 15:46:27 +02:00
thomwolf
ec07cf5a66
rewamp optimization
2019-07-11 14:48:22 +02:00
thomwolf
4fef5919a5
updating examples
2019-07-11 12:03:08 +02:00
LysandreJik
5288913bdd
All TODOs to be checked by Thom have been added.
2019-07-10 15:16:40 -04:00
thomwolf
50b7e52a7f
WIP examples
2019-07-10 15:33:34 +02:00
LysandreJik
83fb311ef7
Patched warnings + Refactored XLNet's Docstrings
2019-07-09 16:38:30 -04:00
LysandreJik
8fe2c9d98e
Refactored Docstrings of BERT, GPT2, GPT, TransfoXL, XLM and XLNet.
2019-07-09 15:55:31 -04:00
thomwolf
d5481cbe1b
adding tests to examples - updating summary module - coverage update
2019-07-09 15:29:42 +02:00
thomwolf
b19786985d
unified tokenizer api and serialization + tests
2019-07-09 10:25:18 +02:00
thomwolf
36bca545ff
tokenization abstract class - tests for examples
2019-07-05 15:02:59 +02:00
thomwolf
0bab55d5d5
[BIG] name change
2019-07-05 11:55:36 +02:00