thomwolf
|
0f091062d4
|
Merge branch 'glue-example' into tf2
|
2019-09-25 10:21:52 +02:00 |
|
Julien Chaumond
|
62760baf46
|
tiny fixes
|
2019-09-17 18:29:15 -04:00 |
|
thomwolf
|
4b956b2a6b
|
add layer_norm_epsilon configuration for transformer xl
|
2019-09-13 17:09:20 +02:00 |
|
thomwolf
|
65c49bb27e
|
adding TF 2.0 adaptive softmax with logits + loss outputs
|
2019-09-13 15:50:51 +02:00 |
|
Zili Wang
|
8bdee1cb73
|
fixed: hard coding for max and min number will out of range in fp16, which will cause nan.
|
2019-09-11 15:41:53 +08:00 |
|
Thomas Wolf
|
3f05de6dde
|
Merge branch 'master' into reorder_arguments
|
2019-09-09 15:42:25 +03:00 |
|
thomwolf
|
1efb1f1660
|
split configuration and modeling files
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
1eb125fb95
|
be sure we have uint8
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
2a667b1eb9
|
split configuration and modeling files
|
2019-09-05 00:27:11 +02:00 |
|
thomwolf
|
0be6a2a624
|
be sure we have uint8
|
2019-09-04 22:47:38 +02:00 |
|
thomwolf
|
e25cba78cf
|
WIP reodering arguments for torchscript and TF
|
2019-09-04 22:39:23 +02:00 |
|
thomwolf
|
38b79b5a63
|
Fixing this TransformerXL bool issue
|
2019-09-04 22:36:30 +02:00 |
|
LysandreJik
|
b6992b7b47
|
Applied patch to OpenAI GPT, RoBERTa, TransfoL, XLM and XLNet
|
2019-08-31 00:33:50 -04:00 |
|
Thomas Wolf
|
50792dbdcc
|
Merge pull request #1127 from huggingface/dilbert
DilBERT
|
2019-08-28 16:43:09 +02:00 |
|
VictorSanh
|
7f5d85347e
|
fix small typo
|
2019-08-28 02:44:51 +00:00 |
|
Nikolay Korolev
|
53282b5bd0
|
Change attention mask dtype to be bool. Fix #1119
|
2019-08-27 14:19:03 +03:00 |
|
thomwolf
|
53c8f700f4
|
fix #808
|
2019-08-20 11:29:26 +02:00 |
|
Lysandre
|
c589862b78
|
Doc: loading from config alone does not load the model weights
|
2019-08-19 10:17:47 -04:00 |
|
wangfei
|
72622926e5
|
Fix examples in docstring
|
2019-08-06 11:32:41 +08:00 |
|
wangfei
|
beb03ec6c5
|
Fix examples of loading pretrained models in docstring
|
2019-08-06 11:24:46 +08:00 |
|
thomwolf
|
bfbe52ec39
|
cleaning up example docstrings
|
2019-07-27 20:25:39 +02:00 |
|
thomwolf
|
0227b4a940
|
fix #827
|
2019-07-23 14:06:43 +02:00 |
|
thomwolf
|
f289e6cfe4
|
fix docstrings
|
2019-07-16 15:31:21 +02:00 |
|
thomwolf
|
f7cd7392fd
|
fixed tests
|
2019-07-15 12:32:19 +02:00 |
|
thomwolf
|
44c985facd
|
update doc for XLM and XLNet
|
2019-07-15 11:36:50 +02:00 |
|
thomwolf
|
0201d86015
|
added doc for transformer-xl
|
2019-07-15 10:11:09 +02:00 |
|
thomwolf
|
7d4b200e40
|
good quality generation example for GPT, GPT-2, Transfo-XL, XLNet
|
2019-07-13 15:25:03 +02:00 |
|
thomwolf
|
2918b7d2a0
|
updating tests
|
2019-07-12 10:57:58 +02:00 |
|
thomwolf
|
bd404735a7
|
embeddings resizing + tie_weights
|
2019-07-12 00:02:49 +02:00 |
|
Thomas Wolf
|
b87eb82b4f
|
Merge branch 'xlnet' into doc-sphinx
|
2019-07-11 15:46:27 +02:00 |
|
thomwolf
|
4fef5919a5
|
updating examples
|
2019-07-11 12:03:08 +02:00 |
|
LysandreJik
|
8fe2c9d98e
|
Refactored Docstrings of BERT, GPT2, GPT, TransfoXL, XLM and XLNet.
|
2019-07-09 15:55:31 -04:00 |
|
thomwolf
|
d5481cbe1b
|
adding tests to examples - updating summary module - coverage update
|
2019-07-09 15:29:42 +02:00 |
|
thomwolf
|
b19786985d
|
unified tokenizer api and serialization + tests
|
2019-07-09 10:25:18 +02:00 |
|
thomwolf
|
36bca545ff
|
tokenization abstract class - tests for examples
|
2019-07-05 15:02:59 +02:00 |
|
thomwolf
|
0bab55d5d5
|
[BIG] name change
|
2019-07-05 11:55:36 +02:00 |
|