thomwolf
|
2967de06f4
|
adding intialization to bert
|
2019-09-25 22:08:38 +02:00 |
|
thomwolf
|
3b7fb48c3b
|
fix loading from tf/pt
|
2019-09-25 17:46:16 +02:00 |
|
thomwolf
|
a049c8043b
|
push fix to training
|
2019-09-25 17:33:16 +02:00 |
|
thomwolf
|
de203853cc
|
docstring for xlnet
|
2019-09-24 15:30:55 +02:00 |
|
thomwolf
|
ee261439a9
|
add save_pretrained
|
2019-09-24 14:30:28 +02:00 |
|
thomwolf
|
f5397ffc3b
|
update loading logics
|
2019-09-24 14:03:58 +02:00 |
|
thomwolf
|
271f213621
|
updating to load tf model in pt - fixing headmasking test
|
2019-09-24 13:51:28 +02:00 |
|
thomwolf
|
465870c33f
|
Xlnet working - also added simple question answering model for XLNet
|
2019-09-10 16:44:41 +02:00 |
|
thomwolf
|
32aabe8c33
|
WIP XLNet
|
2019-09-10 12:17:18 +02:00 |
|
thomwolf
|
0537139b2b
|
removing tf.function
|
2019-09-09 14:47:31 +02:00 |
|
thomwolf
|
78b2a53f10
|
debug file download in tests error
|
2019-09-09 13:38:10 +02:00 |
|
thomwolf
|
6b3438df21
|
fixing GPT2 double head model and updating the torch version tests
|
2019-09-09 12:48:36 +02:00 |
|
thomwolf
|
b7175a2701
|
fixed imports in tests and gpt2 config test
|
2019-09-09 11:04:03 +02:00 |
|
thomwolf
|
728637356c
|
WIP GPT2
|
2019-09-09 10:18:55 +03:00 |
|
thomwolf
|
34f28b2a13
|
WIP GPT2
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
ad88563bda
|
WIP GPT-2
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
64d83c7ae0
|
WIP
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
01597e5b90
|
add tf auto models + tests
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
f5c698b21a
|
add weights tying, attention and hidden states output tests
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
d68a8fe462
|
add tf bert files
|
2019-09-08 15:02:06 +03:00 |
|