Commit Graph

20 Commits

Author  SHA1  Message  Date
thomwolf  2967de06f4  adding intialization to bert  2019-09-25 22:08:38 +02:00
thomwolf  3b7fb48c3b  fix loading from tf/pt  2019-09-25 17:46:16 +02:00
thomwolf  a049c8043b  push fix to training  2019-09-25 17:33:16 +02:00
thomwolf  de203853cc  docstring for xlnet  2019-09-24 15:30:55 +02:00
thomwolf  ee261439a9  add save_pretrained  2019-09-24 14:30:28 +02:00
thomwolf  f5397ffc3b  update loading logics  2019-09-24 14:03:58 +02:00
thomwolf  271f213621  updating to load tf model in pt - fixing headmasking test  2019-09-24 13:51:28 +02:00
thomwolf  465870c33f  Xlnet working - also added simple question answering model for XLNet  2019-09-10 16:44:41 +02:00
thomwolf  32aabe8c33  WIP XLNet  2019-09-10 12:17:18 +02:00
thomwolf  0537139b2b  removing tf.function  2019-09-09 14:47:31 +02:00
thomwolf  78b2a53f10  debug file download in tests error  2019-09-09 13:38:10 +02:00
thomwolf  6b3438df21  fixing GPT2 double head model and updating the torch version tests  2019-09-09 12:48:36 +02:00
thomwolf  b7175a2701  fixed imports in tests and gpt2 config test  2019-09-09 11:04:03 +02:00
thomwolf  728637356c  WIP GPT2  2019-09-09 10:18:55 +03:00
thomwolf  34f28b2a13  WIP GPT2  2019-09-08 15:02:06 +03:00
thomwolf  ad88563bda  WIP GPT-2  2019-09-08 15:02:06 +03:00
thomwolf  64d83c7ae0  WIP  2019-09-08 15:02:06 +03:00
thomwolf  01597e5b90  add tf auto models + tests  2019-09-08 15:02:06 +03:00
thomwolf  f5c698b21a  add weights tying, attention and hidden states output tests  2019-09-08 15:02:06 +03:00
thomwolf  d68a8fe462  add tf bert files  2019-09-08 15:02:06 +03:00