Commit Graph

20 Commits

Author SHA1 Message Date
thomwolf
e6d106a01b comment on gelu function 2018-11-03 10:47:10 +01:00
thomwolf
01b1a05343 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 03:11:15 +01:00
thomwolf
8aa22af0c5 fixing model 2018-11-03 03:11:13 +01:00
VictorSanh
72ab10399f Fix loss
Please review @thomwolf, but I think this is equivalent (and it mimics the loss computation of the original loss)
2018-11-02 18:06:21 -04:00
VictorSanh
25d5ca48e0 Fix scatter LongTensor 2018-11-02 17:57:46 -04:00
VictorSanh
e6a710f684 device 2018-11-02 17:54:22 -04:00
VictorSanh
3ebf1a13c9 Fix loss computation for indexes bigger than max_seq_length. 2018-11-02 17:49:35 -04:00
thomwolf
c84315ec35 model fixes + ipnb fixes 2018-11-02 15:11:16 +01:00
thomwolf
e61db0d1c0 run_squad WIP 2018-11-02 03:56:14 +01:00
thomwolf
c0065af6cb implemented BertForQuestionAnswering 2018-11-02 03:04:34 +01:00
thomwolf
dee09a40b2 various fixes 2018-11-02 01:52:54 +01:00
thomwolf
2c731fd129 small tweaks 2018-11-02 01:38:22 +01:00
thomwolf
9343a2311b model training loop working – still have to check that everything is exactly the same 2018-11-02 01:31:31 +01:00
thomwolf
f690f0e167 run_classifier WIP + added classifier head and initialization to the model 2018-11-02 00:27:50 +01:00
thomwolf
4a0b59e980 run_classifier WIP 2018-11-01 21:05:04 +01:00
thomwolf
ab0e8932a8 conversion script WIP 2018-11-01 18:00:20 +01:00
thomwolf
c5d532e5f6 added conversion script 2018-11-01 17:40:05 +01:00
thomwolf
71557b165e working on model 2018-11-01 14:48:15 +01:00
thomwolf
1b95daa0c1 model conversion WIP 2018-11-01 02:22:50 +01:00
thomwolf
da017ac910 adding pytorch model file 2018-10-31 19:44:49 +01:00