Ubuntu
|
f18ae210e1
|
fix typo
|
2018-11-03 22:34:37 +00:00 |
|
thomwolf
|
04287a4d68
|
special edition script
|
2018-11-03 19:06:15 +01:00 |
|
VictorSanh
|
2c55568c40
|
scatter_ and scatter
|
2018-11-03 10:27:38 -04:00 |
|
thomwolf
|
e6d106a01b
|
comment on gelu function
|
2018-11-03 10:47:10 +01:00 |
|
thomwolf
|
01b1a05343
|
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
|
2018-11-03 03:11:15 +01:00 |
|
thomwolf
|
8aa22af0c5
|
fixing model
|
2018-11-03 03:11:13 +01:00 |
|
VictorSanh
|
72ab10399f
|
Fix loss
Please review @thomwolf, but I think this is equivalent (and it mimics the original loss computation)
|
2018-11-02 18:06:21 -04:00 |
|
VictorSanh
|
25d5ca48e0
|
Fix scatter LongTensor
|
2018-11-02 17:57:46 -04:00 |
|
VictorSanh
|
e6a710f684
|
device
|
2018-11-02 17:54:22 -04:00 |
|
VictorSanh
|
3ebf1a13c9
|
Fix loss computation for indexes bigger than max_seq_length.
|
2018-11-02 17:49:35 -04:00 |
|
thomwolf
|
c84315ec35
|
model fixes + ipnb fixes
|
2018-11-02 15:11:16 +01:00 |
|
thomwolf
|
e61db0d1c0
|
run_squad WIP
|
2018-11-02 03:56:14 +01:00 |
|
thomwolf
|
c0065af6cb
|
implemented BertForQuestionAnswering
|
2018-11-02 03:04:34 +01:00 |
|
thomwolf
|
dee09a40b2
|
various fixes
|
2018-11-02 01:52:54 +01:00 |
|
thomwolf
|
2c731fd129
|
small tweaks
|
2018-11-02 01:38:22 +01:00 |
|
thomwolf
|
9343a2311b
|
model training loop working – still have to check that everything is exactly the same
|
2018-11-02 01:31:31 +01:00 |
|
thomwolf
|
f690f0e167
|
run_classifier WIP + added classifier head and initialization to the model
|
2018-11-02 00:27:50 +01:00 |
|
thomwolf
|
4a0b59e980
|
run_classifier WIP
|
2018-11-01 21:05:04 +01:00 |
|
thomwolf
|
ab0e8932a8
|
conversion script WIP
|
2018-11-01 18:00:20 +01:00 |
|
thomwolf
|
c5d532e5f6
|
added conversion script
|
2018-11-01 17:40:05 +01:00 |
|
thomwolf
|
71557b165e
|
working on model
|
2018-11-01 14:48:15 +01:00 |
|
thomwolf
|
1b95daa0c1
|
model conversion WIP
|
2018-11-01 02:22:50 +01:00 |
|
thomwolf
|
da017ac910
|
adding pytorch model file
|
2018-10-31 19:44:49 +01:00 |
|