Commit Graph

25 Commits

Author SHA1 Message Date
lukovnikov
b6c1cae67b branches, optim cosine fix 2019-03-18 13:32:04 +01:00
lukovnikov
20e652209c relation classification: replacing entity mention with mask token 2019-03-13 16:13:37 +01:00
lukovnikov
eac039d21f changing docker 2019-03-12 13:45:12 +01:00
lukovnikov
471daf1b6c changing docker 2019-03-12 13:32:42 +01:00
lukovnikov
9024613337 changing docker 2019-03-12 13:23:58 +01:00
lukovnikov
baf66d1419 restart cosine lr schedule 2019-03-12 13:22:23 +01:00
lukovnikov
51efde54a9 cos fix 2019-03-09 02:45:25 +01:00
lukovnikov
f113a2dfdc readme de 2019-03-09 02:29:57 +01:00
lukovnikov
90a41dbe14 BertAdam schedule objects 2019-03-09 02:23:20 +01:00
lukovnikov
88874f6cf0 BertAdam schedule objects 2019-03-08 19:08:30 +01:00
thomwolf
5c85fc3977 fix typo - logger info 2019-03-06 10:05:21 +01:00
lukovnikov
35410da758 added warning 2019-02-27 17:11:42 +01:00
lukovnikov
4d79e0d386 added warning 2019-02-27 16:50:05 +01:00
lukovnikov
66a84b63b0 added warning 2019-02-27 16:38:00 +01:00
lukovnikov
070f3b21d8 added warning 2019-02-27 16:26:45 +01:00
lukovnikov
46ef646016 added warning 2019-02-27 16:22:27 +01:00
lukovnikov
9bc3773c84 added warning 2019-02-27 16:10:31 +01:00
lukovnikov
60a372387f added warning 2019-02-27 15:54:09 +01:00
lukovnikov
da2d8ca265 fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly)
+ copied BERT optimization warmup functions to OpenAI optimization file + added comments
2019-02-26 17:16:06 +01:00
lukovnikov
e04bab59e1 fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly)
+ copied BERT optimization warmup functions to OpenAI optimization file + added comments
2019-02-26 16:22:52 +01:00
Deyu Fu
c8ea286048 change to apex for better fp16 and multi-gpu support 2018-12-11 17:13:58 -08:00
Li Li
81e1e2489f Fix optimizer to work with horovod 2018-12-10 02:08:38 -08:00
thomwolf
757750d6f6 fix tests 2018-11-17 11:58:14 +01:00
thomwolf
886cb49792 updating readme and notebooks 2018-11-16 14:31:15 +01:00
thomwolf
1de35b624b preparing for first release 2018-11-15 20:56:10 +01:00