thomwolf
|
5c85fc3977
|
fix typo - logger info
|
2019-03-06 10:05:21 +01:00 |
|
lukovnikov
|
35410da758
|
added warning
|
2019-02-27 17:11:42 +01:00 |
|
lukovnikov
|
4d79e0d386
|
added warning
|
2019-02-27 16:50:05 +01:00 |
|
lukovnikov
|
66a84b63b0
|
added warning
|
2019-02-27 16:38:00 +01:00 |
|
lukovnikov
|
070f3b21d8
|
added warning
|
2019-02-27 16:26:45 +01:00 |
|
lukovnikov
|
46ef646016
|
added warning
|
2019-02-27 16:22:27 +01:00 |
|
lukovnikov
|
9bc3773c84
|
added warning
|
2019-02-27 16:10:31 +01:00 |
|
lukovnikov
|
60a372387f
|
added warning
|
2019-02-27 15:54:09 +01:00 |
|
lukovnikov
|
da2d8ca265
|
fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly)
+ copied BERT optimization warmup functions to OpenAI optimization file + added comments
|
2019-02-26 17:16:06 +01:00 |
|
lukovnikov
|
e04bab59e1
|
fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly)
+ copied BERT optimization warmup functions to OpenAI optimization file + added comments
|
2019-02-26 16:22:52 +01:00 |
|
Deyu Fu
|
c8ea286048
|
change to apex for better fp16 and multi-gpu support
|
2018-12-11 17:13:58 -08:00 |
|
Li Li
|
81e1e2489f
|
Fix optimizer to work with horovod
|
2018-12-10 02:08:38 -08:00 |
|
thomwolf
|
757750d6f6
|
fix tests
|
2018-11-17 11:58:14 +01:00 |
|
thomwolf
|
886cb49792
|
updating readme and notebooks
|
2018-11-16 14:31:15 +01:00 |
|
thomwolf
|
1de35b624b
|
preparing for first release
|
2018-11-15 20:56:10 +01:00 |
|