thomwolf | 5c85fc3977 | fix typo - logger info | 2019-03-06 10:05:21 +01:00
lukovnikov | 35410da758 | added warning | 2019-02-27 17:11:42 +01:00
lukovnikov | 4d79e0d386 | added warning | 2019-02-27 16:50:05 +01:00
lukovnikov | 66a84b63b0 | added warning | 2019-02-27 16:38:00 +01:00
lukovnikov | 070f3b21d8 | added warning | 2019-02-27 16:26:45 +01:00
lukovnikov | 46ef646016 | added warning | 2019-02-27 16:22:27 +01:00
lukovnikov | 60a372387f | added warning | 2019-02-27 15:54:09 +01:00
lukovnikov | da2d8ca265 | fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly) + copied BERT optimization warmup functions to OpenAI optimization file + added comments | 2019-02-26 17:16:06 +01:00
lukovnikov | e04bab59e1 | fix for negative learning rate with warmup_linear in BertAdam (happens when t_total is specified incorrectly) + copied BERT optimization warmup functions to OpenAI optimization file + added comments | 2019-02-26 16:22:52 +01:00
thomwolf | eed51c5bdf | add OpenAI GPT | 2019-01-08 12:26:58 +01:00
thomwolf | 93f563b8a8 | adding OpenAI GPT | 2019-01-07 12:55:36 +01:00