mirror of https://github.com/huggingface/transformers.git
update to BERT-large results
This commit is contained in:
parent c4bfc646f5
commit 6d6b916f48
@@ -206,7 +206,7 @@ Training with the previous hyper-parameters gave us the following results:
 The options we list above allow to fine-tune BERT-large rather easily on GPU(s) instead of the TPU used by the original implementation.
 
-For example, fine-tuning BERT-large on SQuAD can be done on a server with 4 k-80 (these are pretty old now) in 18 hours. Our results are similar to the TensorFlow implementation results:
+For example, fine-tuning BERT-large on SQuAD can be done on a server with 4 k-80 (these are pretty old now) in 18 hours. Our results are similar to the TensorFlow implementation results (actually slightly higher):
 
 ```bash
 {"exact_match": 84.56953642384106, "f1": 91.04028647786927}
 ```
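For context on the numbers in the changed hunk: `exact_match` is the fraction of questions whose normalized predicted answer exactly equals a normalized reference answer, and `f1` is the token-overlap F1 between prediction and reference, both averaged over the SQuAD dev set (taking the best score over the available reference answers per question) and reported as percentages. The sketch below follows the logic of the official SQuAD v1.1 evaluation script; it is illustrative, not the code from this repository.

```python
import re
import string
from collections import Counter

def normalize_answer(s):
    """Lowercase, strip punctuation and articles, collapse whitespace
    (the normalization used by the SQuAD v1.1 evaluator)."""
    s = s.lower()
    s = "".join(ch for ch in s if ch not in set(string.punctuation))
    s = re.sub(r"\b(a|an|the)\b", " ", s)
    return " ".join(s.split())

def exact_match_score(prediction, ground_truth):
    # 1.0 when the normalized strings are identical, else 0.0.
    return float(normalize_answer(prediction) == normalize_answer(ground_truth))

def f1_score(prediction, ground_truth):
    # Token-level overlap F1 between prediction and reference.
    pred_tokens = normalize_answer(prediction).split()
    gold_tokens = normalize_answer(ground_truth).split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

def metric_max_over_ground_truths(metric_fn, prediction, ground_truths):
    # Each question may have several reference answers; score the best match.
    return max(metric_fn(prediction, gt) for gt in ground_truths)

# Example: a close-but-not-exact prediction.
print(exact_match_score("the Eiffel Tower", "Eiffel Tower"))  # 1.0 (articles stripped)
print(f1_score("Eiffel Tower in Paris", "Eiffel Tower"))      # ~0.667
```

Averaging these per-question scores over the dev set and multiplying by 100 yields numbers on the scale of the `{"exact_match": 84.57, "f1": 91.04}` result in the diff.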