Commit Graph

19227 Commits

Author SHA1 Message Date
Thomas Wolf
fa1aa81f26
fix typo in readme bach examples 2018-11-12 08:37:43 +01:00
Thomas Wolf
6d6b916f48
update to BERT-large results 2018-11-11 17:00:49 +01:00
Thomas Wolf
c4bfc646f5
Add results of fine-tuning BERT-large on GPUs 2018-11-11 16:59:35 +01:00
Thomas Wolf
48930a4cff
Merge pull request #2 from elyase/patch-1
Port tokenization for the multilingual model
2018-11-10 22:27:45 +01:00
thomwolf
a81a1ef8e9 fixing learning rate schedule when using gradient_accumulation_steps 2018-11-10 16:11:14 +01:00
thomwolf
ea85cca8ab adding optimize_on_cpu explanation in readme 2018-11-09 11:42:37 +01:00
thomwolf
5f04aa00ed option to perform optimization and keep the optimizer averages on CPU 2018-11-09 11:28:14 +01:00
thomwolf
9e95cd8cd6 clean up optimizer from unused functions 2018-11-09 11:23:55 +01:00
thomwolf
34a1a01091 update code comment 2018-11-09 09:31:20 +01:00
thomwolf
34bdc8b54f remove duplicate accumulate gradient step arguments 2018-11-09 09:19:45 +01:00
Thomas Wolf
0c24db9d5f
update results for SQuAD 2018-11-09 09:11:59 +01:00
thomwolf
2c5d993ba4 update readme - fix SQuAD model on multi-GPU 2018-11-08 21:22:22 +01:00
Gopal Krishna
4850ec5888 fixed small typos in the README.md (#8) 2018-11-08 15:00:02 -05:00
Thomas Wolf
3bfbc21376
updating pytest command 2018-11-08 00:44:17 +01:00
Thomas Wolf
0ed7696191
Updated MRPC results 2018-11-08 00:39:42 +01:00
thomwolf
48d4a5317c typo fix in output tuple 2018-11-07 23:51:12 +01:00
Thomas Wolf
d92a7f7721
Removing note on run_squad.py example 2018-11-07 23:37:55 +01:00
Thomas Wolf
5c0838d846
Merge pull request #7 from huggingface/develop
Develop
2018-11-07 23:36:46 +01:00
Thomas Wolf
efeb6b1a0d
Merge branch 'master' into develop 2018-11-07 23:35:42 +01:00
thomwolf
dbc318a4c6 cleaning up - speeding up a bit multi-gpu 2018-11-07 22:22:55 +01:00
thomwolf
6bb7510a50 fixing pre-processing bug - averaging loss for gradient accumulation - no_grad on evaluation 2018-11-07 22:12:41 +01:00
lukovnikov
bd91ae654f moved bert to qelos-util 2018-11-06 18:21:44 +01:00
lukovnikov
4e52188433 bert weight loading from tf 2018-11-06 17:47:03 +01:00
thomwolf
a1126237a9 clean up logits extraction logic 2018-11-06 17:31:15 +01:00
thomwolf
2a97fe220b fixing weights initialization in the model and out of span clamping 2018-11-06 17:26:33 +01:00
thomwolf
907d3569c1 cleaning up SQuAD notebook - more explanation - fixing error 2018-11-06 11:13:43 +01:00
Thomas Wolf
1a5bbd83dc
Updating run_squad information in readme 2018-11-06 08:53:01 +01:00
Thomas Wolf
79e1b95e75
fix link in readme 2018-11-06 08:38:02 +01:00
Knut Ole Sjøli
886f595c37 Fix typo in subheader (#4) 2018-11-05 18:34:18 -05:00
Yaser Martinez Palenzuela
4d124baf8f
Add test for Chinese tokenization 2018-11-05 23:04:29 +01:00
Thomas Wolf
59d4cc5f2b
typos 2018-11-05 22:47:24 +01:00
Yaser Martinez Palenzuela
0ce2f496dc
Port tokenization for the multilingual model 2018-11-05 22:34:12 +01:00
Thomas Wolf
d983eecdd3
more readme typo fixes 2018-11-05 21:29:04 +01:00
Thomas Wolf
8f91b4de91
more typo fixes 2018-11-05 21:24:14 +01:00
Thomas Wolf
7316b0d6d0
fix typo 2018-11-05 21:22:45 +01:00
Clement
d130cb5139
typos 2018-11-05 15:09:24 -05:00
Clement
2a8fee495b
typos 2018-11-05 15:04:06 -05:00
Clement
f968b11657
typo 2018-11-05 14:59:44 -05:00
thomwolf
88e793f31a fix typos 2018-11-05 16:14:19 +01:00
thomwolf
3914eed505 update readme 2018-11-05 16:09:27 +01:00
thomwolf
bab5d13077 update optimizer documentation 2018-11-05 16:09:21 +01:00
thomwolf
7394eb47a5 update readme 2018-11-05 15:35:44 +01:00
thomwolf
e6646751ac update notebooks 2018-11-05 15:02:50 +01:00
thomwolf
b705c9eff5 remove small script, moved notebooks to notebook folder 2018-11-05 14:55:08 +01:00
thomwolf
3a301d443b update gitignore 2018-11-05 14:53:43 +01:00
thomwolf
711d3f9f2b remove tensorflow_code 2018-11-05 14:53:03 +01:00
thomwolf
7875b1a8e0 notebook update 2018-11-05 14:50:44 +01:00
thomwolf
c3527cfbc4 ignore SQuAD targets outside of seq_length 2018-11-05 14:18:48 +01:00
thomwolf
1b99cdf71b script that use a small portion of squad only 2018-11-05 13:54:54 +01:00
thomwolf
2f4765d3ed fix multi-gpu squad loss 2018-11-05 13:46:14 +01:00