Commit Graph

31 Commits

Author SHA1 Message Date
VictorSanh
5858e8e4dd Fix both loss and eval metrics -> more coherence on the loss (eval vs train and tf vs pt) 2018-11-03 16:48:24 -04:00
VictorSanh
cd09cd5b40 Fix import on initialization 2018-11-03 15:38:30 -04:00
thomwolf
139873f6e3 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 19:06:17 +01:00
thomwolf
04287a4d68 special edition script 2018-11-03 19:06:15 +01:00
VictorSanh
a1af5247e1 Add seed in initialization 2018-11-03 14:00:36 -04:00
Ubuntu
4faeb38b51 Fix loss logging for multi-gpu compatibility 2018-11-03 17:52:51 +00:00
thomwolf
cb76c1ddd3 add model.zero_grad() 2018-11-03 17:40:12 +01:00
thomwolf
a4086c5de5 Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT 2018-11-03 17:38:17 +01:00
thomwolf
088ad45888 fixing optimization 2018-11-03 17:38:15 +01:00
VictorSanh
5f432480c0 Create DataParallel model if several GPUs 2018-11-03 10:10:01 -04:00
VictorSanh
391a4ec2f3 Small typo in trange
I seriously don't understand why they defined num_train_epochs as a float in the original tf code.
I will change it at the end to avoid merge conflicts for now.
2018-11-03 08:25:15 -04:00
thomwolf
0d8d2285ba fix optimization_test 2018-11-03 12:23:00 +01:00
thomwolf
45efc9d807 removing f-string 2018-11-03 11:46:18 +01:00
Tim Rault
3ff2ec5eb3 Move command-line argparse arguments into main() function 2018-11-02 14:42:05 +01:00
VictorSanh
98b9771dfe Quick fix metrics evaluation on run_classif_pytorch 2018-11-02 03:02:06 -04:00
VictorSanh
b54de837c2 Quick fix on eval accuracy 2018-11-02 02:46:17 -04:00
VictorSanh
1d53f9cb72 Fix cuda compatibility for evaluation 2018-11-02 01:39:15 -04:00
VictorSanh
1d8511f8f2 FIX small bugs in run_classifier_pytorch.py 2018-11-02 01:12:52 -04:00
VictorSanh
936eb4c3ad FIX small bugs in run_classifier_pytorch.py 2018-11-02 01:11:25 -04:00
thomwolf
8e81e5e6ff working on squad 2018-11-02 04:07:52 +01:00
thomwolf
dee09a40b2 various fixes 2018-11-02 01:52:54 +01:00
thomwolf
2c731fd129 small tweaks 2018-11-02 01:38:22 +01:00
thomwolf
9343a2311b model training loop working – still have to check that everything is exactly same 2018-11-02 01:31:31 +01:00
thomwolf
f690f0e167 run_classifier WIP + added classifier head and initialization to the model 2018-11-02 00:27:50 +01:00
thomwolf
4a0b59e980 run_classifier WIP 2018-11-01 21:05:04 +01:00
VictorSanh
d3a8df6b9f typos in input_fn_builder 2018-11-01 14:17:55 -04:00
VictorSanh
836faed985 wip 2018-11-01 14:17:12 -04:00
VictorSanh
90d360a7a9 WIP 2018-11-01 12:09:23 -04:00
VictorSanh
555b7d66c9 input_fn_builder WIP 2018-11-01 02:10:46 -04:00
VictorSanh
f8e347b557 Convert all DataProcessors, _truncate_seq_pair and convert_examples_to_features 2018-11-01 01:33:01 -04:00
VictorSanh
b1dade34db Convert flags to argparse in run_classifier_pytorch.py 2018-11-01 01:05:11 -04:00