thomwolf
139873f6e3
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 19:06:17 +01:00
thomwolf
04287a4d68
special edition script
2018-11-03 19:06:15 +01:00
VictorSanh
a1af5247e1
Add seed in initialization
2018-11-03 14:00:36 -04:00
Ubuntu
4faeb38b51
Fix loss logging for multi-gpu compatibility
2018-11-03 17:52:51 +00:00
thomwolf
25f73add07
update optimizer run_squad
2018-11-03 17:56:34 +01:00
thomwolf
f514cbbf30
update run_squad with tqdm
2018-11-03 17:52:44 +01:00
thomwolf
cb76c1ddd3
add model.zero_grad()
2018-11-03 17:40:12 +01:00
thomwolf
a4086c5de5
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 17:38:17 +01:00
thomwolf
088ad45888
fixing optimization
2018-11-03 17:38:15 +01:00
VictorSanh
8bd6b235b7
typo on tokenization
2018-11-03 10:27:59 -04:00
VictorSanh
2c55568c40
scatter_ and scatter
2018-11-03 10:27:38 -04:00
VictorSanh
a6efe1235f
Merge pull request #1 from huggingface/multi-gpu-support
...
Create DataParallel model if several GPUs
2018-11-03 10:10:34 -04:00
VictorSanh
5f432480c0
Create DataParallel model if several GPUs
2018-11-03 10:10:01 -04:00
VictorSanh
5889765a7c
Update README.md
2018-11-03 09:18:44 -04:00
VictorSanh
8c932e37f9
Update the comparison notebook
2018-11-03 09:08:05 -04:00
VictorSanh
391a4ec2f3
Small typo in trange
...
I seriously don't understand why they defined num_train_epochs as a float in the original tf code.
I will change it at the end to avoid merge conflicts for now.
2018-11-03 08:25:15 -04:00
VictorSanh
5676d6f799
Remove BERT pretraining files for now
2018-11-03 08:17:22 -04:00
Tim Rault
8ec457d370
Fix imports
2018-11-03 12:35:21 +01:00
thomwolf
852e4b3c00
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 12:23:04 +01:00
thomwolf
0d8d2285ba
fix optimization_test
2018-11-03 12:23:00 +01:00
Tim Rault
574e20a92e
Move modeling_test.py to /tensorflow_code
2018-11-03 12:22:06 +01:00
thomwolf
45efc9d807
removing f-string
2018-11-03 11:46:18 +01:00
thomwolf
4df6027453
clean up
2018-11-03 11:33:10 +01:00
thomwolf
e6d106a01b
comment on gelu function
2018-11-03 10:47:10 +01:00
thomwolf
01b1a05343
Merge branch 'master' of https://github.com/huggingface/pytorch-pretrained-BERT
2018-11-03 03:11:15 +01:00
thomwolf
8aa22af0c5
fixing model
2018-11-03 03:11:13 +01:00
VictorSanh
72ab10399f
Fix loss
...
Please review @thomwolf, but I think this is equivalent (and it mimics the original loss computation)
2018-11-02 18:06:21 -04:00
VictorSanh
25d5ca48e0
Fix scatter LongTensor
2018-11-02 17:57:46 -04:00
VictorSanh
e6a710f684
device
2018-11-02 17:54:22 -04:00
VictorSanh
3ebf1a13c9
Fix loss computation for indexes bigger than max_seq_length.
2018-11-02 17:49:35 -04:00
Tim Rault
629bd006bf
Convert optimization_test.py to PyTorch
2018-11-02 17:50:17 +01:00
VictorSanh
38f740a1d5
Fix bug writing predictions in run_squad_pytorch
2018-11-02 11:29:45 -04:00
VictorSanh
ee29871f8d
Debug run_squad_pytorch
2018-11-02 11:07:32 -04:00
VictorSanh
101eabff90
Debug run_squad_pytorch
2018-11-02 10:44:08 -04:00
VictorSanh
bb0a510330
Print for debug run_squad
2018-11-02 10:16:07 -04:00
thomwolf
c84315ec35
model fixes + ipynb fixes
2018-11-02 15:11:16 +01:00
Tim Rault
3ff2ec5eb3
Move command-line argparse arguments into main() function
2018-11-02 14:42:05 +01:00
thomwolf
c9690e57f8
adding jupyter, updating extract features, adding simple test file
2018-11-02 14:25:21 +01:00
VictorSanh
844b2f0e6f
Small update Readme
2018-11-02 08:57:15 -04:00
Tim Rault
49006d0662
Bug fix type=bool -> action='store_true' in argparse
2018-11-02 10:04:41 +01:00
VictorSanh
beb59080b3
Fix size compatibility for model.forward
...
Error was coming from "modeling_pytorch.py", line 484, in forward: start_loss = loss_fct(start_logits, start_positions) --> ValueError: Expected target size (12, 1), got torch.Size([12])
2018-11-02 04:36:42 -04:00
VictorSanh
8cbe7d6af8
FIX errors in loading eval Dataset in run_squad_pytorch
2018-11-02 04:09:53 -04:00
VictorSanh
833c3a7a25
FIX errors in loading Dataset in run_squad_pytorch
2018-11-02 04:00:00 -04:00
VictorSanh
72d69a4ef4
Update README
2018-11-02 03:37:39 -04:00
VictorSanh
62ac7e9a60
Fix small bug in run_squad_pytorch.py
2018-11-02 03:32:35 -04:00
VictorSanh
98b9771dfe
Quick fix metrics evaluation on run_classif_pytorch
2018-11-02 03:02:06 -04:00
VictorSanh
bf65d4dbb7
Begin Updating the README.md
2018-11-02 02:51:07 -04:00
VictorSanh
b54de837c2
Quick fix on eval accuracy
2018-11-02 02:46:17 -04:00
VictorSanh
1d53f9cb72
Fix CUDA compatibility for evaluation
2018-11-02 01:39:15 -04:00
VictorSanh
1d8511f8f2
FIX small bugs in run_classifier_pytorch.py
2018-11-02 01:12:52 -04:00
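
Note on commit 49006d0662 (type=bool -> action='store_true' in argparse): the log does not show the changed lines, so the following is only a minimal sketch of the general pitfall that this kind of change usually addresses. The flag name `--do_lower_case` and both helper functions are hypothetical and chosen for illustration; they are not taken from the repository. With `type=bool`, argparse calls `bool()` on the raw string, and any non-empty string (including "False") is truthy.

```python
import argparse


def parse_with_type_bool(argv):
    """Parse a hypothetical flag declared with type=bool (the buggy pattern)."""
    parser = argparse.ArgumentParser()
    # bool("False") is True, because any non-empty string is truthy.
    parser.add_argument("--do_lower_case", type=bool, default=False)
    return parser.parse_args(argv).do_lower_case


def parse_with_store_true(argv):
    """Parse the same hypothetical flag declared with action='store_true'."""
    parser = argparse.ArgumentParser()
    # The flag takes no value: present means True, absent means False.
    parser.add_argument("--do_lower_case", action="store_true")
    return parser.parse_args(argv).do_lower_case


if __name__ == "__main__":
    # The user asks for False but gets True with type=bool.
    print(parse_with_type_bool(["--do_lower_case", "False"]))
    # With store_true, omitting the flag yields False, passing it yields True.
    print(parse_with_store_true([]))
    print(parse_with_store_true(["--do_lower_case"]))
```

With `action='store_true'`, the flag is switched on simply by passing it on the command line, which avoids the string-to-bool surprise entirely.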