Matthew Carrigan
|
0540d360f2
|
Fixed logging
|
2019-03-20 15:36:51 +00:00 |
|
Matthew Carrigan
|
976554a472
|
First commit of the new LM finetuning
|
2019-03-20 14:23:51 +00:00 |
|
lukovnikov
|
262a9992d7
|
class weights
|
2019-03-18 18:29:12 +01:00 |
|
lukovnikov
|
19cc2c084e
|
same
|
2019-03-18 15:13:35 +01:00 |
|
lukovnikov
|
2283dcca5e
|
import revert
|
2019-03-18 13:40:12 +01:00 |
|
lukovnikov
|
b6c1cae67b
|
branches, optim cosine fix
|
2019-03-18 13:32:04 +01:00 |
|
lukovnikov
|
ef28b2c747
|
branches, optim cosine fix
|
2019-03-18 13:18:07 +01:00 |
|
lukovnikov
|
90430ae7ec
|
Merge remote-tracking branch 'origin/master'
# Conflicts:
# pytorch_pretrained_bert/optimization.py
|
2019-03-18 13:15:29 +01:00 |
|
lukovnikov
|
bed6408dcc
|
branches, optim cosine fix
|
2019-03-18 13:09:55 +01:00 |
|
Ananya Harsh Jha
|
e5b63fb542
|
Merge branch 'master' of https://github.com/ananyahjha93/pytorch-pretrained-BERT
pull current master to local
|
2019-03-17 08:30:13 -04:00 |
|
Ananya Harsh Jha
|
8a4e90ff40
|
corrected folder creation error for MNLI-MM, verified GLUE results
|
2019-03-17 08:16:50 -04:00 |
|
Ananya Harsh Jha
|
e0bf01d9a9
|
added hack for mismatched MNLI
|
2019-03-16 14:10:48 -04:00 |
|
Ananya Harsh Jha
|
4c721c6b6a
|
added eval time metrics for GLUE tasks
|
2019-03-15 23:21:24 -04:00 |
|
Thomas Wolf
|
f3e5404880
|
Merge pull request #381 from tseretelitornike/master
Added missing imports.
|
2019-03-15 12:54:40 +01:00 |
|
tseretelitornike
|
83857ffeaa
|
Added missing imports.
|
2019-03-15 12:45:48 +01:00 |
|
Thomas Wolf
|
d5c037c3ed
|
Merge pull request #380 from yongbowin/patch-3
typo in annotation
|
2019-03-14 15:56:40 +01:00 |
|
Yongbo Wang
|
d1e4fa98a9
|
typo in annotation
modify `heruistic` to `heuristic` in line 660, `charcter` to `character` in line 661.
|
2019-03-14 17:32:15 +08:00 |
|
Thomas Wolf
|
59e2bdd086
|
Merge pull request #379 from yongbowin/patch-2
typo
|
2019-03-14 10:17:18 +01:00 |
|
Yongbo Wang
|
3d6452163d
|
typo
modify `mull` to `null` in line 474 annotation.
|
2019-03-14 17:03:38 +08:00 |
|
Thomas Wolf
|
76906372b0
|
Merge pull request #378 from huggingface/absolute_imports
Add absolute imports to GPT, GPT-2, Transfo-XL and and fix empty nbest_predictions.json
|
2019-03-14 10:00:47 +01:00 |
|
thomwolf
|
a98dfe4ced
|
fixing #377 (empty nbest_predictions.json)
|
2019-03-14 09:57:06 +01:00 |
|
thomwolf
|
e5f2d9122c
|
adding absolute imports to gpt2, openai and transfo-xl
|
2019-03-14 09:55:01 +01:00 |
|
Ananya Harsh Jha
|
043c8781ef
|
added code for all glue task processors
|
2019-03-14 04:24:04 -04:00 |
|
Thomas Wolf
|
eecaaa734a
|
Merge pull request #371 from yongbowin/patch-1
Simplify code, delete redundancy line
|
2019-03-14 09:03:32 +01:00 |
|
lukovnikov
|
20e652209c
|
relation classification: replacing entity mention with mask token
|
2019-03-13 16:13:37 +01:00 |
|
Yongbo Wang
|
22a465a91f
|
Simplify code, delete redundancy line
delete redundancy line `if args.train`, simplify code.
|
2019-03-13 09:42:06 +08:00 |
|
lukovnikov
|
eac039d21f
|
changing docker
|
2019-03-12 13:45:12 +01:00 |
|
lukovnikov
|
471daf1b6c
|
changing docker
|
2019-03-12 13:32:42 +01:00 |
|
lukovnikov
|
9024613337
|
changing docker
|
2019-03-12 13:23:58 +01:00 |
|
lukovnikov
|
baf66d1419
|
restart cosine lr schedule
|
2019-03-12 13:22:23 +01:00 |
|
Thomas Wolf
|
9b03d67b83
|
Merge pull request #362 from Bharat123rox/patch-1
Make the hyperlink of NVIDIA Apex clickable
|
2019-03-11 09:08:51 +01:00 |
|
Thomas Wolf
|
8435d78f0c
|
Merge pull request #361 from junjieqian/jqian/updateReadme
Correct line number in README for classes
|
2019-03-11 09:08:27 +01:00 |
|
Thomas Wolf
|
80790705e0
|
Merge pull request #359 from elonmuskceo/fix-typo
Update run_gpt2.py
|
2019-03-11 09:07:56 +01:00 |
|
Thomas Wolf
|
13aa13dbc0
|
Merge pull request #358 from cdjhz/patch-1
add 'padding_idx=0' for BertEmbeddings
|
2019-03-11 09:06:55 +01:00 |
|
Thomas Wolf
|
c0660df5dd
|
Merge pull request #357 from pglock/feature/354-use-dropout-layer-gpt
Use Dropout Layer in OpenAIGPTMultipleChoiceHead
|
2019-03-11 09:06:27 +01:00 |
|
Bharat Raghunathan
|
f91ce0b803
|
Make the hyperlink of NVIDIA Apex clickable
|
2019-03-09 20:05:39 +05:30 |
|
lukovnikov
|
51efde54a9
|
cos fix
|
2019-03-09 02:45:25 +01:00 |
|
lukovnikov
|
f113a2dfdc
|
readme de
|
2019-03-09 02:29:57 +01:00 |
|
lukovnikov
|
90a41dbe14
|
BertAdam schedule objects
|
2019-03-09 02:23:20 +01:00 |
|
Junjie Qian
|
d648a02203
|
Correct line number in README for classes
|
2019-03-08 16:28:03 -08:00 |
|
lukovnikov
|
88874f6cf0
|
BertAdam schedule objects
|
2019-03-08 19:08:30 +01:00 |
|
Elon Musk
|
66d8206809
|
Update run_gpt2.py
|
2019-03-08 11:59:08 -05:00 |
|
Haozhe Ji
|
72fa8d03a7
|
add 'padding_idx=0' for BertEmbeddings
|
2019-03-07 20:02:55 +08:00 |
|
Philipp Glock
|
6190e8ce4c
|
Fix: use dropout layer
|
2019-03-07 10:12:45 +01:00 |
|
thomwolf
|
7cc35c3104
|
fix openai gpt example and updating readme
|
2019-03-06 11:43:21 +01:00 |
|
thomwolf
|
906b638efa
|
updating readme
|
2019-03-06 10:24:19 +01:00 |
|
thomwolf
|
994d86609b
|
fixing PYTORCH_PRETRAINED_BERT_CACHE use in examples
|
2019-03-06 10:21:24 +01:00 |
|
thomwolf
|
2dd8f524f5
|
removing test for long sequences error following #337
|
2019-03-06 10:10:41 +01:00 |
|
thomwolf
|
5c85fc3977
|
fix typo - logger info
|
2019-03-06 10:05:21 +01:00 |
|
Thomas Wolf
|
8e36da7acb
|
Merge pull request #347 from jplehmann/feature/sst2-processor
Processor for SST-2 task
|
2019-03-06 09:48:27 +01:00 |
|