VictorSanh
|
f5891c3821
|
run_squad --> run_squad_w_distillation
|
2019-10-04 17:23:15 -04:00 |
|
VictorSanh
|
5f07d8f11a
|
prepare release
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
35071007cb
|
incoming release 🔥 update links to arxiv preprint
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
2a91f6071f
|
upddate README - TODO updadte link to paper
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
c51e533a5f
|
update train.py
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
a76c3f9cb0
|
update requirements
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
bb9c5ead54
|
update distiller
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
a12ab0a8db
|
update binarized_data
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
4d6dfbd376
|
update extract
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
23edebc079
|
update extract_distilbert
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
cbfcfce205
|
update token_counts
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
19e4ebbe3f
|
grouped_batch_sampler
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
594202a934
|
lm_seqs_dataset
|
2019-10-03 10:27:11 -04:00 |
|
VictorSanh
|
38084507c4
|
add distillation_configs
|
2019-10-03 10:27:11 -04:00 |
|
thomwolf
|
481d9c4fb5
|
Merge branch 'master' into tf2
|
2019-09-26 12:02:54 +02:00 |
|
thomwolf
|
31c23bd5ee
|
[BIG] pytorch-transformers => transformers
|
2019-09-26 10:15:53 +02:00 |
|
VictorSanh
|
9f995b99d4
|
minor fixes
|
2019-09-19 21:36:06 +00:00 |
|
VictorSanh
|
354944e607
|
[distillation] big update w/ new weights
|
2019-09-19 19:25:21 +00:00 |
|
VictorSanh
|
32e1332acf
|
[distil] fix once for all general logger for scripts
|
2019-09-11 14:19:07 +00:00 |
|
VictorSanh
|
364920e216
|
fix small bug/typo
|
2019-09-10 21:45:01 +00:00 |
|
VictorSanh
|
a95ced6260
|
[Distillation] save last chkpt as pytorch_model.bin
|
2019-09-09 19:53:35 +00:00 |
|
VictorSanh
|
dddd6b9927
|
Update DistilBERT training code
|
2019-09-05 18:26:14 +00:00 |
|
Stefan Schweter
|
a1c34bd286
|
distillation: fix ModuleNotFoundError error in token counts script
|
2019-08-31 12:21:38 +02:00 |
|
VictorSanh
|
282c276e09
|
typos + file name coherence in distillation README
|
2019-08-30 12:02:29 -04:00 |
|
VictorSanh
|
803c1cc4ea
|
fix relative import bug cf Issue #1140
|
2019-08-30 12:01:27 -04:00 |
|
LysandreJik
|
caf1d116a6
|
Closing bracket in DistilBERT's token count.
|
2019-08-29 15:30:10 -04:00 |
|
thomwolf
|
b5eb283aaa
|
update credits
|
2019-08-28 16:36:55 +02:00 |
|
thomwolf
|
912a377e90
|
dilbert -> distilbert
|
2019-08-28 13:59:42 +02:00 |
|
thomwolf
|
4ce5f36f78
|
update readmes
|
2019-08-28 12:14:31 +02:00 |
|
VictorSanh
|
93e82ab424
|
Write README for DilBERT
|
2019-08-28 06:26:09 +00:00 |
|
VictorSanh
|
fea921d382
|
add licensing
|
2019-08-28 04:45:39 +00:00 |
|
VictorSanh
|
da1e4e53fc
|
some fixes in train.py for loading previous checkpoint
|
2019-08-28 04:01:03 +00:00 |
|
VictorSanh
|
0d8f8848d5
|
add scripts/extract_for_distil.py
|
2019-08-28 04:00:19 +00:00 |
|
VictorSanh
|
7f2c384c80
|
add scripts/token_counts.py
|
2019-08-28 04:00:03 +00:00 |
|
VictorSanh
|
4d16b279e5
|
add scripts/binarized_data.py
|
2019-08-28 03:59:48 +00:00 |
|
VictorSanh
|
b247b0d880
|
add train.py for distillation
|
2019-08-28 02:12:47 +00:00 |
|
VictorSanh
|
780f183e55
|
add requirements
|
2019-08-28 01:39:52 +00:00 |
|
VictorSanh
|
e424d2e45d
|
add README
|
2019-08-28 01:10:10 +00:00 |
|
VictorSanh
|
1ae81e4aa1
|
add dataset. distiller, utils
|
2019-08-28 01:10:05 +00:00 |
|