| Author | Commit | Message | Date |
|---|---|---|---|
| VictorSanh | 35ff345fc9 | update requirements | 2019-12-05 12:07:04 -05:00 |
| VictorSanh | 552c44a9b1 | release distilm-bert | 2019-12-05 10:14:58 -05:00 |
| Stefan Schweter | e7cf2ccd15 | distillation: add German distilbert model | 2019-11-19 19:55:19 +01:00 |
| Rémi Louf | 2276bf69b7 | update the examples, docs and template | 2019-11-14 20:38:02 +01:00 |
| thomwolf | 89d6272898 | Fix #1623 | 2019-11-04 16:21:12 +01:00 |
| Victor SANH | fa735208c9 | update readme - fix example command distil* | 2019-10-30 14:27:28 -04:00 |
| Thomas Wolf | 36174696cc | Merge branch 'master' into clean-roberta | 2019-10-30 16:51:06 +01:00 |
| VictorSanh | 5b6cafb11b | [release] fix table weirdness | 2019-10-23 10:35:16 -04:00 |
| VictorSanh | 8ad5c591cd | [RELEASE] DistilRoBERTa | 2019-10-23 10:29:47 -04:00 |
| Lysandre | 7d709e55ed | Remove | 2019-10-22 14:12:33 -04:00 |
| VictorSanh | d844db4005 | Add citation bibtex | 2019-10-11 16:55:42 -04:00 |
| Bilal Khan | 5ce8d29abe | Change tensorboard imports to use built-in tensorboard if available | 2019-10-08 16:29:43 -05:00 |
| VictorSanh | 7ce83b4931 | update weights for distilgpt2 | 2019-10-07 12:30:27 -04:00 |
| VictorSanh | f5891c3821 | run_squad --> run_squad_w_distillation | 2019-10-04 17:23:15 -04:00 |
| VictorSanh | 5f07d8f11a | prepare release | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | 35071007cb | incoming release 🔥 update links to arxiv preprint | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | 2a91f6071f | upddate README - TODO updadte link to paper | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | c51e533a5f | update train.py | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | a76c3f9cb0 | update requirements | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | bb9c5ead54 | update distiller | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | a12ab0a8db | update binarized_data | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | 4d6dfbd376 | update extract | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | 23edebc079 | update extract_distilbert | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | cbfcfce205 | update token_counts | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | 19e4ebbe3f | grouped_batch_sampler | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | 594202a934 | lm_seqs_dataset | 2019-10-03 10:27:11 -04:00 |
| VictorSanh | 38084507c4 | add distillation_configs | 2019-10-03 10:27:11 -04:00 |
| thomwolf | 481d9c4fb5 | Merge branch 'master' into tf2 | 2019-09-26 12:02:54 +02:00 |
| thomwolf | 31c23bd5ee | [BIG] pytorch-transformers => transformers | 2019-09-26 10:15:53 +02:00 |
| VictorSanh | 9f995b99d4 | minor fixes | 2019-09-19 21:36:06 +00:00 |
| VictorSanh | 354944e607 | [distillation] big update w/ new weights | 2019-09-19 19:25:21 +00:00 |
| VictorSanh | 32e1332acf | [distil] fix once for all general logger for scripts | 2019-09-11 14:19:07 +00:00 |
| VictorSanh | 364920e216 | fix small bug/typo | 2019-09-10 21:45:01 +00:00 |
| VictorSanh | a95ced6260 | [Distillation] save last chkpt as pytorch_model.bin | 2019-09-09 19:53:35 +00:00 |
| VictorSanh | dddd6b9927 | Update DistilBERT training code | 2019-09-05 18:26:14 +00:00 |
| Stefan Schweter | a1c34bd286 | distillation: fix ModuleNotFoundError error in token counts script | 2019-08-31 12:21:38 +02:00 |
| VictorSanh | 282c276e09 | typos + file name coherence in distillation README | 2019-08-30 12:02:29 -04:00 |
| VictorSanh | 803c1cc4ea | fix relative import bug cf Issue #1140 | 2019-08-30 12:01:27 -04:00 |
| LysandreJik | caf1d116a6 | Closing bracket in DistilBERT's token count. | 2019-08-29 15:30:10 -04:00 |
| thomwolf | b5eb283aaa | update credits | 2019-08-28 16:36:55 +02:00 |
| thomwolf | 912a377e90 | dilbert -> distilbert | 2019-08-28 13:59:42 +02:00 |
| thomwolf | 4ce5f36f78 | update readmes | 2019-08-28 12:14:31 +02:00 |
| VictorSanh | 93e82ab424 | Write README for DilBERT | 2019-08-28 06:26:09 +00:00 |
| VictorSanh | fea921d382 | add licensing | 2019-08-28 04:45:39 +00:00 |
| VictorSanh | da1e4e53fc | some fixes in train.py for loading previous checkpoint | 2019-08-28 04:01:03 +00:00 |
| VictorSanh | 0d8f8848d5 | add scripts/extract_for_distil.py | 2019-08-28 04:00:19 +00:00 |
| VictorSanh | 7f2c384c80 | add scripts/token_counts.py | 2019-08-28 04:00:03 +00:00 |
| VictorSanh | 4d16b279e5 | add scripts/binarized_data.py | 2019-08-28 03:59:48 +00:00 |
| VictorSanh | b247b0d880 | add train.py for distillation | 2019-08-28 02:12:47 +00:00 |
| VictorSanh | 780f183e55 | add requirements | 2019-08-28 01:39:52 +00:00 |