LysandreJik
|
e0caab0cf0
|
fix link
|
2019-08-30 10:09:17 -04:00 |
|
LysandreJik
|
a600b30cc3
|
Fix index number in documentation
|
2019-08-30 10:08:14 -04:00 |
|
LysandreJik
|
20c06fa37d
|
Added DistilBERT to documentation index
|
2019-08-30 10:06:51 -04:00 |
|
Thomas Wolf
|
41f35d0b3d
|
Merge pull request #1089 from dhpollack/dhp/use_pytorch_layernorm
change layernorm code to pytorch's native layer norm
|
2019-08-30 14:49:08 +02:00 |
|
Thomas Wolf
|
01ad55f8cf
|
Merge pull request #1026 from rabeehk/master
loads the tokenizer for each checkpoint, to solve the reproducability…
|
2019-08-30 14:15:36 +02:00 |
|
Thomas Wolf
|
f7978490b2
|
Merge pull request #1148 from huggingface/circleci
Documentation auto-deploy
|
2019-08-30 13:28:16 +02:00 |
|
LysandreJik
|
caf1d116a6
|
Closing bracket in DistilBERT's token count.
|
2019-08-29 15:30:10 -04:00 |
|
LysandreJik
|
e7fba4bef5
|
Documentation auto-deploy
|
2019-08-29 12:14:29 -04:00 |
|
Luis
|
fe8fb10b44
|
Small modification of comment in the run_glue.py example
Add RoBERTa to the comment as it was not explicit that RoBERTa don't use token_type_ids.
|
2019-08-29 14:43:30 +02:00 |
|
LysandreJik
|
bf3dc778b8
|
Changed learning rate for run_squad test
|
2019-08-28 18:24:43 -04:00 |
|
thomwolf
|
0a74c88ac6
|
fix #1131
|
2019-08-28 22:41:42 +02:00 |
|
Thomas Wolf
|
5f297c7be3
|
Merge pull request #1087 from huggingface/fix-warnings
Decode now calls private property instead of public method
|
2019-08-28 22:22:11 +02:00 |
|
Thomas Wolf
|
d9847678b3
|
Merge pull request #1136 from adai183/update_SQuAD_script
swap order of optimizer.step() and scheduler.step()
|
2019-08-28 22:00:52 +02:00 |
|
Thomas Wolf
|
0f8ad89206
|
Merge pull request #1135 from stefan-it/master
distilbert: fix number of hidden_size
|
2019-08-28 22:00:12 +02:00 |
|
LysandreJik
|
9ce42dc540
|
Pretrained models table fix
|
2019-08-28 13:56:28 -04:00 |
|
Andreas Daiminger
|
1d15a7f278
|
swap order of optimizer.step() and scheduler.step()
|
2019-08-28 19:18:27 +02:00 |
|
Stefan Schweter
|
ed2ab1c220
|
distilbert: fix number of hidden_size
|
2019-08-28 18:08:16 +02:00 |
|
Thomas Wolf
|
0ecfd17f49
|
Merge pull request #987 from huggingface/generative-finetuning
Generative finetuning
|
2019-08-28 16:51:50 +02:00 |
|
Thomas Wolf
|
50792dbdcc
|
Merge pull request #1127 from huggingface/dilbert
DilBERT
|
2019-08-28 16:43:09 +02:00 |
|
thomwolf
|
e7706f514b
|
update again
|
2019-08-28 16:37:22 +02:00 |
|
thomwolf
|
b5eb283aaa
|
update credits
|
2019-08-28 16:36:55 +02:00 |
|
LysandreJik
|
f753d4e32b
|
Removed typings for Python 2
|
2019-08-28 10:15:02 -04:00 |
|
LysandreJik
|
75bc2a03cc
|
Updated article link
|
2019-08-28 10:05:15 -04:00 |
|
LysandreJik
|
1dc43e56c9
|
Documentation additions
|
2019-08-28 09:37:27 -04:00 |
|
thomwolf
|
912a377e90
|
dilbert -> distilbert
|
2019-08-28 13:59:42 +02:00 |
|
thomwolf
|
c9bce1811c
|
fixing model to add torchscript, embedding resizing, head pruning and masking + tests
|
2019-08-28 13:22:45 +02:00 |
|
thomwolf
|
62df4ba59a
|
add dilbert tokenizer and tests
|
2019-08-28 12:22:56 +02:00 |
|
thomwolf
|
4ce5f36f78
|
update readmes
|
2019-08-28 12:14:31 +02:00 |
|
VictorSanh
|
a5fe16687b
|
fix typo
|
2019-08-28 07:22:54 +00:00 |
|
VictorSanh
|
497f73c964
|
add DilBERT to master REAME
|
2019-08-28 07:16:30 +00:00 |
|
VictorSanh
|
93e82ab424
|
Write README for DilBERT
|
2019-08-28 06:26:09 +00:00 |
|
VictorSanh
|
19b7c9b0b7
|
add DilBert model for squad
|
2019-08-28 06:25:44 +00:00 |
|
VictorSanh
|
fea921d382
|
add licensing
|
2019-08-28 04:45:39 +00:00 |
|
VictorSanh
|
da1e4e53fc
|
some fixes in train.py for loading previous checkpoint
|
2019-08-28 04:01:03 +00:00 |
|
VictorSanh
|
0d8f8848d5
|
add scripts/extract_for_distil.py
|
2019-08-28 04:00:19 +00:00 |
|
VictorSanh
|
7f2c384c80
|
add scripts/token_counts.py
|
2019-08-28 04:00:03 +00:00 |
|
VictorSanh
|
4d16b279e5
|
add scripts/binarized_data.py
|
2019-08-28 03:59:48 +00:00 |
|
LysandreJik
|
c513415b19
|
Dilbert tests from CommonTests
|
2019-08-27 23:59:00 -04:00 |
|
LysandreJik
|
778a263f09
|
GilBert added to AutoModels
|
2019-08-27 23:14:00 -04:00 |
|
VictorSanh
|
74d78beeb4
|
fix: add qa_dropout and seq_classif_dropout
|
2019-08-28 03:13:11 +00:00 |
|
VictorSanh
|
7f5d85347e
|
fix small typo
|
2019-08-28 02:44:51 +00:00 |
|
VictorSanh
|
906581ae3c
|
add s3 links for dilbert (+fix small typo)
|
2019-08-28 02:43:33 +00:00 |
|
VictorSanh
|
b247b0d880
|
add train.py for distillation
|
2019-08-28 02:12:47 +00:00 |
|
VictorSanh
|
780f183e55
|
add requirements
|
2019-08-28 01:39:52 +00:00 |
|
VictorSanh
|
e424d2e45d
|
add README
|
2019-08-28 01:10:10 +00:00 |
|
VictorSanh
|
1ae81e4aa1
|
add dataset. distiller, utils
|
2019-08-28 01:10:05 +00:00 |
|
VictorSanh
|
5d29f8e99b
|
fix bugs
|
2019-08-28 00:57:16 +00:00 |
|
VictorSanh
|
a8ad83040d
|
fix bugs
|
2019-08-28 00:45:33 +00:00 |
|
VictorSanh
|
60c984da6c
|
fix bugs
|
2019-08-27 22:25:55 +00:00 |
|
VictorSanh
|
42968138c8
|
wip wouf
|
2019-08-27 22:00:38 +00:00 |
|