thomwolf
9d0a11a68c
update dependencies and circle-ci
2019-09-08 15:02:06 +03:00
thomwolf
24a20483f5
update conversion script names
2019-09-08 15:02:06 +03:00
thomwolf
6f152572cd
add conversion script, rename conversion scripts
2019-09-08 15:02:06 +03:00
thomwolf
a4704b1263
skipping tf tests if tf is not installed
2019-09-08 15:02:06 +03:00
thomwolf
ad0ab9afe9
fix test when tf is not here
2019-09-08 15:02:06 +03:00
thomwolf
59fe641b8b
also gathering file names in file_utils
2019-09-08 15:02:06 +03:00
thomwolf
d68a8fe462
add tf bert files
2019-09-08 15:02:06 +03:00
thomwolf
7ae642b72d
update conversion scripts
2019-09-08 15:02:06 +03:00
thomwolf
69bff89935
clean ups
2019-09-08 15:02:06 +03:00
thomwolf
1efb1f1660
split configuration and modeling files
2019-09-08 15:02:06 +03:00
thomwolf
1eb125fb95
be sure we have uint8
2019-09-08 15:02:06 +03:00
thomwolf
7fba47b7d9
WIP reordering
2019-09-04 22:39:23 +02:00
thomwolf
e25cba78cf
WIP reodering arguments for torchscript and TF
2019-09-04 22:39:23 +02:00
thomwolf
38b79b5a63
Fixing this TransformerXL bool issue
2019-09-04 22:36:30 +02:00
LysandreJik
0b52642d37
1.2.0 in docs
2019-09-04 11:03:32 -04:00
thomwolf
89fd3450a6
Release: 1.2.0
2019-09-04 13:32:18 +02:00
Thomas Wolf
9fd6e7ab9f
Merge pull request #1190 from shijie-wu/xlm-tokenization
...
Fix reference of import in XLM tokenization
2019-09-04 12:50:49 +02:00
Shijie Wu
a15562e170
Fix reference of import when called for the second time
2019-09-03 18:27:29 -07:00
Thomas Wolf
0287d264e9
Merge pull request #1162 from huggingface/xlnet-bias
...
XLNet bias fix on resize embeddings (cf #1124 )
2019-09-02 23:14:04 +02:00
LysandreJik
7f522437bc
Updated documentation for LM finetuning script
2019-09-02 13:40:25 -04:00
LysandreJik
3fbf301bba
[CI] Updated resource size for python 3 tests
2019-09-02 12:35:14 -04:00
Julien Chaumond
2dcc5a1629
[doc] Add blurb about large-scale model downloads
...
cc @n1t0 @lysandrejik @thomwolf
2019-09-02 12:27:11 -04:00
Thomas Wolf
7b0c99add9
Merge pull request #1174 from huggingface/fix_byte_level_added_tokens
...
Fix byte-level BPE decoding error when using added tokens
2019-09-02 09:01:16 +02:00
LysandreJik
31d3373bc9
Appends space before special token
2019-09-01 21:07:00 -04:00
thomwolf
fede4ef45d
fixing #1133
2019-09-02 02:27:39 +02:00
Thomas Wolf
b6cd856b08
Merge pull request #1164 from stefan-it/master
...
distillation: fix ModuleNotFoundError error in token counts script
2019-09-02 02:00:07 +02:00
Thomas Wolf
ff7368eb6b
Merge pull request #1077 from huggingface/pruning-save-and-load
...
Pruning changes so that deleted heads are kept on save/load
2019-09-01 09:42:15 +02:00
LysandreJik
6ae0bb5291
XLM 100 different URLs
2019-08-31 14:46:31 -04:00
LysandreJik
819b468f70
Fixed XLM model url
2019-08-31 14:40:51 -04:00
Stefan Schweter
a1c34bd286
distillation: fix ModuleNotFoundError error in token counts script
2019-08-31 12:21:38 +02:00
LysandreJik
ea86bef545
Check for None
2019-08-31 00:56:22 -04:00
LysandreJik
e0f867a9ba
XLNet bias fix on resize embeddings (cf #1124 )
2019-08-31 00:50:59 -04:00
LysandreJik
11600edc6e
Rebase on master + DistilBERT head pruning patch
2019-08-31 00:37:41 -04:00
LysandreJik
b6992b7b47
Applied patch to OpenAI GPT, RoBERTa, TransfoL, XLM and XLNet
2019-08-31 00:33:50 -04:00
thomwolf
bdb4409ed8
updated pruning logic with sets - Bert and GPT-2
2019-08-31 00:33:50 -04:00
LysandreJik
0c8e823b03
Added patch to remaining models
2019-08-31 00:33:50 -04:00
LysandreJik
0cd283522a
Attempt to fix head index
2019-08-31 00:33:50 -04:00
LysandreJik
c85b5db61a
Conditional append/init + fixed warning
2019-08-31 00:33:50 -04:00
LysandreJik
5c2b94c82a
Changed string so that Circle CI accepts the warning
2019-08-31 00:33:50 -04:00
LysandreJik
87747518e9
Blocks deletion from already deleted heads. Necessary integration test.
...
Now raises a warning when a head to be deleted already has been deleted. An integration test verifying the total pipeline (-> from config -> save model -> load model -> additional head pruning) has been added.
2019-08-31 00:33:50 -04:00
LysandreJik
719cb3738d
Pruning for GPT and GPT-2
2019-08-31 00:33:50 -04:00
LysandreJik
fc1fbae45d
XLM can be pruned
2019-08-31 00:33:50 -04:00
Lysandre
42e00cf9e1
Pruning saved to configuration first try
2019-08-31 00:33:50 -04:00
LysandreJik
d7a4c3252e
Fixed filename
2019-08-31 00:08:56 -04:00
LysandreJik
7f006cdd87
Set seed for head_masking test
2019-08-30 23:58:49 -04:00
Julien Chaumond
0fd0b674e6
[ci] legible output [skip ci]
2019-08-30 20:36:26 -04:00
Julien Chaumond
b65a994f59
[ci] decrease parallelism to increase success prob
2019-08-30 20:33:16 -04:00
Julien Chaumond
1d438f15b3
[XLNet] Use pytorch's layernorm like in BERT
...
See #1089
cc @thomwolf @lysandrejik
Also @dhpollack
2019-08-30 20:20:15 -04:00
Julien Chaumond
574c5b3a72
[RoBERTa] LayerNorm's eps is not a nn.Parameter so there's no point setting it on the model
...
Instead we correctly store it on the config
(regenerating the hosted config files)
cc @lysandrejik
2019-08-30 20:09:24 -04:00
LysandreJik
09363f2a8b
Fix documentation index
2019-08-30 19:48:32 -04:00