LysandreJik
c85b5db61a
Conditional append/init + fixed warning
2019-08-31 00:33:50 -04:00
LysandreJik
5c2b94c82a
Changed string so that Circle CI accepts the warning
2019-08-31 00:33:50 -04:00
LysandreJik
87747518e9
Blocks deletion from already deleted heads. Necessary integration test.
...
Now raises a warning when a head to be deleted already has been deleted. An integration test verifying the total pipeline (-> from config -> save model -> load model -> additional head pruning) has been added.
2019-08-31 00:33:50 -04:00
LysandreJik
719cb3738d
Pruning for GPT and GPT-2
2019-08-31 00:33:50 -04:00
LysandreJik
fc1fbae45d
XLM can be pruned
2019-08-31 00:33:50 -04:00
Lysandre
42e00cf9e1
Pruning saved to configuration first try
2019-08-31 00:33:50 -04:00
LysandreJik
d7a4c3252e
Fixed filename
2019-08-31 00:08:56 -04:00
LysandreJik
7f006cdd87
Set seed for head_masking test
2019-08-30 23:58:49 -04:00
Julien Chaumond
0fd0b674e6
[ci] legible output [skip ci]
2019-08-30 20:36:26 -04:00
Julien Chaumond
b65a994f59
[ci] decrease parallelism to increase success prob
2019-08-30 20:33:16 -04:00
Julien Chaumond
1d438f15b3
[XLNet] Use pytorch's layernorm like in BERT
...
See #1089
cc @thomwolf @lysandrejik
Also @dhpollack
2019-08-30 20:20:15 -04:00
Julien Chaumond
574c5b3a72
[RoBERTa] LayerNorm's eps is not a nn.Parameter so there's no point setting it on the model
...
Instead we correctly store it on the config
(regenerating the hosted config files)
cc @lysandrejik
2019-08-30 20:09:24 -04:00
LysandreJik
09363f2a8b
Fix documentation index
2019-08-30 19:48:32 -04:00
Thomas Wolf
51e980ce36
Merge pull request #1155 from anhnt170489/apex_fp16
...
Update apex fp16 implementation
2019-08-30 23:29:11 +02:00
Thomas Wolf
206c35e9a4
Merge pull request #1154 from ziliwang/master
...
fix: hard coding for max number
2019-08-30 23:23:08 +02:00
Thomas Wolf
f3d18c71ec
Merge pull request #1152 from epwalsh/fix-special-tokens
...
fix adding special tokens
2019-08-30 23:21:59 +02:00
Thomas Wolf
d483cd8e46
Merge pull request #1074 from huggingface/improved_testing
...
Shortcut to special tokens' ids - fix GPT2 & RoBERTa tokenizers - improved testing for GPT/GPT-2
2019-08-30 23:18:58 +02:00
Thomas Wolf
d2f21f08f5
Merge pull request #1092 from shijie-wu/xlm-tokenization
...
Added cleaned configuration properties for tokenizer with serialization - improve tokenization of XLM
2019-08-30 23:15:40 +02:00
Thomas Wolf
12b9cc9e26
Merge pull request #1110 from huggingface/automodels
...
Torch.hub now based on AutoModels - Updating AutoModels with AutoModelWithLMHead, Sequence Classification and Question Answering
2019-08-30 23:08:57 +02:00
thomwolf
bfe93a5a21
fix distilbert in auto tokenizer
2019-08-30 22:43:26 +02:00
thomwolf
256086bc69
clean up and simplify hubconf
2019-08-30 22:34:23 +02:00
thomwolf
80aa87d9a3
fix distilbert tokenizer
2019-08-30 22:24:23 +02:00
thomwolf
455a4c842c
add distilbert tokenizer
2019-08-30 22:20:51 +02:00
thomwolf
7a1f174a9d
update names of torch.hub to simpler names - update docstring
2019-08-30 22:20:44 +02:00
thomwolf
c665e0fcfe
Merge branch 'automodels' of https://github.com/huggingface/pytorch-transformers into automodels
2019-08-30 21:53:36 +02:00
LysandreJik
9b6e3b34d9
Docstrings
2019-08-30 14:09:02 -04:00
LysandreJik
dec8f4d6fd
Added DistilBERT models to all other AutoModels.
2019-08-30 13:52:18 -04:00
LysandreJik
bc29aa67a9
HubConf configuration
2019-08-30 12:48:55 -04:00
thomwolf
f35f612280
updating docstring for AutoModel
2019-08-30 12:48:55 -04:00
LysandreJik
7ca9653852
Pytorch Hub & AutoModels
2019-08-30 12:48:55 -04:00
LysandreJik
25e8389439
Tests for added AutoModels
2019-08-30 12:48:55 -04:00
LysandreJik
dc43215c01
Added multiple AutoModel classes: AutoModelWithLMHead, AutoModelForQuestionAnswering and AutoModelForSequenceClassification
2019-08-30 12:48:55 -04:00
VictorSanh
282c276e09
typos + file name coherence in distillation README
2019-08-30 12:02:29 -04:00
VictorSanh
803c1cc4ea
fix relative import bug cf Issue #1140
2019-08-30 12:01:27 -04:00
thomwolf
7044ed6b05
fix tokenizers serialization
2019-08-30 17:36:11 +02:00
Thomas Wolf
cd65c41a83
Merge branch 'master' into xlm-tokenization
2019-08-30 17:15:16 +02:00
thomwolf
69da972ace
added test and debug tokenizer configuration serialization
2019-08-30 17:09:36 +02:00
thomwolf
88111de07c
saving and reloading tokenizer configurations
2019-08-30 16:55:48 +02:00
Thomas Wolf
b66e9b4433
Merge pull request #1158 from rabeehk/master
...
regarding #1026 pull request
2019-08-30 16:30:33 +02:00
Thomas Wolf
0a2fecdf90
Merge branch 'master' into master
2019-08-30 16:30:08 +02:00
thomwolf
3871b8a107
adding xlm 17 and 100 models and config on aws
2019-08-30 16:28:42 +02:00
thomwolf
8678ff8df5
adding 17 and 100 xlm models
2019-08-30 16:26:04 +02:00
LysandreJik
e0caab0cf0
fix link
2019-08-30 10:09:17 -04:00
LysandreJik
a600b30cc3
Fix index number in documentation
2019-08-30 10:08:14 -04:00
LysandreJik
20c06fa37d
Added DistilBERT to documentation index
2019-08-30 10:06:51 -04:00
Rabeeh KARIMI
39eb31e11e
remove reloading tokenizer in the training, adding it to the evaluation part
2019-08-30 15:44:41 +02:00
Rabeeh KARIMI
350bb6bffa
updated tokenizer loading for addressing reproducibility issues
2019-08-30 15:34:28 +02:00
thomwolf
82462c5cba
Added option to setup pretrained tokenizer arguments
2019-08-30 15:30:41 +02:00
Thomas Wolf
41f35d0b3d
Merge pull request #1089 from dhpollack/dhp/use_pytorch_layernorm
...
change layernorm code to pytorch's native layer norm
2019-08-30 14:49:08 +02:00
Thomas Wolf
01ad55f8cf
Merge pull request #1026 from rabeehk/master
...
loads the tokenizer for each checkpoint, to solve the reproducability…
2019-08-30 14:15:36 +02:00