VictorSanh
|
04b50cabf6
|
gitignore
|
2019-09-05 18:49:28 +00:00 |
|
VictorSanh
|
dddd6b9927
|
Update DistilBERT training code
|
2019-09-05 18:26:14 +00:00 |
|
Julien Chaumond
|
f9453d15e5
|
Fix broken link
|
2019-09-05 12:35:22 -04:00 |
|
Julien Chaumond
|
f7ee2e5d20
|
[README] link to Write With Transformer
|
2019-09-05 12:33:46 -04:00 |
|
maru0kun
|
d737947725
|
Fix typo
|
2019-09-05 19:24:57 +09:00 |
|
thomwolf
|
705237b4ec
|
add tf auto models + tests
|
2019-09-05 12:21:08 +02:00 |
|
thomwolf
|
600a42329b
|
add weights tying, attention and hidden states output tests
|
2019-09-05 12:02:14 +02:00 |
|
thomwolf
|
04d2006f28
|
skip transfo-xl tokenizer tests with tf for now
|
2019-09-05 11:22:13 +02:00 |
|
thomwolf
|
7f6a0c0d69
|
no pytest version checking
|
2019-09-05 11:20:56 +02:00 |
|
thomwolf
|
7c0baf9521
|
test suite independent of framework
|
2019-09-05 11:18:55 +02:00 |
|
thomwolf
|
7775a3d2ed
|
update dependencies and circle-ci
|
2019-09-05 10:23:04 +02:00 |
|
thomwolf
|
33dd59e971
|
update conversion script names
|
2019-09-05 03:13:26 +02:00 |
|
thomwolf
|
5951d86024
|
add conversion script, rename conversion scripts
|
2019-09-05 03:10:11 +02:00 |
|
thomwolf
|
aa4c8804f2
|
skipping tf tests if tf is not installed
|
2019-09-05 03:06:09 +02:00 |
|
thomwolf
|
134847db81
|
fix test when tf is not here
|
2019-09-05 02:53:52 +02:00 |
|
thomwolf
|
981f7f5253
|
Merge branch 'tf2' of https://github.com/huggingface/pytorch-transformers into tf2
|
2019-09-05 02:34:52 +02:00 |
|
thomwolf
|
bffd17a43d
|
add tf bert files
|
2019-09-05 02:34:44 +02:00 |
|
thomwolf
|
85df4f7cca
|
also gathering file names in file_utils
|
2019-09-05 02:34:09 +02:00 |
|
thomwolf
|
11fae9e636
|
add tf bert files
|
2019-09-05 02:27:39 +02:00 |
|
thomwolf
|
121f88cae3
|
update conversion scripts
|
2019-09-05 02:17:50 +02:00 |
|
thomwolf
|
d77abd4d08
|
clean ups
|
2019-09-05 00:41:24 +02:00 |
|
thomwolf
|
2a667b1eb9
|
split configuration and modeling files
|
2019-09-05 00:27:11 +02:00 |
|
thomwolf
|
0be6a2a624
|
be sure we have uint8
|
2019-09-04 22:47:38 +02:00 |
|
thomwolf
|
7fba47b7d9
|
WIP reordering
|
2019-09-04 22:39:23 +02:00 |
|
thomwolf
|
e25cba78cf
|
WIP reodering arguments for torchscript and TF
|
2019-09-04 22:39:23 +02:00 |
|
thomwolf
|
38b79b5a63
|
Fixing this TransformerXL bool issue
|
2019-09-04 22:36:30 +02:00 |
|
LysandreJik
|
0b52642d37
|
1.2.0 in docs
|
2019-09-04 11:03:32 -04:00 |
|
thomwolf
|
89fd3450a6
|
Release: 1.2.0
|
2019-09-04 13:32:18 +02:00 |
|
Thomas Wolf
|
9fd6e7ab9f
|
Merge pull request #1190 from shijie-wu/xlm-tokenization
Fix reference of import in XLM tokenization
|
2019-09-04 12:50:49 +02:00 |
|
Shijie Wu
|
a15562e170
|
Fix reference of import when called for the second time
|
2019-09-03 18:27:29 -07:00 |
|
Thomas Wolf
|
0287d264e9
|
Merge pull request #1162 from huggingface/xlnet-bias
XLNet bias fix on resize embeddings (cf #1124)
|
2019-09-02 23:14:04 +02:00 |
|
LysandreJik
|
7f522437bc
|
Updated documentation for LM finetuning script
|
2019-09-02 13:40:25 -04:00 |
|
LysandreJik
|
3fbf301bba
|
[CI] Updated resource size for python 3 tests
|
2019-09-02 12:35:14 -04:00 |
|
Julien Chaumond
|
2dcc5a1629
|
[doc] Add blurb about large-scale model downloads
cc @n1t0 @lysandrejik @thomwolf
|
2019-09-02 12:27:11 -04:00 |
|
Thomas Wolf
|
7b0c99add9
|
Merge pull request #1174 from huggingface/fix_byte_level_added_tokens
Fix byte-level BPE decoding error when using added tokens
|
2019-09-02 09:01:16 +02:00 |
|
LysandreJik
|
31d3373bc9
|
Appends space before special token
|
2019-09-01 21:07:00 -04:00 |
|
thomwolf
|
fede4ef45d
|
fixing #1133
|
2019-09-02 02:27:39 +02:00 |
|
Thomas Wolf
|
b6cd856b08
|
Merge pull request #1164 from stefan-it/master
distillation: fix ModuleNotFoundError error in token counts script
|
2019-09-02 02:00:07 +02:00 |
|
Thomas Wolf
|
ff7368eb6b
|
Merge pull request #1077 from huggingface/pruning-save-and-load
Pruning changes so that deleted heads are kept on save/load
|
2019-09-01 09:42:15 +02:00 |
|
LysandreJik
|
6ae0bb5291
|
XLM 100 different URLs
|
2019-08-31 14:46:31 -04:00 |
|
LysandreJik
|
819b468f70
|
Fixed XLM model url
|
2019-08-31 14:40:51 -04:00 |
|
LysandreJik
|
58b59a0c31
|
Random seed is accessible anywhere within the common tests
|
2019-08-31 13:17:08 -04:00 |
|
Stefan Schweter
|
a1c34bd286
|
distillation: fix ModuleNotFoundError error in token counts script
|
2019-08-31 12:21:38 +02:00 |
|
LysandreJik
|
ea86bef545
|
Check for None
|
2019-08-31 00:56:22 -04:00 |
|
LysandreJik
|
e0f867a9ba
|
XLNet bias fix on resize embeddings (cf #1124)
|
2019-08-31 00:50:59 -04:00 |
|
LysandreJik
|
11600edc6e
|
Rebase on master + DistilBERT head pruning patch
|
2019-08-31 00:37:41 -04:00 |
|
LysandreJik
|
b6992b7b47
|
Applied patch to OpenAI GPT, RoBERTa, TransfoL, XLM and XLNet
|
2019-08-31 00:33:50 -04:00 |
|
thomwolf
|
bdb4409ed8
|
updated pruning logic with sets - Bert and GPT-2
|
2019-08-31 00:33:50 -04:00 |
|
LysandreJik
|
0c8e823b03
|
Added patch to remaining models
|
2019-08-31 00:33:50 -04:00 |
|
LysandreJik
|
0cd283522a
|
Attempt to fix head index
|
2019-08-31 00:33:50 -04:00 |
|