thomwolf
|
26497d1199
|
fix tests
|
2019-09-18 12:17:21 +02:00 |
|
thomwolf
|
6a083fd447
|
update pt-tf conversion script
|
2019-09-18 12:11:32 +02:00 |
|
thomwolf
|
f6969cc12b
|
upgrade max model difference to 2e-2 (for transfo-xl adaptive softmax + inputs)
|
2019-09-18 11:12:02 +02:00 |
|
thomwolf
|
4b956b2a6b
|
add layer_norm_epsilon configuration for transformer xl
|
2019-09-13 17:09:20 +02:00 |
|
thomwolf
|
b97af8cce9
|
skip finetuned checkpoints
|
2019-09-13 16:43:49 +02:00 |
|
thomwolf
|
65c49bb27e
|
adding TF 2.0 adaptive softmax with logits + loss outputs
|
2019-09-13 15:50:51 +02:00 |
|
thomwolf
|
39c38b2ea0
|
fix
|
2019-09-12 16:47:11 +02:00 |
|
thomwolf
|
dcddf498c8
|
fix bert layernorm
|
2019-09-12 16:46:32 +02:00 |
|
thomwolf
|
d3a3a0353c
|
clean up cache after conversion
|
2019-09-12 16:42:52 +02:00 |
|
thomwolf
|
a84adddd1b
|
convert all models
|
2019-09-12 13:14:07 +02:00 |
|
thomwolf
|
969d3ae95e
|
XLMWithLMHead fixed - standardize conversion
|
2019-09-11 15:47:33 +02:00 |
|
thomwolf
|
646711e1e2
|
standardize scopes names - add conversion methods
|
2019-09-11 15:34:17 +02:00 |
|
thomwolf
|
4356f791a2
|
XLM passing tests
|
2019-09-11 11:49:54 +02:00 |
|
thomwolf
|
465870c33f
|
XLNet working - also added simple question answering model for XLNet
|
2019-09-10 16:44:41 +02:00 |
|
thomwolf
|
16b6361792
|
XLNet passing first test
|
2019-09-10 12:39:27 +02:00 |
|
thomwolf
|
32aabe8c33
|
WIP XLNet
|
2019-09-10 12:17:18 +02:00 |
|
thomwolf
|
f851fb55ca
|
fixing error message
|
2019-09-10 09:24:08 +02:00 |
|
thomwolf
|
50c6bc4195
|
fix tf bert model
|
2019-09-09 17:46:01 +02:00 |
|
thomwolf
|
0537139b2b
|
removing tf.function
|
2019-09-09 14:47:31 +02:00 |
|
thomwolf
|
33cb00f41a
|
add GPT2 to init - fix weights loading - remove tf.function
|
2019-09-09 14:29:24 +02:00 |
|
thomwolf
|
78b2a53f10
|
debug file download in tests error
|
2019-09-09 13:38:10 +02:00 |
|
thomwolf
|
6b3438df21
|
fixing GPT2 double head model and updating the torch version tests
|
2019-09-09 12:48:36 +02:00 |
|
thomwolf
|
e360037236
|
Merge branch 'tf2' of https://github.com/huggingface/pytorch-transformers into tf2
|
2019-09-09 11:08:49 +02:00 |
|
thomwolf
|
b7175a2701
|
fixed imports in tests and gpt2 config test
|
2019-09-09 11:04:03 +02:00 |
|
thomwolf
|
728637356c
|
WIP GPT2
|
2019-09-09 10:18:55 +03:00 |
|
thomwolf
|
34f28b2a13
|
WIP GPT2
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
ad88563bda
|
WIP GPT-2
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
64d83c7ae0
|
WIP
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
01597e5b90
|
add tf auto models + tests
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
f5c698b21a
|
add weights tying, attention and hidden states output tests
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
6dc4b6f34c
|
skip transfo-xl tokenizer tests with tf for now
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
e30579f764
|
no pytest version checking
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
518307dfcd
|
test suite independent of framework
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
9d0a11a68c
|
update dependencies and circle-ci
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
24a20483f5
|
update conversion script names
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
6f152572cd
|
add conversion script, rename conversion scripts
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
a4704b1263
|
skipping tf tests if tf is not installed
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
ad0ab9afe9
|
fix test when tf is not here
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
59fe641b8b
|
also gathering file names in file_utils
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
d68a8fe462
|
add tf bert files
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
7ae642b72d
|
update conversion scripts
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
69bff89935
|
clean ups
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
1efb1f1660
|
split configuration and modeling files
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
1eb125fb95
|
be sure we have uint8
|
2019-09-08 15:02:06 +03:00 |
|
thomwolf
|
705237b4ec
|
add tf auto models + tests
|
2019-09-05 12:21:08 +02:00 |
|
thomwolf
|
600a42329b
|
add weights tying, attention and hidden states output tests
|
2019-09-05 12:02:14 +02:00 |
|
thomwolf
|
04d2006f28
|
skip transfo-xl tokenizer tests with tf for now
|
2019-09-05 11:22:13 +02:00 |
|
thomwolf
|
7f6a0c0d69
|
no pytest version checking
|
2019-09-05 11:20:56 +02:00 |
|
thomwolf
|
7c0baf9521
|
test suite independent of framework
|
2019-09-05 11:18:55 +02:00 |
|
thomwolf
|
7775a3d2ed
|
update dependencies and circle-ci
|
2019-09-05 10:23:04 +02:00 |
|