Commit Graph

15053 Commits

Author SHA1 Message Date
erenup
4812a5a767 add doc string 2019-09-16 11:50:18 +08:00
thomwolf
4b956b2a6b add layer_norm_epsilon configuration for transformer xl 2019-09-13 17:09:20 +02:00
thomwolf
b97af8cce9 skip finetuned checkpoints 2019-09-13 16:43:49 +02:00
thomwolf
65c49bb27e adding TF 2.0 adaptive softmax with logits + loss outputs 2019-09-13 15:50:51 +02:00
thomwolf
39c38b2ea0 fix 2019-09-12 16:47:11 +02:00
thomwolf
dcddf498c8 fix bert layernorm 2019-09-12 16:46:32 +02:00
thomwolf
d3a3a0353c clean up cache after conversion 2019-09-12 16:42:52 +02:00
thomwolf
a84adddd1b convert all models 2019-09-12 13:14:07 +02:00
VictorSanh
32e1332acf [distil] fix once for all general logger for scripts 2019-09-11 14:19:07 +00:00
Thomas Wolf
b62abe87c9
Merge pull request #1249 from ziliwang/master
fixed: hard coding for max and min number will out of range in fp16, which will cause nan.
2019-09-11 15:53:28 +02:00
thomwolf
969d3ae95e XLMWithLMHead fixed - standardize conversion 2019-09-11 15:47:33 +02:00
thomwolf
646711e1e2 standardize scopes names - add conversion methods 2019-09-11 15:34:17 +02:00
thomwolf
4356f791a2 XLM passing tests 2019-09-11 11:49:54 +02:00
LysandreJik
11ac4b9555 [CI] Symbolic link for documentation 2019-09-11 10:13:44 +02:00
Zili Wang
8bdee1cb73 fixed: hard coding for max and min number will out of range in fp16, which will cause nan. 2019-09-11 15:41:53 +08:00
ziliwang
7424b2848f
Merge pull request #1 from huggingface/master
merege from original repo
2019-09-11 11:02:23 +08:00
VictorSanh
364920e216 fix small bug/typo 2019-09-10 21:45:01 +00:00
Thomas Wolf
23c23f5399
Merge pull request #1229 from SKRohit/master
changes in evaluate function in run_lm_finetuning.py
2019-09-10 22:16:45 +02:00
Thomas Wolf
99a54ac51c
Merge pull request #1233 from searchivarius/master
Fix to prevent crashing on assert len(tokens_b)>=1
2019-09-10 22:15:47 +02:00
Thomas Wolf
439b37b474
Merge pull request #1241 from mattolson93/patch-1
Fixing typo in gpt2 for doc site's class link
2019-09-10 22:14:18 +02:00
mattolson93
f2cf6ce4a9
Fixing typo in gpt2 for doc site's class link 2019-09-10 09:12:01 -07:00
thomwolf
465870c33f Xlnet working - also added simple question answering model for XLNet 2019-09-10 16:44:41 +02:00
thomwolf
16b6361792 xlnet paassing first test 2019-09-10 12:39:27 +02:00
thomwolf
32aabe8c33 WIP XLNet 2019-09-10 12:17:18 +02:00
Thomas Wolf
2c177a87eb
Merge pull request #1228 from huggingface/head-masking-test
Trying to fix the head masking test
2019-09-10 11:55:27 +02:00
thomwolf
f851fb55ca fixing error message 2019-09-10 09:24:08 +02:00
searchivarius
eab980fd68 Fix to prevent crashing on assert len(tokens_b)>=1 2019-09-09 19:58:08 -04:00
VictorSanh
a95ced6260 [Distillation] save last chkpt as pytorch_model.bin 2019-09-09 19:53:35 +00:00
thomwolf
50c6bc4195 fix tf bert model 2019-09-09 17:46:01 +02:00
Rohit Kumar Singh
4b082bd4d8
Merge pull request #1 from SKRohit/SKRohit-patch-1
changes in return statement of evaluate function
2019-09-09 19:59:27 +05:30
Rohit Kumar Singh
e5df36397b
changes in return statement of evaluate function
changed `results` to `result` and removed `results` dict defined previously
2019-09-09 19:55:57 +05:30
thomwolf
0537139b2b removing tf.function 2019-09-09 14:47:31 +02:00
Thomas Wolf
84d346b687
Merge pull request #1195 from huggingface/reorder_arguments
[2.0] Reodering arguments for torch jit #1010 and future TF2.0 compatibility
2019-09-09 15:42:51 +03:00
Thomas Wolf
3f05de6dde
Merge branch 'master' into reorder_arguments 2019-09-09 15:42:25 +03:00
thomwolf
33cb00f41a add GPT2 to init - fix weights loading - remove tf.function 2019-09-09 14:29:24 +02:00
thomwolf
78b2a53f10 debug file download in tests error 2019-09-09 13:38:10 +02:00
thomwolf
6b3438df21 fixing GPT2 double head model and updating the torch version tests 2019-09-09 12:48:36 +02:00
thomwolf
e360037236 Merge branch 'tf2' of https://github.com/huggingface/pytorch-transformers into tf2 2019-09-09 11:08:49 +02:00
thomwolf
b7175a2701 fixed imports in tests and gpt2 config test 2019-09-09 11:04:03 +02:00
Thomas Wolf
995e38b7af
Merge pull request #1214 from huggingface/new-examples
Better examples
2019-09-09 10:26:36 +03:00
thomwolf
3401980fc4 fix #1208 2019-09-09 10:22:12 +03:00
thomwolf
728637356c WIP GPT2 2019-09-09 10:18:55 +03:00
thomwolf
34f28b2a13 WIP GPT2 2019-09-08 15:02:06 +03:00
thomwolf
ad88563bda WIP GPT-2 2019-09-08 15:02:06 +03:00
thomwolf
64d83c7ae0 WIP 2019-09-08 15:02:06 +03:00
thomwolf
01597e5b90 add tf auto models + tests 2019-09-08 15:02:06 +03:00
thomwolf
f5c698b21a add weights tying, attention and hidden states output tests 2019-09-08 15:02:06 +03:00
thomwolf
6dc4b6f34c skip transfo-xl tokenizer tests with tf for now 2019-09-08 15:02:06 +03:00
thomwolf
e30579f764 no pytest version checking 2019-09-08 15:02:06 +03:00
thomwolf
518307dfcd test suite independent of framework 2019-09-08 15:02:06 +03:00