Commit Graph

1592 Commits

Author SHA1 Message Date
Thomas Wolf
e8e956dbb2 Merge pull request #1327 from huggingface/tf2-determinism: Pytorch/TF2 determinism 2019-09-24 22:49:57 +02:00
LysandreJik
1761d2091a Check to see if the models have the same results when in eval mode (pt) or when training=False (tf) 2019-09-24 14:59:10 -04:00
thomwolf
128bdd4c35 fix tests pt/tf 2019-09-24 15:43:39 +02:00
thomwolf
28a30af6d1 fix auto models 2019-09-24 15:33:39 +02:00
thomwolf
de203853cc docstring for xlnet 2019-09-24 15:30:55 +02:00
thomwolf
559790f9e4 docstring for xlm 2019-09-24 15:26:57 +02:00
thomwolf
b3087ddde8 docstring t-xl 2019-09-24 15:21:51 +02:00
thomwolf
4761a39781 docstring roberta 2019-09-24 15:19:09 +02:00
thomwolf
45a6f2edd9 docstring for GPT 2019-09-24 15:15:47 +02:00
thomwolf
e7ba5bc85b docstring for GPT2 2019-09-24 15:12:36 +02:00
thomwolf
b94f73bab7 distilbert docstring 2019-09-24 15:06:51 +02:00
thomwolf
9678c49419 docstrings for bert 2019-09-24 14:57:05 +02:00
thomwolf
f3d1511b5b fix imports 2019-09-24 14:42:09 +02:00
thomwolf
dd2d90f344 update automodels 2019-09-24 14:39:41 +02:00
thomwolf
ee261439a9 add save_pretrained 2019-09-24 14:30:28 +02:00
thomwolf
29bb3e4eb0 double loading ok 2019-09-24 14:23:46 +02:00
thomwolf
f5397ffc3b update loading logics 2019-09-24 14:03:58 +02:00
thomwolf
271f213621 updating to load tf model in pt - fixing headmasking test 2019-09-24 13:51:28 +02:00
thomwolf
cf9c1cbb60 fix tests when only using tf 2019-09-24 13:32:47 +02:00
thomwolf
2167e366ba update circleCi 2019-09-24 13:27:45 +02:00
thomwolf
e9a103c17a bidirectional conversion TF <=> PT - extended tests 2019-09-24 13:25:50 +02:00
thomwolf
a7e01a248b converting distilled/fine-tuned models 2019-09-24 10:58:52 +02:00
thomwolf
8ba44ced95 fix roberta conversion script 2019-09-24 09:48:23 +02:00
thomwolf
2b11fa5174 update __init__ and conversion script 2019-09-23 22:35:45 +02:00
thomwolf
6448396d54 fix roberta test 2019-09-23 22:27:13 +02:00
thomwolf
1e47dee24c Merge branch 'tf2' of https://github.com/huggingface/pytorch-transformers into tf2 2019-09-23 22:08:10 +02:00
thomwolf
c9591f6fac updated models input format + tests 2019-09-23 22:08:08 +02:00
Julien Chaumond
798da627eb Fix TFBert tests in Python 3.5 2019-09-23 12:06:10 -04:00
thomwolf
c014d1f0c6 fix the skipping 2019-09-23 16:39:57 +02:00
thomwolf
0b22e47a40 skipping pretrained TF model tests for now 2019-09-23 16:38:03 +02:00
thomwolf
830d212be7 test circleCI h5py version 2019-09-23 16:26:06 +02:00
thomwolf
a31e591d27 fix XLM tests 2019-09-23 15:54:10 +02:00
thomwolf
447de34dde tests for distilbert and roberta 2019-09-23 15:38:29 +02:00
thomwolf
68a3e0223a roberta and distilbert 2019-09-20 23:14:51 +02:00
thomwolf
3a527fa820 OpenAI GPT tests ok 2019-09-18 14:15:48 +02:00
thomwolf
556442afb3 hot fix 2019-09-18 14:12:41 +02:00
thomwolf
160b5d6080 fix xlm lang_embeddings loading 2019-09-18 14:10:20 +02:00
thomwolf
26497d1199 fix tests 2019-09-18 12:17:21 +02:00
thomwolf
6a083fd447 update pt-tf conversion script 2019-09-18 12:11:32 +02:00
thomwolf
f6969cc12b upgrade max model difference to 2e-2 (for transfo-xl adaptive softmax + inputs) 2019-09-18 11:12:02 +02:00
thomwolf
4b956b2a6b add layer_norm_epsilon configuration for transformer xl 2019-09-13 17:09:20 +02:00
thomwolf
b97af8cce9 skip finetuned checkpoints 2019-09-13 16:43:49 +02:00
thomwolf
65c49bb27e adding TF 2.0 adaptive softmax with logits + loss outputs 2019-09-13 15:50:51 +02:00
thomwolf
39c38b2ea0 fix 2019-09-12 16:47:11 +02:00
thomwolf
dcddf498c8 fix bert layernorm 2019-09-12 16:46:32 +02:00
thomwolf
d3a3a0353c clean up cache after conversion 2019-09-12 16:42:52 +02:00
thomwolf
a84adddd1b convert all models 2019-09-12 13:14:07 +02:00
thomwolf
969d3ae95e XLMWithLMHead fixed - standardize conversion 2019-09-11 15:47:33 +02:00
thomwolf
646711e1e2 standardize scopes names - add conversion methods 2019-09-11 15:34:17 +02:00
thomwolf
4356f791a2 XLM passing tests 2019-09-11 11:49:54 +02:00