Commit Graph

15053 Commits

Author SHA1 Message Date
thomwolf
b5ec526f85 updated data processor and metrics 2019-09-24 17:10:50 +02:00
thomwolf
a6981076ec various updates 2019-09-24 16:46:26 +02:00
LysandreJik
0b82e3d0d9 Relative imports 2019-09-24 09:52:25 -04:00
LysandreJik
f09e5ecef0 [Proposal] GLUE processors included in library 2019-09-24 09:47:34 -04:00
thomwolf
128bdd4c35 fix tests pt/tf 2019-09-24 15:43:39 +02:00
LysandreJik
72402d1acd Fixed DistilBERT tokenizer 2019-09-24 09:41:14 -04:00
thomwolf
28a30af6d1 fix auto models 2019-09-24 15:33:39 +02:00
thomwolf
de203853cc docstring for xlnet 2019-09-24 15:30:55 +02:00
thomwolf
559790f9e4 docstring for xlm 2019-09-24 15:26:57 +02:00
thomwolf
b3087ddde8 docstring t-xl 2019-09-24 15:21:51 +02:00
thomwolf
4761a39781 doctring roberta 2019-09-24 15:19:09 +02:00
thomwolf
45a6f2edd9 docstring for GPT 2019-09-24 15:15:47 +02:00
thomwolf
e7ba5bc85b docstring for GPT2 2019-09-24 15:12:36 +02:00
LysandreJik
d340e2329e create_mask_from_sequences -> create_token_type_ids_from_sequences 2019-09-24 09:09:28 -04:00
thomwolf
b94f73bab7 distilbert docstring 2019-09-24 15:06:51 +02:00
thomwolf
9678c49419 docstrings for bert 2019-09-24 14:57:05 +02:00
thomwolf
f3d1511b5b fix imports 2019-09-24 14:42:09 +02:00
thomwolf
dd2d90f344 update automodels 2019-09-24 14:39:41 +02:00
thomwolf
ee261439a9 add save_pretrained 2019-09-24 14:30:28 +02:00
thomwolf
29bb3e4eb0 double loading ok 2019-09-24 14:23:46 +02:00
thomwolf
f5397ffc3b update loading logics 2019-09-24 14:03:58 +02:00
thomwolf
271f213621 updating to load tf model in pt - fixing headmasking test 2019-09-24 13:51:28 +02:00
thomwolf
cf9c1cbb60 fix tests chen only using tf 2019-09-24 13:32:47 +02:00
thomwolf
2167e366ba update circleCi 2019-09-24 13:27:45 +02:00
thomwolf
e9a103c17a bidirectional conversion TF <=> PT - extended tests 2019-09-24 13:25:50 +02:00
LysandreJik
c832f43a4d output_token_type -> token_type_ids 2019-09-24 07:21:38 -04:00
LysandreJik
3927d7756c Updated the GLUE pre-processing method 2019-09-24 07:15:11 -04:00
LysandreJik
0ea82b246f Updated tests 2019-09-24 07:10:09 -04:00
LysandreJik
9d44236f70 Updated DistilBERT 2019-09-24 07:03:24 -04:00
thomwolf
a7e01a248b converting distilled/fine-tuned models 2019-09-24 10:58:52 +02:00
thomwolf
8ba44ced95 fix roberta conversion script 2019-09-24 09:48:23 +02:00
thomwolf
2b11fa5174 update __init__ and conversion script 2019-09-23 22:35:45 +02:00
thomwolf
6448396d54 fix roberta test 2019-09-23 22:27:13 +02:00
thomwolf
1e47dee24c Merge branch 'tf2' of https://github.com/huggingface/pytorch-transformers into tf2 2019-09-23 22:08:10 +02:00
thomwolf
c9591f6fac updated models input format + tests 2019-09-23 22:08:08 +02:00
Julien Chaumond
798da627eb Fix TFBert tests in Python 3.5 2019-09-23 12:06:10 -04:00
thomwolf
c014d1f0c6 fix the skipping 2019-09-23 16:39:57 +02:00
thomwolf
0b22e47a40 skipping pretrained TF model tests for now 2019-09-23 16:38:03 +02:00
thomwolf
830d212be7 test circleCI h5py version 2019-09-23 16:26:06 +02:00
Thomas Wolf
7c0f2d0a6a Merge pull request #1294 from sshleifer/delete-n-special-doc: Delete n_special reference in docstring 2019-09-23 14:54:55 +01:00
thomwolf
a31e591d27 fix XLM tests 2019-09-23 15:54:10 +02:00
thomwolf
447de34dde tests for distilbert and roberta 2019-09-23 15:38:29 +02:00
Santiago Castro
98dd19b96b Remove unnecessary use of FusedLayerNorm 2019-09-22 20:31:36 -04:00
Lorenzo Ampil
4b543c3007 Add option to use a 'stop token' which will be used to truncate the output text to everything till right before the 'stop token' 2019-09-22 21:38:38 +08:00
thomwolf
68a3e0223a roberta and distilbert 2019-09-20 23:14:51 +02:00
Maxpa1n
a2d4950f5c fix annotation 2019-09-20 10:59:35 -04:00
VictorSanh
9f995b99d4 minor fixes 2019-09-19 21:36:06 +00:00
VictorSanh
3fe5c8e8a8 update bert-base-uncased rslts 2019-09-19 19:34:22 +00:00
VictorSanh
354944e607 [distillation] big update w/ new weights 2019-09-19 19:25:21 +00:00
danai-antoniou
2e6797cc7d Added valuerror for duplicate added tokens 2019-09-19 15:40:42 +01:00