Commit Graph

15053 Commits

Author SHA1 Message Date
thomwolf
c0239e09e6 first commit 2019-07-04 17:06:30 +02:00
thomwolf
cf86d23eff parallelism in circlci 2019-07-04 17:02:21 +02:00
thomwolf
15b70338ba adding squad model to xlnet and xlm 2019-07-04 16:50:42 +02:00
thomwolf
fbe04423b6 Common SequenceSummary class 2019-07-04 00:25:30 +02:00
thomwolf
c22545aa40 fix xlm torchscript 2019-07-03 23:03:57 +02:00
thomwolf
3b23a846b6 Merge branch 'xlnet' of https://github.com/huggingface/pytorch-pretrained-BERT into xlnet 2019-07-03 22:54:58 +02:00
thomwolf
8fa3a1f0d8 updating tests 2019-07-03 22:54:53 +02:00
thomwolf
c41f2bad69 WIP XLM + refactoring 2019-07-03 22:54:39 +02:00
Thomas Wolf
64ce4dbd86 Merge pull request #748 from huggingface/torchscript (Release 0.7 - Add Torchscript capabilities) 2019-07-03 22:52:03 +02:00
LysandreJik
b43b130f35 TorchScript flag in config; Tied weights when not running TorchScript; tuple concatenation clean-up. 2019-07-03 16:21:17 -04:00
LysandreJik
4703148f0c TransformerXL can't be exported to TorchScript because of control-flow. Exception added to tests. 2019-07-03 14:50:23 -04:00
LysandreJik
971c24687f XLNET can be exported to TorchScript 2019-07-03 11:03:09 -04:00
LysandreJik
be54b16960 GPT can be exported to TorchScript 2019-07-02 18:09:45 -04:00
LysandreJik
d8e83de792 GPT2 can be exported to TorchScript 2019-07-02 18:01:09 -04:00
thomwolf
288be7b7ea xlm 2019-07-02 23:42:31 +02:00
LysandreJik
e891bb43d5 BERT can be exported to TorchScript 2019-07-02 17:23:18 -04:00
LysandreJik
6ce1ee04fc TorchScript testing with output_attentions and output_hidden_state 2019-07-02 17:22:59 -04:00
thomwolf
7ed5bf706f add tests 2019-07-02 16:42:22 +02:00
thomwolf
708877958a updating tests and models, adding weights initialization test 2019-07-02 16:35:29 +02:00
thomwolf
99ae5ab883 update config tests and circle-ci 2019-07-02 12:40:39 +02:00
thomwolf
1484d67de9 [LARGE] updating all tests and API 2019-07-02 12:13:17 +02:00
Lei Mao
64b2a828c0 fix evaluation bug 2019-07-01 14:56:24 -07:00
thomwolf
4f8b5f687c add fix for serialization of tokenizer 2019-06-29 23:35:21 +02:00
thomwolf
d9184620f9 fix tests and new API 2019-06-29 23:10:40 +02:00
Thomas Wolf
dad3c7a485 Merge pull request #723 from tonianelope/master (Update Adam optimizer to follow pytorch convention for betas parameter (#510)) 2019-06-28 17:28:25 +02:00
Thomas Wolf
e296d5bef1 Merge pull request #704 from deepset-ai/master (Adjust s3 german Bert file storage) 2019-06-28 17:10:58 +02:00
Thomas Wolf
c68b4eceed Merge pull request #718 from Rocketknight1/master (Incorrect docstring for BertForMaskedLM) 2019-06-28 17:08:51 +02:00
thomwolf
213981d8cb updating bert API 2019-06-28 16:45:24 +02:00
thomwolf
2b56e98892 standardizing API across models - XLNetForSeqClass working 2019-06-28 16:35:09 +02:00
thomwolf
3a00674cbf fix imports 2019-06-27 17:18:46 +02:00
thomwolf
d939d6fd02 fix hidden-state extraction 2019-06-27 09:39:44 +02:00
thomwolf
0c2ff34815 extracting double hidden-state from xlnet 2019-06-27 09:27:50 +02:00
Mayhul Arora
08ff056c43 Added option to use multiple workers to create training data for lm fine tuning 2019-06-26 16:16:12 -07:00
thomwolf
3deea56c07 fixing loading fucntion 2019-06-26 13:41:12 +02:00
thomwolf
f56b8033f0 more versatile loading 2019-06-26 13:13:15 +02:00
thomwolf
4d47f4985d slight refactoring, add abstract class for model loading 2019-06-26 12:52:44 +02:00
thomwolf
59cefd4f98 fix #726 - get_lr in examples 2019-06-26 11:28:27 +02:00
thomwolf
ddc2cc61a6 fix python2 tests 2019-06-26 11:17:42 +02:00
thomwolf
7e3070ae4f add from_pretrained method to all configuration classes 2019-06-26 11:12:00 +02:00
thomwolf
93e9971c54 fix tests 2019-06-26 10:02:45 +02:00
thomwolf
092dacfd62 changing is_regression to unified API 2019-06-26 09:54:05 +02:00
thomwolf
e55d4c4ede various updates to conversion, models and examples 2019-06-26 00:57:53 +02:00
thomwolf
603c513b35 update main conversion script and readme 2019-06-25 10:45:07 +02:00
thomwolf
7de1740490 add ability to restore fine-tuned TF mdoel 2019-06-25 10:27:58 +02:00
tonianelope
c9885903a1 update betas to follow pytorch convention 2019-06-25 09:23:12 +01:00
thomwolf
7334bf6c21 pad on left for xlnet 2019-06-24 15:05:11 +02:00
thomwolf
c888663f18 overwrite output directories if needed 2019-06-24 14:38:24 +02:00
thomwolf
62d78aa37e updating GLUE utils for compatibility with XLNet 2019-06-24 14:36:11 +02:00
thomwolf
24ed0b9346 updating run_xlnet_classifier 2019-06-24 12:00:09 +02:00
thomwolf
f6081f2255 add xlnetforsequence classif and run_classifier example for xlnet 2019-06-24 10:01:07 +02:00