Commit Graph

18 Commits

Author SHA1 Message Date
thomwolf
0c2ff34815 extracting double hidden-state from xlnet 2019-06-27 09:27:50 +02:00
thomwolf
3deea56c07 fixing loading function 2019-06-26 13:41:12 +02:00
thomwolf
4d47f4985d slight refactoring, add abstract class for model loading 2019-06-26 12:52:44 +02:00
thomwolf
7e3070ae4f add from_pretrained method to all configuration classes 2019-06-26 11:12:00 +02:00
thomwolf
092dacfd62 changing is_regression to unified API 2019-06-26 09:54:05 +02:00
thomwolf
e55d4c4ede various updates to conversion, models and examples 2019-06-26 00:57:53 +02:00
thomwolf
7de1740490 add ability to restore fine-tuned TF model 2019-06-25 10:27:58 +02:00
thomwolf
62d78aa37e updating GLUE utils for compatibility with XLNet 2019-06-24 14:36:11 +02:00
thomwolf
24ed0b9346 updating run_xlnet_classifier 2019-06-24 12:00:09 +02:00
thomwolf
f6081f2255 add xlnetforsequence classif and run_classifier example for xlnet 2019-06-24 10:01:07 +02:00
thomwolf
181075635d updating model loading and adding special tokens ids 2019-06-21 23:23:37 +02:00
thomwolf
ebd2cb8d74 update from_pretrained to load XLNetModel as well 2019-06-21 21:08:44 +02:00
thomwolf
483cbc36a9 test deviation with tf model: max ~1e-3 should be ok 2019-06-21 16:38:01 +02:00
thomwolf
24d8068982 weights loading script ok 2019-06-21 12:33:44 +02:00
thomwolf
32da75486b add tokenizer and tests 2019-06-21 11:09:51 +02:00
thomwolf
45709d7532 model running with simple inputs 2019-06-21 00:28:42 +02:00
thomwolf
c2ea5aef77 work in progress on xlnet 2019-06-20 13:52:21 +02:00
thomwolf
de713fa9b4 starting 2019-06-20 10:54:19 +02:00