thomwolf
|
b19786985d
|
unified tokenizer api and serialization + tests
|
2019-07-09 10:25:18 +02:00 |
|
thomwolf
|
3d5f291386
|
updates to run_glue
|
2019-07-05 17:22:15 +02:00 |
|
thomwolf
|
99b90edab1
|
cleaning up run_glue example
|
2019-07-05 17:09:35 +02:00 |
|
thomwolf
|
1113f97f33
|
clean up glue example
|
2019-07-05 16:31:13 +02:00 |
|
thomwolf
|
162ba383b0
|
fix model loading
|
2019-07-05 15:57:14 +02:00 |
|
thomwolf
|
36bca545ff
|
tokenization abstract class - tests for examples
|
2019-07-05 15:02:59 +02:00 |
|
thomwolf
|
0bab55d5d5
|
[BIG] name change
|
2019-07-05 11:55:36 +02:00 |
|
thomwolf
|
c41f2bad69
|
WIP XLM + refactoring
|
2019-07-03 22:54:39 +02:00 |
|
thomwolf
|
2b56e98892
|
standardizing API across models - XLNetForSeqClass working
|
2019-06-28 16:35:09 +02:00 |
|
thomwolf
|
3a00674cbf
|
fix imports
|
2019-06-27 17:18:46 +02:00 |
|
thomwolf
|
59cefd4f98
|
fix #726 - get_lr in examples
|
2019-06-26 11:28:27 +02:00 |
|
thomwolf
|
092dacfd62
|
changing is_regression to unified API
|
2019-06-26 09:54:05 +02:00 |
|
thomwolf
|
e55d4c4ede
|
various updates to conversion, models and examples
|
2019-06-26 00:57:53 +02:00 |
|
thomwolf
|
7334bf6c21
|
pad on left for xlnet
|
2019-06-24 15:05:11 +02:00 |
|
thomwolf
|
c888663f18
|
overwrite output directories if needed
|
2019-06-24 14:38:24 +02:00 |
|
thomwolf
|
62d78aa37e
|
updating GLUE utils for compatibility with XLNet
|
2019-06-24 14:36:11 +02:00 |
|
thomwolf
|
24ed0b9346
|
updating run_xlnet_classifier
|
2019-06-24 12:00:09 +02:00 |
|
thomwolf
|
f6081f2255
|
add xlnetforsequence classif and run_classifier example for xlnet
|
2019-06-24 10:01:07 +02:00 |
|
thomwolf
|
181075635d
|
updating model loading and adding special tokens ids
|
2019-06-21 23:23:37 +02:00 |
|
thomwolf
|
ebd2cb8d74
|
update from_pretrained to load XLNetModel as well
|
2019-06-21 21:08:44 +02:00 |
|
thomwolf
|
edfe91c36e
|
first version bertology ok
|
2019-06-19 23:43:04 +02:00 |
|
thomwolf
|
7766ce66dd
|
update bertology
|
2019-06-19 22:29:51 +02:00 |
|
thomwolf
|
e4b46d86ce
|
update head pruning
|
2019-06-19 22:16:30 +02:00 |
|
thomwolf
|
0f40e8d6a6
|
debugger
|
2019-06-19 15:38:46 +02:00 |
|
thomwolf
|
0e1e8128bf
|
more logging
|
2019-06-19 15:35:49 +02:00 |
|
thomwolf
|
909d4f1af2
|
cuda again
|
2019-06-19 15:32:10 +02:00 |
|
thomwolf
|
14f0e8e557
|
fix cuda
|
2019-06-19 15:29:28 +02:00 |
|
thomwolf
|
34d706a0e1
|
pruning in bertology
|
2019-06-19 15:25:49 +02:00 |
|
thomwolf
|
dc8e0019b7
|
updating examples
|
2019-06-19 13:23:20 +02:00 |
|
thomwolf
|
68ab9599ce
|
small fix and updates to readme
|
2019-06-19 09:38:38 +02:00 |
|
thomwolf
|
f7e2ac01ea
|
update barrier
|
2019-06-18 22:43:35 +02:00 |
|
thomwolf
|
4d8c4337ae
|
test barrier in distrib training
|
2019-06-18 22:41:28 +02:00 |
|
thomwolf
|
3359955622
|
updating run_classif
|
2019-06-18 22:23:10 +02:00 |
|
thomwolf
|
29b7b30eaa
|
updating evaluation on a single gpu
|
2019-06-18 22:20:21 +02:00 |
|
thomwolf
|
7d2001aa44
|
overwrite_output_dir
|
2019-06-18 22:13:30 +02:00 |
|
thomwolf
|
16a1f338c4
|
fixing
|
2019-06-18 17:06:31 +02:00 |
|
thomwolf
|
92e0ad5aba
|
no numpy
|
2019-06-18 17:00:52 +02:00 |
|
thomwolf
|
4e6edc3274
|
hop
|
2019-06-18 16:57:15 +02:00 |
|
thomwolf
|
f55b60b9ee
|
fixing again
|
2019-06-18 16:56:52 +02:00 |
|
thomwolf
|
8bd9118294
|
quick fix
|
2019-06-18 16:54:41 +02:00 |
|
thomwolf
|
3e847449ad
|
fix out_label_ids
|
2019-06-18 16:53:31 +02:00 |
|
thomwolf
|
aad3a54e9c
|
fix paths
|
2019-06-18 16:48:04 +02:00 |
|
thomwolf
|
40dbda6871
|
updating classification example
|
2019-06-18 16:45:52 +02:00 |
|
thomwolf
|
7388c83b60
|
update run_classifier for distributed eval
|
2019-06-18 16:32:49 +02:00 |
|
thomwolf
|
9727723243
|
fix pickle
|
2019-06-18 16:02:42 +02:00 |
|
thomwolf
|
9710b68dbc
|
fix pickles
|
2019-06-18 16:01:15 +02:00 |
|
thomwolf
|
15ebd67d4e
|
cache in run_classifier + various fixes to the examples
|
2019-06-18 15:58:22 +02:00 |
|
thomwolf
|
e6e5f19257
|
fix
|
2019-06-18 14:45:14 +02:00 |
|
thomwolf
|
a432b3d466
|
distributed traing t_total
|
2019-06-18 14:39:09 +02:00 |
|
thomwolf
|
c5407f343f
|
split squad example in two
|
2019-06-18 14:29:03 +02:00 |
|