Mayhul Arora
|
08ff056c43
|
Added option to use multiple workers to create training data for lm fine tuning
|
2019-06-26 16:16:12 -07:00 |
|
thomwolf
|
3deea56c07
|
fixing loading fucntion
|
2019-06-26 13:41:12 +02:00 |
|
thomwolf
|
f56b8033f0
|
more versatile loading
|
2019-06-26 13:13:15 +02:00 |
|
thomwolf
|
4d47f4985d
|
slight refactoring, add abstract class for model loading
|
2019-06-26 12:52:44 +02:00 |
|
thomwolf
|
59cefd4f98
|
fix #726 - get_lr in examples
|
2019-06-26 11:28:27 +02:00 |
|
thomwolf
|
ddc2cc61a6
|
fix python2 tests
|
2019-06-26 11:17:42 +02:00 |
|
thomwolf
|
7e3070ae4f
|
add from_pretrained method to all configuration classes
|
2019-06-26 11:12:00 +02:00 |
|
thomwolf
|
93e9971c54
|
fix tests
|
2019-06-26 10:02:45 +02:00 |
|
thomwolf
|
092dacfd62
|
changing is_regression to unified API
|
2019-06-26 09:54:05 +02:00 |
|
thomwolf
|
e55d4c4ede
|
various updates to conversion, models and examples
|
2019-06-26 00:57:53 +02:00 |
|
thomwolf
|
603c513b35
|
update main conversion script and readme
|
2019-06-25 10:45:07 +02:00 |
|
thomwolf
|
7de1740490
|
add ability to restore fine-tuned TF mdoel
|
2019-06-25 10:27:58 +02:00 |
|
tonianelope
|
c9885903a1
|
update betas to follow pytorch convention
|
2019-06-25 09:23:12 +01:00 |
|
thomwolf
|
7334bf6c21
|
pad on left for xlnet
|
2019-06-24 15:05:11 +02:00 |
|
thomwolf
|
c888663f18
|
overwrite output directories if needed
|
2019-06-24 14:38:24 +02:00 |
|
thomwolf
|
62d78aa37e
|
updating GLUE utils for compatibility with XLNet
|
2019-06-24 14:36:11 +02:00 |
|
thomwolf
|
24ed0b9346
|
updating run_xlnet_classifier
|
2019-06-24 12:00:09 +02:00 |
|
thomwolf
|
f6081f2255
|
add xlnetforsequence classif and run_classifier example for xlnet
|
2019-06-24 10:01:07 +02:00 |
|
Rocketknight1
|
8d6a118aee
|
Incorrect docstring for the head_mask argument to BertForMaskedLM
|
2019-06-23 18:47:05 +01:00 |
|
Matt
|
06716d7536
|
Merge pull request #3 from huggingface/master
Catch up with main repo
|
2019-06-23 18:46:03 +01:00 |
|
thomwolf
|
c946bb51a6
|
fix xlnet tokenizer and python2
|
2019-06-22 22:28:49 +02:00 |
|
Thomas Wolf
|
98dc30b21e
|
Merge pull request #714 from papower1/master
Correct a broken link on README
|
2019-06-22 21:29:41 +02:00 |
|
Thomas Wolf
|
eae5d3819d
|
Merge pull request #715 from Rocketknight1/master
Include a reference for LM finetuning
|
2019-06-22 21:29:19 +02:00 |
|
Rocketknight1
|
c7b2808ed7
|
Update LM finetuning README to include a literature reference
|
2019-06-22 15:04:01 +01:00 |
|
Matt
|
7c59e32d47
|
Merge pull request #2 from huggingface/master
Updating my fork to the latest version
|
2019-06-22 14:59:47 +01:00 |
|
Chang-Uk Shin
|
ada0d8fec7
|
Merge pull request #1 from papower1/papower1-patch-1
Correct a broken link and its context.
|
2019-06-22 20:34:45 +09:00 |
|
Chang-Uk Shin
|
fcc706343f
|
Correct a broken link and its context.
Correct a broken link(run_lm_finetuning.py) and its context.
|
2019-06-22 20:33:48 +09:00 |
|
thomwolf
|
181075635d
|
updating model loading and adding special tokens ids
|
2019-06-21 23:23:37 +02:00 |
|
thomwolf
|
ebd2cb8d74
|
update from_pretrained to load XLNetModel as well
|
2019-06-21 21:08:44 +02:00 |
|
thomwolf
|
483cbc36a9
|
test deviation with tf model: max ~1e-3 should be ok
|
2019-06-21 16:38:01 +02:00 |
|
thomwolf
|
24d8068982
|
weights loading script ok
|
2019-06-21 12:33:44 +02:00 |
|
thomwolf
|
32da75486b
|
add tokenizer and tests
|
2019-06-21 11:09:51 +02:00 |
|
thomwolf
|
45709d7532
|
model running with simple inputs
|
2019-06-21 00:28:42 +02:00 |
|
thomwolf
|
b407972e27
|
update gitignore
|
2019-06-20 13:52:56 +02:00 |
|
thomwolf
|
c2ea5aef77
|
work in progress on xlnet
|
2019-06-20 13:52:21 +02:00 |
|
thomwolf
|
de713fa9b4
|
starting
|
2019-06-20 10:54:19 +02:00 |
|
thomwolf
|
c304593d8f
|
BERTology details in readme
|
2019-06-20 10:05:06 +02:00 |
|
Thomas Wolf
|
12e892e174
|
Merge pull request #697 from huggingface/updating_examples
Updating examples
|
2019-06-20 09:58:24 +02:00 |
|
thomwolf
|
411981a080
|
remove slow circle-ci
|
2019-06-20 08:54:18 +02:00 |
|
chrislarson1
|
716cc1c4d9
|
added main() for programmatic call to convert pytorch->tf
|
2019-06-19 23:18:57 -04:00 |
|
chrislarson1
|
a8e071c690
|
added notebook to check correctness of the pytorch->tensorflow conversion
|
2019-06-19 23:08:08 -04:00 |
|
chrislarson1
|
0a4fb0da57
|
Merge remote-tracking branch 'upstream/master' into convert-back-to-tf
merging in latest changes from upstream
|
2019-06-19 22:56:20 -04:00 |
|
thomwolf
|
edfe91c36e
|
first version bertology ok
|
2019-06-19 23:43:04 +02:00 |
|
thomwolf
|
7766ce66dd
|
update bertology
|
2019-06-19 22:29:51 +02:00 |
|
thomwolf
|
7f00a36e27
|
pruning should keep on device
|
2019-06-19 22:23:12 +02:00 |
|
thomwolf
|
e4b46d86ce
|
update head pruning
|
2019-06-19 22:16:30 +02:00 |
|
timoeller
|
939cf29157
|
Adjust s3 german Bert file storage
|
2019-06-19 18:38:42 +02:00 |
|
thomwolf
|
0f40e8d6a6
|
debugger
|
2019-06-19 15:38:46 +02:00 |
|
thomwolf
|
0e1e8128bf
|
more logging
|
2019-06-19 15:35:49 +02:00 |
|
thomwolf
|
909d4f1af2
|
cuda again
|
2019-06-19 15:32:10 +02:00 |
|