Antti Virtanen
|
abc43ffbff
|
Add pretrained model documentation for FinBERT.
|
2019-12-17 20:35:25 -05:00 |
|
Antti Virtanen
|
8ac840ff87
|
Adding Finnish BERT.
|
2019-12-17 20:35:25 -05:00 |
|
Julien Chaumond
|
a0d386455b
|
Fix outdated tokenizer doc
|
2019-12-17 20:07:39 -05:00 |
|
Julien Chaumond
|
ea636440d1
|
[roberta.conversion] Do not hardcode vocab size
and support for fairseq 0.9+
|
2019-12-17 18:12:22 -05:00 |
|
Arman Cohan
|
a4df2e0113
|
update roberta conversion
- update to fix conversion for the updated fairseq model
- create save directory if it does not exist
|
2019-12-17 18:12:22 -05:00 |
|
Lysandre
|
5e289f69bc
|
regex 2019.12.17 install fails with Python 2
|
2019-12-17 15:54:05 -05:00 |
|
Lysandre
|
2cff4bd8f3
|
Fix segmentation fault
|
2019-12-17 15:54:05 -05:00 |
|
Thomas Wolf
|
f061606277
|
Merge pull request #2164 from huggingface/cleanup-configs
[SMALL BREAKING CHANGE] Cleaning up configuration classes - Adding Model Cards
|
2019-12-17 09:10:16 +01:00 |
|
Julien Chaumond
|
3f5ccb183e
|
[doc] Clarify uploads
cf 855ff0e91d (commitcomment-36452545)
|
2019-12-16 18:20:29 -05:00 |
|
thomwolf
|
3cb51299c3
|
Fix #2109
|
2019-12-16 16:58:44 -05:00 |
|
Lysandre
|
18a879f475
|
fix #2180
|
2019-12-16 16:44:29 -05:00 |
|
Lysandre
|
d803409215
|
Fix run squad evaluate during training
|
2019-12-16 16:31:38 -05:00 |
|
Julien Chaumond
|
855ff0e91d
|
[doc] Model upload and sharing
ping @lysandrejik @thomwolf
Is this clear enough? Anything we should add?
|
2019-12-16 12:42:22 -05:00 |
|
thomwolf
|
1bbdbacd5b
|
update __init__ and saving
|
2019-12-16 14:38:20 +01:00 |
|
thomwolf
|
031ad4eb37
|
improving JSON error messages (for model card and configurations)
|
2019-12-16 14:20:57 +01:00 |
|
thomwolf
|
db0a9ee6e0
|
adding albert to TF auto models cc @LysandreJik
|
2019-12-16 14:08:08 +01:00 |
|
thomwolf
|
a4d07b983a
|
dict of all config and model files cc @LysandreJik
|
2019-12-16 14:00:32 +01:00 |
|
thomwolf
|
d3418a94ff
|
update tests
|
2019-12-16 13:52:41 +01:00 |
|
thomwolf
|
56e98ba81a
|
add model cards cc @mfuntowicz
|
2019-12-16 11:07:27 +01:00 |
|
thomwolf
|
8669598abd
|
update t5 tf
|
2019-12-16 09:59:36 +01:00 |
|
thomwolf
|
1b8613acb3
|
updating t5 config class
|
2019-12-16 09:51:42 +01:00 |
|
thomwolf
|
7140363e09
|
update bertabs
|
2019-12-14 09:44:53 +01:00 |
|
Thomas Wolf
|
a52d56c8d9
|
Merge branch 'master' into cleanup-configs
|
2019-12-14 09:43:07 +01:00 |
|
Thomas Wolf
|
e92bcb7eb6
|
Merge pull request #1739 from huggingface/t5
[WIP] Adding Google T5 model
|
2019-12-14 09:40:43 +01:00 |
|
thomwolf
|
cbb368ca06
|
distilbert tests
|
2019-12-14 09:31:18 +01:00 |
|
Julien Chaumond
|
b6d4284b26
|
[cli] Uploads: fix + test edge case
|
2019-12-13 22:44:57 -05:00 |
|
Lysandre
|
7bd11dda6f
|
Release: v2.2.2
|
2019-12-13 16:45:30 -05:00 |
|
LysandreJik
|
c3248cf122
|
Tests for all tokenizers
|
2019-12-13 16:41:44 -05:00 |
|
Pascal Voitot
|
f2ac50cb55
|
better for python2.x
|
2019-12-13 16:41:44 -05:00 |
|
Pascal Voitot
|
4cbdc7d910
|
missed space
|
2019-12-13 16:41:44 -05:00 |
|
Pascal Voitot
|
dd2add9f6e
|
more tests
|
2019-12-13 16:41:44 -05:00 |
|
Pascal Voitot
|
df160af736
|
🐛 #2096 in tokenizer.decode, space is now joined between all subtexts instead of before added tokens
|
2019-12-13 16:41:44 -05:00 |
|
Pascal Voitot
|
5b7b78e088
|
🐛 #2096 in tokenizer.decode, adds a space after special tokens to return right formatted string
|
2019-12-13 16:41:44 -05:00 |
|
Julien Chaumond
|
866d73ca26
|
[cli] Upload is now compatible with folders
|
2019-12-13 16:39:08 -05:00 |
|
Lysandre
|
d461472948
|
return for SQuAD [BLACKED]
|
2019-12-13 15:31:52 -05:00 |
|
Lysandre
|
f24a228a93
|
Speed up tokenization process
|
2019-12-13 14:50:35 -05:00 |
|
Lysandre
|
c8ed1c82c8
|
[SQUAD] Load checkpoint when evaluating without training
|
2019-12-13 12:13:48 -05:00 |
|
thomwolf
|
5c00e344c1
|
update model doc - switch 3B/11B to 3b/11b
|
2019-12-13 16:33:29 +01:00 |
|
Thomas Wolf
|
110394b2ba
|
Merge branch 'master' into t5
|
2019-12-13 16:03:32 +01:00 |
|
Pierric Cistac
|
5a5c4349e8
|
Fix summarization to_cpu doc
|
2019-12-13 10:02:33 -05:00 |
|
thomwolf
|
8ade204098
|
fix tf
|
2019-12-13 14:48:47 +01:00 |
|
thomwolf
|
47f0e3cfb7
|
cleaning up configuration classes
|
2019-12-13 14:33:24 +01:00 |
|
thomwolf
|
33e72b08d5
|
fix inner dimensions for 3B/11B models
|
2019-12-13 11:33:05 +01:00 |
|
LysandreJik
|
7296f1010b
|
Clean up squad and allow train_file and predict_file usage
|
2019-12-12 13:01:04 -05:00 |
|
Julien Chaumond
|
5d67aa21ae
|
[doc] Replicate doc from #2144
|
2019-12-12 12:39:41 -05:00 |
|
LysandreJik
|
fe92755b99
|
Fix special tokens mask in encode
|
2019-12-12 11:37:19 -05:00 |
|
Alan deLevie
|
fbf5455a86
|
Fix typo in examples/run_glue.py args declaration.
deay -> decay
|
2019-12-12 11:16:19 -05:00 |
|
thomwolf
|
f19dad61c7
|
fixing XLM conversion tests with dummy input
|
2019-12-12 14:46:30 +01:00 |
|
Thomas Wolf
|
90df44f0aa
|
Merge pull request #2063 from guillaume-be/special_tokens_mask_value_not_used
special_tokens_mask value was unused and calculated twice
|
2019-12-12 08:21:46 +01:00 |
|
Thomas Wolf
|
707f9e9241
|
Merge pull request #2081 from pglock/patch-1
handle string with only whitespaces as empty
|
2019-12-12 08:20:43 +01:00 |
|