Aidan Kierans
1718fb9e74
Minor/basic text fixes ( #2229 )
...
* Small clarification
Matches line 431 to line 435 for additional clarity and consistency.
* Fixed minor typo
The letter "s" was previously omitted from the word "docstrings".
2019-12-19 16:23:18 -05:00
Julien Chaumond
9a399ead25
Revert incorrect #1778
2019-12-19 15:45:48 -05:00
Lysandre
33adab2b91
Fix albert example
2019-12-19 12:40:43 -05:00
Lysandre
a1f1dce0ae
Correct max position for SQUAD and TFDS
2019-12-19 12:25:55 -05:00
Francesco
62c1fc3c1e
Removed duplicate XLMConfig, XLMForQuestionAnswering and XLMTokenizer from import statement of run_squad.py script
2019-12-19 09:50:56 -05:00
Ejar
284572efc0
Updated typo on the link
...
Updated documentation due to typo
2019-12-19 09:36:43 -05:00
patrickvonplaten
ed6ba93912
corrected typo in example for t5 model input argument
2019-12-19 09:34:55 -05:00
Lysandre
8efc6dd544
fix #2214
2019-12-18 10:47:59 -05:00
Julien Chaumond
94c99db34c
[FinBERT] fix incorrect url
2019-12-17 20:35:25 -05:00
Julien Chaumond
7ffa817390
[s3] mv files and update links
2019-12-17 20:35:25 -05:00
Antti Virtanen
c5f35e61db
Uploaded files to AWS.
2019-12-17 20:35:25 -05:00
Antti Virtanen
abc43ffbff
Add pretrained model documentation for FinBERT.
2019-12-17 20:35:25 -05:00
Antti Virtanen
8ac840ff87
Adding Finnish BERT.
2019-12-17 20:35:25 -05:00
Julien Chaumond
a0d386455b
Fix outdated tokenizer doc
2019-12-17 20:07:39 -05:00
Julien Chaumond
ea636440d1
[roberta.conversion] Do not hardcode vocab size
...
and support for fairseq 0.9+
2019-12-17 18:12:22 -05:00
Arman Cohan
a4df2e0113
update roberta conversion
...
- update to fix conversion for the updated fairseq model
- create save directory if not exist
2019-12-17 18:12:22 -05:00
Lysandre
5e289f69bc
regex 2019.12.17 install fails with Python 2
2019-12-17 15:54:05 -05:00
Lysandre
2cff4bd8f3
Fix segmentation fault
2019-12-17 15:54:05 -05:00
Thomas Wolf
f061606277
Merge pull request #2164 from huggingface/cleanup-configs
...
[SMALL BREAKING CHANGE] Cleaning up configuration classes - Adding Model Cards
2019-12-17 09:10:16 +01:00
Julien Chaumond
3f5ccb183e
[doc] Clarify uploads
...
cf 855ff0e91d (commitcomment-36452545)
2019-12-16 18:20:29 -05:00
thomwolf
3cb51299c3
Fix #2109
2019-12-16 16:58:44 -05:00
Lysandre
18a879f475
fix #2180
2019-12-16 16:44:29 -05:00
Lysandre
d803409215
Fix run squad evaluate during training
2019-12-16 16:31:38 -05:00
Julien Chaumond
855ff0e91d
[doc] Model upload and sharing
...
ping @lysandrejik @thomwolf
Is this clear enough? Anything we should add?
2019-12-16 12:42:22 -05:00
thomwolf
1bbdbacd5b
update __init__ and saving
2019-12-16 14:38:20 +01:00
thomwolf
031ad4eb37
improving JSON error messages (for model card and configurations)
2019-12-16 14:20:57 +01:00
thomwolf
db0a9ee6e0
adding albert to TF auto models cc @LysandreJik
2019-12-16 14:08:08 +01:00
thomwolf
a4d07b983a
dict of all config and model files cc @LysandreJik
2019-12-16 14:00:32 +01:00
thomwolf
d3418a94ff
update tests
2019-12-16 13:52:41 +01:00
thomwolf
56e98ba81a
add model cards cc @mfuntowicz
2019-12-16 11:07:27 +01:00
thomwolf
8669598abd
update t5 tf
2019-12-16 09:59:36 +01:00
thomwolf
1b8613acb3
updating t5 config class
2019-12-16 09:51:42 +01:00
thomwolf
7140363e09
update bertabs
2019-12-14 09:44:53 +01:00
Thomas Wolf
a52d56c8d9
Merge branch 'master' into cleanup-configs
2019-12-14 09:43:07 +01:00
Thomas Wolf
e92bcb7eb6
Merge pull request #1739 from huggingface/t5
...
[WIP] Adding Google T5 model
2019-12-14 09:40:43 +01:00
thomwolf
cbb368ca06
distilbert tests
2019-12-14 09:31:18 +01:00
Julien Chaumond
b6d4284b26
[cli] Uploads: fix + test edge case
2019-12-13 22:44:57 -05:00
Lysandre
7bd11dda6f
Release: v2.2.2
2019-12-13 16:45:30 -05:00
LysandreJik
c3248cf122
Tests for all tokenizers
2019-12-13 16:41:44 -05:00
Pascal Voitot
f2ac50cb55
better for python2.x
2019-12-13 16:41:44 -05:00
Pascal Voitot
4cbdc7d910
missed space
2019-12-13 16:41:44 -05:00
Pascal Voitot
dd2add9f6e
more tests
2019-12-13 16:41:44 -05:00
Pascal Voitot
df160af736
🐛 #2096 in tokenizer.decode, space is not joined between all subtexts instead of before added tokens
2019-12-13 16:41:44 -05:00
Pascal Voitot
5b7b78e088
🐛 #2096 in tokenizer.decode, adds a space after special tokens to return right formatted string
2019-12-13 16:41:44 -05:00
Julien Chaumond
866d73ca26
[cli] Upload is now compatible with folders
2019-12-13 16:39:08 -05:00
Lysandre
d461472948
return for SQuAD [BLACKED]
2019-12-13 15:31:52 -05:00
Lysandre
f24a228a93
Speed up tokenization process
2019-12-13 14:50:35 -05:00
Lysandre
c8ed1c82c8
[SQUAD] Load checkpoint when evaluating without training
2019-12-13 12:13:48 -05:00
thomwolf
5c00e344c1
update model doc - swith 3B/11B to 3b/11b
2019-12-13 16:33:29 +01:00
Thomas Wolf
110394b2ba
Merge branch 'master' into t5
2019-12-13 16:03:32 +01:00