Sylvain Gugger
3b44aa935a
Model utils doc ( #6005 )
...
* Document TF modeling utils
* Document all model utils
2020-07-24 09:16:28 -04:00
sgugger
a540405213
Fix commit hash for stable doc
2020-07-24 09:07:40 -04:00
Qingqing Cao
fc0fe2a532
fix: model card readme clutter ( #6008 )
...
this removes the clutter line in the readme.md of model card `csarron/roberta-base-squad-v1`. It also fixes the result table.
2020-07-24 04:17:52 -04:00
Sylvain Gugger
f5b5c5bd7e
Avoid unnecessary warnings when loading pretrained model ( #5922 )
...
* Avoid unnecessary warnings when loading pretrained model
* Fix test
* Add other keys to ignore
* keys_to_ignore_at_load -> authorized_missing_keys
2020-07-23 18:13:36 -04:00
Philip May
29afb5764f
Bert german dbmdz uncased sentence stsb ( #6000 )
...
* Describe usage of sentence model
* fix typo usage
* add use and description to readme
* fix typo in readme
* readme formatting
* add training procedure to readme
* description name and company
* readme formatting
* dataset training readme
* typo
* readme
2020-07-23 17:56:45 -04:00
Qingqing Cao
2b5ef9706d
Model cards: add roberta-base-squad-v1 and bert-base-uncased-squad-v1 ( #6006 )
...
* add: bert-base-uncased-squad-v1
* add: roberta-base-squad-v1
2020-07-23 17:53:47 -04:00
Sam Shleifer
9827d666eb
MbartTokenizer: do not hardcode vocab size ( #5998 )
2020-07-23 15:41:14 -04:00
Sylvain Gugger
6e16195510
Fix #5974 ( #5999 )
2020-07-23 13:51:29 -04:00
Sylvain Gugger
e168488a74
Cleanup Trainer and expose customization points ( #5982 )
...
* Clean up Trainer and expose customization points
* Formatting
* eval_step -> prediction_step
2020-07-23 12:05:41 -04:00
Qingqing Cao
76f52324b1
add fine-tuned mobilebert squad v1 and squad v2 model cards ( #5980 )
...
* add mobilebert-uncased-squad-v2
* fix shell cmd, add creator info
* add mobilebert-uncased-squad-v1
2020-07-23 11:57:29 -04:00
GmailB
7e251ae039
Create README.md ( #5989 )
2020-07-23 11:41:33 -04:00
Sylvain Gugger
33d7506ea1
Update doc of the model page ( #5985 )
2020-07-22 18:14:57 -04:00
Sam Shleifer
c3206eef44
[test] partial coverage for train_mbart_enro_cc25.sh ( #5976 )
2020-07-22 14:34:49 -04:00
Stas Bekman
2c0da7803a
minor doc fixes ( #5831 )
...
* minor doc fixes
correct superclass name and small grammar fixes
* correct the instance name in the error message
It appears to be `BaseTokenizer` from looking at:
`from tokenizers.implementations import BaseTokenizer as BaseTokenizerFast`
and not `Tokenizer` as it currently says.
2020-07-22 13:22:34 -04:00
Sam Shleifer
feeb956a19
[docs] Add integration test example to copy pasta template ( #5961 )
...
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-22 12:48:38 -04:00
Sam Shleifer
01116d3c5b
T5 Model Cards ( #5759 )
...
* T5 Model Cards
* Fix paths
* Fix tags
* lang-en
2020-07-22 11:38:37 -04:00
Funtowicz Morgan
896300177b
Expose padding_strategy on squad processor to fix QA pipeline performance regression ( #5932 )
...
* Attempt to fix the way squad_convert_examples_to_features pad the elements for the QA pipeline.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Quality
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Make the code easier to read and avoid testing multiple test the same thing.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* missing enum value on truncation_strategy.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Rethinking for the easiest fix: expose the padding strategy on squad_convert_examples_to_features.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* Remove unused imports.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
2020-07-22 16:11:57 +02:00
Sam Shleifer
ae67b2439f
[CI] Install examples/requirements.txt ( #5956 )
2020-07-21 21:07:48 -04:00
Sylvain Gugger
e714412fe6
Update doc to new model outputs ( #5946 )
...
* Update doc to new model outputs
* Fix outputs in quicktour
2020-07-21 18:13:55 -04:00
Sam Shleifer
ddd40b3211
[CI] self-scheduled runner tests examples/ ( #5927 )
2020-07-21 17:01:07 -04:00
Sam Shleifer
9dab39feea
seq2seq/run_eval.py can take decoder_start_token_id ( #5949 )
2020-07-21 16:58:45 -04:00
Sam Shleifer
5b193b39b0
[examples/seq2seq]: add --label_smoothing option ( #5919 )
2020-07-21 16:51:39 -04:00
Sam Shleifer
95d1962b9c
[Doc] explaining romanian postprocessing for MBART BLEU hacking ( #5943 )
2020-07-21 14:12:48 -04:00
Jannes
604a2355dc
Create README.md ( #5876 )
2020-07-21 13:28:22 -04:00
Jannes
77c718edef
Create README.md ( #5873 )
2020-07-21 13:28:06 -04:00
Jannes
325b277db9
Create README.md ( #5874 )
2020-07-21 13:27:30 -04:00
Jannes
d15be2216c
Create README.md ( #5879 )
2020-07-21 13:27:13 -04:00
Jannes
f3e23dd90a
Create README.md ( #5878 )
2020-07-21 13:20:47 -04:00
Jannes
8b01d15c05
Create README.md ( #5877 )
2020-07-21 13:20:43 -04:00
Jannes
05bddf304e
Create README.md ( #5875 )
2020-07-21 13:20:32 -04:00
Jannes
783a0c7ee9
Create README.md ( #5872 )
2020-07-21 13:20:21 -04:00
Jannes
e7844d60c2
Create README.md ( #5871 )
2020-07-21 13:19:48 -04:00
tuner007
b1ee69763c
Create README.md ( #5864 )
2020-07-21 13:15:07 -04:00
Manuel Romero
5f809e4976
Update README.md ( #5857 )
...
Add nlp dataset used
2020-07-21 13:14:27 -04:00
Manuel Romero
4215f59c99
Update README.md ( #5856 )
...
Add dataset used as it is now part of nlp package
2020-07-21 13:11:08 -04:00
Ali Hamdi Ali Fadel
1d72460d55
Add ComVE model cards ( #5884 )
...
* Add ComVE model cards
* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-21 12:54:29 -04:00
Aditya Soni
ccbf74a685
typos in seq2seq/readme ( #5937 )
2020-07-21 09:44:59 -04:00
BatJedi
d32279438a
Created model card for my extreme summarization model ( #5839 )
...
* Created model card for my extreme summarization model
* Update model_cards/yuvraj/xSumm/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-21 03:54:57 -04:00
BatJedi
abf5c56e9d
Created model card for my summarization model ( #5838 )
...
* Created model card for my summarization model
* Update model_cards/yuvraj/summarizer-cnndm/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-07-21 03:54:14 -04:00
Manuel Romero
d73baeebc5
Create README.md ( #5921 )
...
- Maybe the result of this query answers the question You did some days ago @julien-c ;-)
2020-07-21 03:52:52 -04:00
Manuel Romero
50acfc8717
Create README.md ( #5924 )
2020-07-21 03:41:37 -04:00
Manuel Romero
7249533404
Create README.md ( #5920 )
2020-07-21 03:31:42 -04:00
Sylvain Gugger
4781afd045
Clarify arg class ( #5916 )
2020-07-20 19:47:06 -04:00
Qingqing Cao
8e0bcb56ec
DataParallel fix: multi gpu evaluation ( #5926 )
...
The DataParallel training was fixed in https://github.com/huggingface/transformers/pull/5733 , this commit also fixes the evaluation. It's more convenient when the user enables both `do_train` and `do_eval`.
2020-07-20 17:54:08 -04:00
Sylvain Gugger
a20969170b
Add AlbertForPretraining to doc ( #5914 )
2020-07-20 17:53:21 -04:00
Sam Shleifer
f1a4e06f1f
[Fix] seq2seq pack_dataset.py actually packs ( #5913 )
...
Huge MT speedup!
2020-07-20 15:18:26 -04:00
Sylvain Gugger
32883b310b
Improve doc of use_cache ( #5912 )
...
* Improve doc of use_cache
* Update src/transformers/configuration_xlnet.py
Co-authored-by: Teven <teven.lescao@gmail.com>
Co-authored-by: Teven <teven.lescao@gmail.com>
2020-07-20 11:50:41 -04:00
Clement
9ccb45a263
Update gpt2-README.md
2020-07-20 11:40:33 -04:00
Clement
f19751117d
Create gpt2-medium-README.md
2020-07-20 10:47:42 -04:00
Clement
511523672b
Create gpt2-large-README.md
2020-07-20 10:47:27 -04:00