Patrick von Platen
79a82cc06a
[Benchmarks] improve Example Plotter ( #5245 )
...
* improve plotting
* better labels
* fix time plot
2020-06-26 15:00:14 +02:00
Sylvain Gugger
88d7f96e33
Gpt2 model card ( #5283 )
...
* Bert base model card
* Add metadata
* Adapt examples
* GPT2 model card
* Remove the BERT model card
* Change language code
2020-06-26 08:08:31 -04:00
Sylvain Gugger
fc5bce9e60
Bert base model card ( #5276 )
...
* Bert base model card
* Add metadata
* Adapt examples
* Comment on text generation
* Update model_cards/bert-base-uncased-README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-06-26 08:01:19 -04:00
Funtowicz Morgan
135791e8ef
Add pad_to_multiple_of on tokenizers (reimport) ( #5054 )
...
* Add new parameter `pad_to_multiple_of` on tokenizers.
* unittest for pad_to_multiple_of
* Add .name when logging enum.
* Fix missing .items() on dict in tests.
* Add special check + warning if the tokenizer doesn't have proper pad_token.
* Use the correct logger format specifier.
* Ensure tokenizer with no pad_token do not modify the underlying padding strategy.
* Skip test if tokenizer doesn't have pad_token
* Fix RobertaTokenizer on empty input
* Format.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
* fix and updating to simpler API
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
2020-06-26 11:55:57 +02:00
Lysandre Debut
7cc15bdd96
Closes #5218
2020-06-25 18:19:21 -04:00
Joe Davison
2ffef0d0c7
Training & fine-tuning quickstart ( #5034 )
...
* add initial fine-tuning guide
* split code blocks to smaller segments
* fix up trianer section of fine-tune doc
* a few last typos
* Update usage -> task summary link
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-06-25 15:11:11 -06:00
Lysandre Debut
364a5ae1f0
Refactor Code samples; Test code samples ( #5036 )
...
* Refactor code samples
* Test docstrings
* Style
* Tokenization examples
* Run rust of tests
* First step to testing source docs
* Style and BART comment
* Test the remainder of the code samples
* Style
* let to const
* Formatting fixes
* Ready for merge
* Fix fixture + Style
* Fix last tests
* Update docs/source/quicktour.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Addressing @sgugger's comments + Fix MobileBERT in TF
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-06-25 16:46:00 -04:00
Thomas Wolf
315f464b0a
[tokenizers] Several small improvements and bug fixes ( #5287 )
...
* avoid recursion in id checks for fast tokenizers
* better typings and fix #5232
* align slow and fast tokenizers behaviors for Roberta and GPT2
* style and quality
* fix tests - improve typings
2020-06-25 22:17:14 +02:00
Sylvain Gugger
24f46ea3f3
Remove links for all docs ( #5280 )
2020-06-25 11:45:05 -04:00
Thomas Wolf
27cf1d97f0
[Tokenization] Fix #5181 - make #5155 more explicit - move back the default logging level in tests to WARNING ( #5252 )
...
* fix-5181
Padding to max sequence length while truncation to another length was wrong on slow tokenizers
* clean up and fix #5155
* fix XLM test
* Fix tests for Transfo-XL
* logging only above WARNING in tests
* switch slow tokenizers tests in @slow
* fix Marian truncation tokenization test
* style and quality
* make the test a lot faster by limiting the sequence length used in tests
2020-06-25 17:24:28 +02:00
Sam Shleifer
e008d520bb
[examples/seq2seq] more README improvements ( #5274 )
2020-06-25 10:13:01 -04:00
Julien Chaumond
6a495cae00
[model_cards] Example of how to specify inputs for the widget
2020-06-25 15:58:25 +02:00
Anthony MOI
0e1fce3c01
Fix convert_graph_to_onnx ( #5230 )
2020-06-25 08:17:02 +02:00
Moumeneb1
5543efd5cc
Create README.md ( #5259 )
2020-06-25 01:56:07 -04:00
Sam Shleifer
40457bcebb
examples/seq2seq supports translation ( #5202 )
2020-06-24 23:58:11 -04:00
Sylvain Gugger
d12ceb48ba
Tokenization tutorial ( #5257 )
...
* All done
* Link to the tutorial
* Typo fixes
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
* Add metnion of the return_xxx args
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
2020-06-24 18:43:20 -04:00
Thomas Wolf
7ac9110711
Add more tests on tokenizers serialization - fix bugs ( #5056 )
...
* update tests for fast tokenizers + fix small bug in saving/loading
* better tests on serialization
* fixing serialization
* comment cleanup
2020-06-24 21:53:08 +02:00
Sylvain Gugger
0148c262e7
Fix first test ( #5255 )
2020-06-24 15:16:04 -04:00
Sylvain Gugger
70c1e1d2d5
Use master _static ( #5253 )
...
* Use _static from master everywhere
* Copy to existing too
2020-06-24 15:06:14 -04:00
Victor SANH
4965aee064
[HANS] Fix label_list for RoBERTa/BART (class flipping) ( #5196 )
...
* fix weirdness in roberta/bart for mnli trained checkpoints
* black compliance
* isort code check
2020-06-24 14:38:15 -04:00
Julien Chaumond
fc24a93e64
[HfApi] Add support for pipeline_tag
2020-06-24 16:54:00 +00:00
Setu Shah
0a3d0e02c5
Replace labels with -100 to skip loss calc ( #4718 )
2020-06-24 12:14:50 -04:00
Sylvain Gugger
6894b486d0
Fix version controller links (for realsies) ( #5251 )
2020-06-24 12:13:43 -04:00
Sai Saketh Aluru
1121ce9f98
Model cards for Hate-speech-CNERG models ( #5236 )
...
* Add dehatebert-mono-arabic readme card
* Update dehatebert-mono-arabic model card
* model cards for Hate-speech-CNERG models
2020-06-24 11:41:08 -04:00
Lysandre Debut
cf10d4cfdd
Cleaning TensorFlow models ( #5229 )
...
* Cleaning TensorFlow models
Update all classes
stylr
* Don't average loss
2020-06-24 11:37:20 -04:00
Sylvain Gugger
609e0c583f
Fix links ( #5248 )
2020-06-24 11:35:55 -04:00
Ali Modarressi
c9163a8d5a
delay decay schedule until the end of warmup ( #4940 )
2020-06-24 11:18:29 -04:00
Sylvain Gugger
f216b60671
Fix deploy doc ( #5246 )
...
* Try with the same command
* Try like this
2020-06-24 10:59:06 -04:00
Sylvain Gugger
49f6e7a3c6
Add some prints to debug ( #5244 )
2020-06-24 10:37:01 -04:00
Patrick von Platen
c2a26ec8a6
[Use cache] Align logic of use_cache
with output_attentions and output_hidden_states ( #5194 )
...
* fix use cache
* add bart use cache
* fix bart
* finish bart
2020-06-24 16:09:17 +02:00
Sylvain Gugger
64c393ee74
Don't recreate old docs ( #5243 )
2020-06-24 09:59:07 -04:00
Patrick von Platen
b29683736a
fix print in benchmark ( #5242 )
2020-06-24 15:58:49 +02:00
Patrick von Platen
9fe09cec76
[Benchmark] Extend Benchmark to all model type extensions ( #5241 )
...
* add benchmark for all kinds of models
* improved import
* delete bogus files
* make style
2020-06-24 15:11:42 +02:00
Sylvain Gugger
7c41057d50
Add hugs ( #5225 )
2020-06-24 07:56:14 -04:00
Sylvain Gugger
5e85b324ec
Use the script in utils ( #5224 )
2020-06-24 07:55:58 -04:00
flozi00
5e31a98ab7
Create README.md ( #5108 )
...
* Create README.md
* Update model_cards/a-ware/roberta-large-squad-classification/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-06-24 04:45:51 -04:00
Adriano Diniz
033124e5f8
Update README.md ( #5199 )
...
Fix/add information in README.md
2020-06-24 04:42:46 -04:00
ahotrod
7ca6627ec3
Create README.md ( #5217 )
...
electra_large_discriminator_squad2_512 Question Answering LM
2020-06-24 04:40:50 -04:00
Kevin Canwen Xu
54e9ce785d
Fix PABEE division by zero error ( #5233 )
...
* Fix PABEE division by zero error
* patience=0 by default
2020-06-24 16:10:36 +08:00
Sylvain Gugger
9022ef021a
Only put tensors on a device ( #5223 )
...
* Only put tensors on a device
* Type hint and unpack list comprehension
2020-06-23 17:30:17 -04:00
Sylvain Gugger
173528e368
Add version control menu ( #5222 )
...
* Add version control menu
* Constify things
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
2020-06-23 17:05:12 -04:00
Sam Shleifer
76e5af4cfd
[pl_examples] revert deletion of optimizer_step ( #5227 )
2020-06-23 16:40:45 -04:00
Julien Chaumond
c01480bba3
[file_utils] Type user-agent
2020-06-23 18:31:13 +02:00
Sam Shleifer
58918c76f4
[bart] add config.extra_pos_embeddings to facilitate reuse ( #5190 )
2020-06-23 11:35:42 -04:00
Thomas Wolf
b28b537131
More clear error message in the use-case of #5169 ( #5184 )
2020-06-23 13:37:29 +02:00
Thomas Wolf
11fdde0271
Tokenizers API developments ( #5103 )
...
* Add return lengths
* make pad a bit more flexible so it can be used as collate_fn
* check all kwargs sent to encoding method are known
* fixing kwargs in encodings
* New AddedToken class in python
This class let you specify specifique tokenization behaviors for some special tokens. Used in particular for GPT2 and Roberta, to control how white spaces are stripped around special tokens.
* style and quality
* switched to hugginface tokenizers library for AddedTokens
* up to tokenizer 0.8.0-rc3 - update API to use AddedToken state
* style and quality
* do not raise an error on additional or unused kwargs for tokenize() but only a warning
* transfo-xl pretrained model requires torch
* Update src/transformers/tokenization_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
2020-06-23 13:36:57 +02:00
Patrick von Platen
1ae132a07d
[Reformer] Axial Pos Emb Improve mem usage reformer ( #5209 )
...
* improve mem handling
* improve mem for pos ax encodings
2020-06-23 10:49:18 +02:00
Sam Shleifer
5144104070
[fix] remove unused import ( #5206 )
2020-06-22 23:39:04 -04:00
Sam Shleifer
0d158e38c9
[fix] mobilebert had wrong path, causing slow test failure ( #5205 )
2020-06-22 23:31:36 -04:00
Sam Shleifer
f5c2a122e3
Upgrade examples to pl=0.8.1( #5146 )
2020-06-22 20:40:10 -04:00