transformers/docs/source
Thomas Wolf ba8c4d0ac0
[Dependencies|tokenizers] Make both SentencePiece and Tokenizers optional dependencies (#7659)
* splitting fast and slow tokenizers [WIP]

* [WIP] splitting sentencepiece and tokenizers dependencies

* update dummy objects

* add name_or_path to models and tokenizers

* prefix added to file names

* prefix

* styling + quality

* spliting all the tokenizer files - sorting sentencepiece based ones

* update tokenizer version up to 0.9.0

* remove hard dependency on sentencepiece 🎉

* and removed hard dependency on tokenizers 🎉

* update conversion script

* update missing models

* fixing tests

* move test_tokenization_fast to main tokenization tests - fix bugs

* bump up tokenizers

* fix bert_generation

* update ad fix several tokenizers

* keep sentencepiece in deps for now

* fix funnel and deberta tests

* fix fsmt

* fix marian tests

* fix layoutlm

* fix squeezebert and gpt2

* fix T5 tokenization

* fix xlnet tests

* style

* fix mbart

* bump up tokenizers to 0.9.2

* fix model tests

* fix tf models

* fix seq2seq examples

* fix tests without sentencepiece

* fix slow => fast  conversion without sentencepiece

* update auto and bert generation tests

* fix mbart tests

* fix auto and common test without tokenizers

* fix tests without tokenizers

* clean up tests lighten up when tokenizers + sentencepiece are both off

* style quality and tests fixing

* add sentencepiece to doc/examples reqs

* leave sentencepiece on for now

* style quality split hebert and fix pegasus

* WIP Herbert fast

* add sample_text_no_unicode and fix hebert tokenization

* skip FSMT example test for now

* fix style

* fix fsmt in example tests

* update following Lysandre and Sylvain's comments

* Update src/transformers/testing_utils.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/testing_utils.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/tokenization_utils_base.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/tokenization_utils_base.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2020-10-18 20:51:24 +02:00
..
_static The toggle actually sticks (#7586) 2020-10-05 11:23:57 -04:00
imgs Guide to fixed-length model perplexity evaluation (#5449) 2020-07-07 16:04:15 -06:00
internal Add predict step accumulation (#7767) 2020-10-14 11:41:45 -04:00
main_classes fixed lots of typos. (#7758) 2020-10-13 10:00:20 -04:00
model_doc Gpt1 for sequence classification (#7683) 2020-10-13 05:06:15 -04:00
benchmarks.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
bertology.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
conf.py Release: v3.3.1 2020-09-29 14:17:34 -04:00
contributing.md Update installation page and add contributing to the doc (#5084) 2020-06-17 14:01:10 -04:00
converting_tensorflow_models.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
custom_datasets.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
examples.md per_device instead of per_gpu/error thrown when argument unknown (#4618) 2020-05-27 11:36:55 -04:00
favicon.ico Adding usage examples for common tasks (#2850) 2020-02-25 13:48:24 -05:00
glossary.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
index.rst Better links for models in READMED and doc index (#7680) 2020-10-09 11:17:16 -04:00
installation.md Make transformers install check positive (#7473) 2020-09-30 07:44:40 -04:00
migration.md Add hugs (#5225) 2020-06-24 07:56:14 -04:00
model_sharing.rst docs: fix model sharing file names (#5855) 2020-09-28 08:17:30 -04:00
model_summary.rst Document RAG again (#7377) 2020-09-28 08:31:46 -04:00
multilingual.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
notebooks.md Update notebooks (#3620) 2020-04-06 14:32:39 -04:00
perplexity.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
philosophy.rst fixed lots of typos. (#7758) 2020-10-13 10:00:20 -04:00
preprocessing.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
pretrained_models.rst Update XLM-RoBERTa details (#7669) 2020-10-09 05:16:58 -04:00
quicktour.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
serialization.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
task_summary.rst [Dependencies|tokenizers] Make both SentencePiece and Tokenizers optional dependencies (#7659) 2020-10-18 20:51:24 +02:00
testing.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
tokenizer_summary.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
training.rst Typo and fix the input of labels to cross_entropy (#7841) 2020-10-15 19:36:31 -04:00