transformers/docs/source/model_doc
Thomas Wolf 9aeacb58ba
Adding Fast tokenizers for SentencePiece based tokenizers - Breaking: remove Transfo-XL fast tokenizer (#7141)
* [WIP] SP tokenizers

* fixing tests for T5

* WIP tokenizers

* serialization

* update T5

* WIP T5 tokenization

* slow to fast conversion script

* Refactoring to move tokenzier implementations inside transformers

* Adding gpt - refactoring - quality

* WIP adding several tokenizers to the fast world

* WIP Roberta - moving implementations

* update to dev4 switch file loading to in-memory loading

* Updating and fixing

* advancing on the tokenizers - updating do_lower_case

* style and quality

* moving forward with tokenizers conversion and tests

* MBart, T5

* dumping the fast version of transformer XL

* Adding to autotokenizers + style/quality

* update init and space_between_special_tokens

* style and quality

* bump up tokenizers version

* add protobuf

* fix pickle Bert JP with Mecab

* fix newly added tokenizers

* style and quality

* fix bert japanese

* fix funnel

* limite tokenizer warning to one occurence

* clean up file

* fix new tokenizers

* fast tokenizers deep tests

* WIP adding all the special fast tests on the new fast tokenizers

* quick fix

* adding more fast tokenizers in the fast tests

* all tokenizers in fast version tested

* Adding BertGenerationFast

* bump up setup.py for CI

* remove BertGenerationFast (too early)

* bump up tokenizers version

* Clean old docstrings

* Typo

* Update following Lysandre comments

Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
2020-10-08 11:32:16 +02:00
..
albert.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
auto.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
bart.rst Cleanup documentation for BART, Marian, MBART and Pegasus (#7523) 2020-10-05 04:22:12 -04:00
bert.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
bertgeneration.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
blenderbot.rst Blenderbot (#7418) 2020-10-07 19:09:23 -04:00
camembert.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
ctrl.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
deberta.rst Add DeBERTa model (#5929) 2020-09-30 07:07:30 -04:00
dialogpt.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
distilbert.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
dpr.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
electra.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
encoderdecoder.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
flaubert.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
fsmt.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
funnel.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
gpt.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
gpt2.rst Add GPT2ForSequenceClassification based on DialogRPT (#7501) 2020-10-06 17:31:21 -04:00
layoutlm.rst Update LayoutLM doc (#7388) 2020-10-01 09:11:42 -04:00
longformer.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
lxmert.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
marian.rst Cleanup documentation for BART, Marian, MBART and Pegasus (#7523) 2020-10-05 04:22:12 -04:00
mbart.rst Cleanup documentation for BART, Marian, MBART and Pegasus (#7523) 2020-10-05 04:22:12 -04:00
mobilebert.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
pegasus.rst Cleanup documentation for BART, Marian, MBART and Pegasus (#7523) 2020-10-05 04:22:12 -04:00
rag.rst Document RAG again (#7377) 2020-09-28 08:31:46 -04:00
reformer.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
retribert.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
roberta.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
squeezebert.rst Fix squeezebert docs (#7587) 2020-10-06 06:22:04 -04:00
t5.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
transformerxl.rst Adding Fast tokenizers for SentencePiece based tokenizers - Breaking: remove Transfo-XL fast tokenizer (#7141) 2020-10-08 11:32:16 +02:00
xlm.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
xlmroberta.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00
xlnet.rst Models doc (#7345) 2020-09-23 13:20:45 -04:00