transformers/docs/source
Suraj Patil 88ef8893cd
Add caching mechanism to BERT, RoBERTa (#9183)
* add past_key_values

* add use_cache option

* make mask before cutting ids

* adjust position_ids according to past_key_values

* flatten past_key_values

* fix positional embeds

* fix _reorder_cache

* set use_cache to false when not decoder, fix attention mask init

* add test for caching

* add past_key_values for Roberta

* fix position embeds

* add caching test for roberta

* add doc

* make style

* doc, fix attention mask, test

* small fixes

* address Patrick's comments

* input_ids shouldn't start with pad token

* use_cache only when decoder

* make consistent with bert

* make copies consistent

* add use_cache to encoder

* add past_key_values to tapas attention

* apply suggestions from code review

* make copies consistent

* add attn mask in tests

* remove copied from longformer

* apply suggestions from code review

* fix bart test

* nit

* simplify model outputs

* fix doc

* fix output ordering
2020-12-23 23:01:32 +05:30
_static v4.1.1 docs 2020-12-17 11:28:38 -05:00
imgs Guide to fixed-length model perplexity evaluation (#5449) 2020-07-07 16:04:15 -06:00
internal Fix T5 and BART for TF (#9063) 2020-12-14 18:47:00 +01:00
main_classes Add caching mechanism to BERT, RoBERTa (#9183) 2020-12-23 23:01:32 +05:30
model_doc Fix script that check objects are documented (#9259) 2020-12-22 11:12:58 -05:00
benchmarks.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
bertology.rst Fixed a broken link in documentation (#9101) 2020-12-14 09:12:27 -05:00
conf.py Release: v4.1.1 2020-12-17 11:25:49 -05:00
contributing.md Update installation page and add contributing to the doc (#5084) 2020-06-17 14:01:10 -04:00
converting_tensorflow_models.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
custom_datasets.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
examples.md per_device instead of per_gpu/error thrown when argument unknown (#4618) 2020-05-27 11:36:55 -04:00
favicon.ico Adding usage examples for common tasks (#2850) 2020-02-25 13:48:24 -05:00
glossary.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
index.rst Put all models in the constants (#9170) 2020-12-17 11:23:21 -05:00
installation.md Copyright (#8970) 2020-12-07 18:36:34 -05:00
migration.md Copyright (#8970) 2020-12-07 18:36:34 -05:00
model_sharing.rst [model_cards] Migrate cards from this repo to model repos on huggingface.co (#9013) 2020-12-11 18:24:42 -05:00
model_summary.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
multilingual.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
notebooks.md Update notebooks (#3620) 2020-04-06 14:32:39 -04:00
perplexity.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
philosophy.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
preprocessing.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
pretrained_models.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
quicktour.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
serialization.rst Copyright (#8970) 2020-12-07 18:36:34 -05:00
task_summary.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
testing.rst Testing Experimental CI Features (#9070) 2020-12-14 10:34:59 -05:00
tokenizer_summary.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00
training.rst Minor documentation revisions from copyediting (#9266) 2020-12-23 10:15:49 -05:00