Mirror of https://github.com/huggingface/transformers.git (synced 2025-07-18 03:58:25 +06:00)
* add past_key_values
* add use_cache option
* make mask before cutting ids
* adjust position_ids according to past_key_values
* flatten past_key_values
* fix positional embeds
* fix _reorder_cache
* set use_cache to false when not decoder, fix attention mask init
* add test for caching
* add past_key_values for Roberta
* fix position embeds
* add caching test for roberta
* add doc
* make style
* doc, fix attention mask, test
* small fixes
* address Patrick's comments
* input_ids shouldn't start with pad token
* use_cache only when decoder
* make consistent with bert
* make copies consistent
* add use_cache to encoder
* add past_key_values to tapas attention
* apply suggestions from code review
* make copies consistent
* add attn mask in tests
* remove copied from longformer
* apply suggestions from code review
* fix bart test
* nit
* simplify model outputs
* fix doc
* fix output ordering
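Below is a minimal sketch of the caching API this change set adds: an incremental decoding loop that reuses `past_key_values` so the already-processed prefix is not re-encoded on every step. The `bert-base-uncased` checkpoint, the input sentence, and the greedy next-token pick are illustrative assumptions, not part of the commit.

```python
# Illustrative sketch (not from the commit): reuse past_key_values across
# decoding steps with a BERT checkpoint configured as a decoder.
import torch
from transformers import BertConfig, BertLMHeadModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")  # assumed checkpoint
config = BertConfig.from_pretrained("bert-base-uncased", is_decoder=True)
model = BertLMHeadModel.from_pretrained("bert-base-uncased", config=config)
model.eval()

inputs = tokenizer("Caching keys and values", return_tensors="pt")

with torch.no_grad():
    # First pass over the full prefix; use_cache=True returns the per-layer
    # key/value tensors alongside the logits.
    out = model(**inputs, use_cache=True)
    past = out.past_key_values

    # Later passes feed only the newest token; the cached states stand in
    # for the prefix, so per-step attention cost stays constant.
    next_token = out.logits[:, -1:].argmax(dim=-1)  # greedy pick, for illustration
    out = model(input_ids=next_token, past_key_values=past, use_cache=True)
    past = out.past_key_values  # updated cache now covers prefix + new token
```

When a cache is passed, the model offsets `position_ids` by the cached length internally, which is what the "adjust position_ids according to past_key_values" item above refers to.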
_static/
imgs/
internal/
main_classes/
model_doc/
benchmarks.rst
bertology.rst
conf.py
contributing.md
converting_tensorflow_models.rst
custom_datasets.rst
examples.md
favicon.ico
glossary.rst
index.rst
installation.md
migration.md
model_sharing.rst
model_summary.rst
multilingual.rst
notebooks.md
perplexity.rst
philosophy.rst
preprocessing.rst
pretrained_models.rst
quicktour.rst
serialization.rst
task_summary.rst
testing.rst
tokenizer_summary.rst
training.rst