transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-16 11:08:23 +06:00

Suraj Patil 860264379f GPT Neo (#10848)	2021-03-30 09:42:30 -04:00
* lets begin
* boom boom
* fix out proj in attn
* fix attention
* fix local attention
* add tokenizer
* fix imports
* autotokenizer
* fix checkpoint name
* cleanup
* more clean-up
* more cleanup
* output attentions
* fix attn mask creation
* fix imports
* config doc
* add tests
* add slow tests
* quality
* add conversion script
* copyright
* typo
* another bites the dust
* fix attention tests
* doc
* add embed init in convert function
* fix copies
* remove tokenizer
* enable caching
* address review comments
* improve config and create attn layer list internally
* more consistent naming
* init hf config from mesh-tf config json file
* remove neo tokenizer from doc
* handle attention_mask in local attn layer
* attn_layers => attention_layers
* add tokenizer_class in config
* fix docstring
* raise if len of attention_layers is not same as num_layers
* remove tokenizer_class from config
* more consistent naming
* fix doc
* fix checkpoint names
* fp16 compat
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
_static	Document v4.4.2	2021-03-18 15:19:25 -04:00
imgs	[Templates] Add template "call-for-model" markdown and "call-for-big-bird" markdown (#9921)	2021-02-05 15:47:54 +03:00
internal	Instantiate model only once in pipeline (#10888)	2021-03-29 10:39:14 -04:00
main_classes	Add ImageFeatureExtractionMixin (#10905)	2021-03-26 11:23:56 -04:00
model_doc	GPT Neo (#10848)	2021-03-30 09:42:30 -04:00
add_new_model.rst	Add new model docs (#9667)	2021-02-01 17:55:10 +03:00
benchmarks.rst	Make doc styler detect lists on rst (#9488)	2021-01-11 08:53:41 -05:00
bertology.rst	Fix documentation links always pointing to master. (#9217)	2021-01-05 06:18:48 -05:00
community.md	Add notebook on fine-tuning Bart (#10883)	2021-03-24 11:03:37 -04:00
conf.py	Development on v4.5.0dev0	2021-03-16 11:41:15 -04:00
contributing.md	Update installation page and add contributing to the doc (#5084)	2020-06-17 14:01:10 -04:00
converting_tensorflow_models.rst	Fix broken links in the converting tf ckpt document (#9791)	2021-01-26 03:37:57 -05:00
custom_datasets.rst	Rename NLP library to Datasets library (#10920)	2021-03-26 08:07:59 -04:00
examples.md	per_device instead of per_gpu/error thrown when argument unknown (#4618)	2020-05-27 11:36:55 -04:00
favicon.ico	Adding usage examples for common tasks (#2850)	2020-02-25 13:48:24 -05:00
glossary.rst	Adds terms to Glossary (#10443)	2021-02-28 08:27:54 -05:00
index.rst	GPT Neo (#10848)	2021-03-30 09:42:30 -04:00
installation.md	split seq2seq script into summarization & translation (#10611)	2021-03-15 09:11:42 -04:00
migration.md	Copyright (#8970)	2020-12-07 18:36:34 -05:00
model_sharing.rst	[doc] nested markup is invalid in rst (#9898)	2021-01-30 09:59:19 -05:00
model_summary.rst	ConvBERT Model (#9717)	2021-01-27 03:20:09 -05:00
multilingual.rst	Fix documentation links always pointing to master. (#9217)	2021-01-05 06:18:48 -05:00
notebooks.md	Update notebooks (#3620)	2020-04-06 14:32:39 -04:00
perplexity.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
philosophy.rst	Minor documentation revisions from copyediting (#9266)	2020-12-23 10:15:49 -05:00
preprocessing.rst	Minor documentation revisions from copyediting (#9266)	2020-12-23 10:15:49 -05:00
pretrained_models.rst	GPT Neo (#10848)	2021-03-30 09:42:30 -04:00
quicktour.rst	Minor documentation revisions from copyediting (#9266)	2020-12-23 10:15:49 -05:00
sagemaker.md	make local setup more clearer and added missing links (#10899)	2021-03-25 09:01:31 -04:00
serialization.rst	Copyright (#8970)	2020-12-07 18:36:34 -05:00
task_summary.rst	split seq2seq script into summarization & translation (#10611)	2021-03-15 09:11:42 -04:00
testing.rst	[doc] [testing] extend the pytest -k section with more examples (#10761)	2021-03-17 09:23:38 -04:00
tokenizer_summary.rst	Minor documentation revisions from copyediting (#9266)	2020-12-23 10:15:49 -05:00
training.rst	[trainer] deepspeed integration (#9211)	2021-01-12 19:05:18 -08:00