transformers/docs/source/en/model_doc/open-llama.md
Sylvain Gugger eb849f6604
Migrate doc files to Markdown. (#24376)
* Rename index.mdx to index.md

* With saved modifs

* Address review comment

* Treat all files

* .mdx -> .md

* Remove special char

* Update utils/tests_fetcher.py

Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

---------

Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
2023-06-20 18:07:47 -04:00

1.7 KiB

Open-Llama

Overview

The Open-Llama model was proposed in Open-Llama project by community developer s-JoL.

The model is mainly based on LLaMA with some modifications, incorporating memory-efficient attention from Xformers, stable embedding from Bloom, and shared input-output embedding from PaLM. And the model is pre-trained on both Chinese and English, which gives it better performance on Chinese language tasks.

This model was contributed by s-JoL. The original code can be found Open-Llama. Checkpoint and usage can be found at s-JoL/Open-Llama-V1.

OpenLlamaConfig

autodoc OpenLlamaConfig

OpenLlamaModel

autodoc OpenLlamaModel - forward

OpenLlamaForCausalLM

autodoc OpenLlamaForCausalLM - forward

OpenLlamaForSequenceClassification

autodoc OpenLlamaForSequenceClassification - forward