transformers/docs/source/en/model_doc
Jason Phang 0041be5b3d
LLaMA Implementation (#21955)
* LLaMA

* sharding and docs

* tweak

* black

* inits

* ruff

* LLAMA_PRETRAINED_CONFIG_ARCHIVE_MAP

* init

* no checkpoint

* docs

* ruff

* type_vocab_size

* tokenizer fixes

* tokenizer fixes

* Update tokenization_llama.py

* Update tokenization_llama.py

* Update configuration_llama.py

* Update modeling_llama.py

* tokenizer add_bos by default

* licenses

* remove decoder

* norms and mlp

* rope overhaul

* tweaks

* black

* mention OPT implementation

* off-by-one naming

* typo

* fix

* tokenization fix and slicing bug

* padding config

* cleanup

* black

* update tests

* undo typo

* fix vocab caching logic

* ruff

* docbuilder

* attn fix from BlackSamorez

* initial feedback

* typo

* docs

* llama case

* llama case

* load checkpoint docs

* comment about tokenizer

* tokenizer defaults

* clear past_key_values if use_cache=False

* last tweaks

* last tweaks

* last tweaks

* last tweaks

---------

Co-authored-by: Stella Biderman <stellabiderman@gmail.com>
2023-03-16 09:00:53 -04:00
..
albert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
align.mdx Fix typo in Align docs (#22199) 2023-03-16 13:41:48 +03:00
altclip.mdx Add AltCLIP (#20446) 2023-01-04 09:18:57 +01:00
audio-spectrogram-transformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
auto.mdx Add AutoModelForZeroShotImageClassification (#22087) 2023-03-13 12:46:14 +03:00
bart.mdx Fix en documentation typos (#21799) 2023-02-27 08:36:36 +01:00
barthez.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
bartpho.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
beit.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
bert-generation.mdx Result of new doc style with fixes (#17015) 2022-04-29 17:42:15 -04:00
bert-japanese.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
bert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
bertweet.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
big_bird.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
bigbird_pegasus.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
biogpt.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
bit.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
blenderbot-small.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
blenderbot.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
blip-2.mdx [Blip2] Add Blip2Model (#21817) 2023-02-28 15:42:55 +01:00
blip.mdx blip support for training (#21021) 2023-01-18 11:24:37 +01:00
bloom.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
bort.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
bridgetower.mdx [WIP] Add BridgeTowerForContrastiveLearning (#21964) 2023-03-08 09:00:54 -05:00
byt5.mdx [Doctests] Fix all T5 doc tests (#16646) 2022-04-13 11:36:54 +02:00
camembert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
canine.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
chinese_clip.mdx Add Chinese-CLIP implementation (#20368) 2022-11-30 19:22:23 +01:00
clap.mdx [CLAP] Add CLAP to the library (#21370) 2023-02-16 20:59:27 +01:00
clip.mdx Add batch of resources (#20647) 2023-01-17 17:18:56 +01:00
clipseg.mdx [CLIPSeg] Add resources (#20118) 2022-11-09 18:31:22 +01:00
codegen.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
conditional_detr.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
convbert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
convnext.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
convnextv2.mdx Add ConvNeXT V2 (#21679) 2023-03-14 12:08:14 +03:00
cpm.mdx Allow all imports from transformers (#17050) 2022-05-02 12:47:39 -04:00
ctrl.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
cvt.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
data2vec.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
deberta-v2.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
deberta.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
decision_transformer.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
deformable_detr.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
deit.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
deta.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
detr.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
dialogpt.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
dinat.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
distilbert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
dit.mdx Add batch of resources (#20647) 2023-01-17 17:18:56 +01:00
donut.mdx Add Donut image processor (#20425) 2022-11-29 10:38:01 +00:00
dpr.mdx Refactor model summary (#21408) 2023-02-15 10:35:14 -08:00
dpt.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
efficientformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
efficientnet.mdx Add EfficientNet (#21563) 2023-02-20 16:37:11 +03:00
electra.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
encoder-decoder.mdx [EncoderDecoder] Improve docs (#18271) 2022-07-27 10:08:59 +02:00
ernie_m.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
ernie.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
esm.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
flan-t5.mdx Update flan-t5 original model link (#20897) 2022-12-27 02:26:14 -05:00
flan-ul2.mdx Fix small typo in flan-ul2.mdx (#22068) 2023-03-10 07:44:45 -05:00
flaubert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
flava.mdx AutoImageProcessor (#20111) 2022-11-08 19:54:41 +00:00
fnet.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
fsmt.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
funnel.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
git.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
glpn.mdx Add batch of resources (#20647) 2023-01-17 17:18:56 +01:00
gpt_neo.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
gpt_neox_japanese.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
gpt_neox.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
gpt-sw3.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
gpt2.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
gptj.mdx GPT-J specific half precision on CPU note (#22086) 2023-03-10 14:03:43 -05:00
gptsan-japanese.mdx add GPTSAN model (reopen) (#21291) 2023-02-20 11:25:27 +01:00
graphormer.mdx Graphormer model for Graph Classification (#20968) 2023-01-19 13:05:59 -05:00
groupvit.mdx Add batch of resources (#20647) 2023-01-17 17:18:56 +01:00
herbert.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
hubert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
ibert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
imagegpt.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
informer.mdx [Time-Series] informer model (#21099) 2023-03-07 21:36:38 +01:00
jukebox.mdx Fix en documentation typos (#21799) 2023-02-27 08:36:36 +01:00
layoutlm.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
layoutlmv2.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
layoutlmv3.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
layoutxlm.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
led.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
levit.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
lilt.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
llama.mdx LLaMA Implementation (#21955) 2023-03-16 09:00:53 -04:00
longformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
longt5.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
luke.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
lxmert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
m2m_100.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
marian.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
markuplm.mdx Fix en documentation typos (#21799) 2023-02-27 08:36:36 +01:00
mask2former.mdx [Mask2Former] Add doc tests (#21232) 2023-01-25 12:34:43 +01:00
maskformer.mdx Add Mask2Former (#20792) 2023-01-16 20:37:07 +03:00
mbart.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
mctct.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
megatron_gpt2.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
megatron-bert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
mgp-str.mdx add new model of MGP-STR (#21418) 2023-03-13 10:11:31 +00:00
mluke.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
mobilebert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
mobilenet_v1.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
mobilenet_v2.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
mobilevit.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
mpnet.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
mt5.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
mvp.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
nat.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
nezha.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
nllb.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
nystromformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
oneformer.mdx [Mask2Former] Add doc tests (#21232) 2023-01-25 12:34:43 +01:00
openai-gpt.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
opt.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
owlvit.mdx Improve OWL-ViT postprocessing (#20980) 2023-01-03 19:25:09 +03:00
pegasus_x.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
pegasus.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
perceiver.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
phobert.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
plbart.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
poolformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
prophetnet.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
qdqbert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
rag.mdx Refactor model summary (#21408) 2023-02-15 10:35:14 -08:00
realm.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
reformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
regnet.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
rembert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
resnet.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
retribert.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
roberta-prelayernorm.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
roberta.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
roc_bert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
roformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
segformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
sew-d.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
sew.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
speech_to_text_2.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
speech_to_text.mdx Fix some doctests after PR 15775 (#20036) 2022-11-03 14:18:45 +01:00
speech-encoder-decoder.mdx Replace as_target context managers by direct calls (#18325) 2022-07-29 08:09:09 -04:00
speecht5.mdx [WIP] add SpeechT5 model (#18922) 2023-02-03 12:43:46 -05:00
splinter.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
squeezebert.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
swin.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
swin2sr.mdx Add Swin2SR (#19784) 2022-12-16 16:24:01 +01:00
swinv2.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
switch_transformers.mdx Fix en documentation typos (#21799) 2023-02-27 08:36:36 +01:00
t5.mdx [Flan-UL2] Add-flan-ul2 (#21929) 2023-03-03 17:57:24 +01:00
t5v1.1.mdx docs: Resolve many typos in the English docs (#20088) 2022-11-07 09:19:04 -05:00
table-transformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
tapas.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
tapex.mdx Add TAPEX (#16473) 2022-04-08 10:57:51 +02:00
time_series_transformer.mdx [MINOR] Fix link in timeseries transformer docs (#21602) 2023-02-13 10:11:16 -05:00
timesformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
trajectory_transformer.mdx Add trajectory transformer (#17141) 2022-05-17 19:07:43 -04:00
transfo-xl.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
trocr.mdx Update doc examples feature extractor -> image processor (#20501) 2022-11-30 14:50:55 +00:00
tvlt.mdx Add TVLT (#20725) 2023-02-15 18:10:30 +00:00
ul2.mdx Add UL2 (just docs) (#17740) 2022-06-21 10:24:50 +02:00
unispeech-sat.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
unispeech.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
upernet.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
van.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
videomae.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
vilt.mdx AutoImageProcessor (#20111) 2022-11-08 19:54:41 +00:00
vision-encoder-decoder.mdx Update doc examples feature extractor -> image processor (#20501) 2022-11-30 14:50:55 +00:00
vision-text-dual-encoder.mdx Add TFVisionTextDualEncoder (#21873) 2023-03-01 18:00:48 +00:00
visual_bert.mdx Update doc examples feature extractor -> image processor (#20501) 2022-11-30 14:50:55 +00:00
vit_hybrid.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
vit_mae.mdx Add batch of resources (#20647) 2023-01-17 17:18:56 +01:00
vit_msn.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
vit.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
wav2vec2_phoneme.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
wav2vec2-conformer.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
wav2vec2.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
wavlm.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
whisper.mdx [Whisper] Add model for audio classification (#21754) 2023-03-07 16:20:21 +01:00
xclip.mdx Add batch of resources (#20647) 2023-01-17 17:18:56 +01:00
xglm.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
xlm-prophetnet.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
xlm-roberta-xl.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
xlm-roberta.mdx Fix en documentation typos (#21799) 2023-02-27 08:36:36 +01:00
xlm-v.mdx Add XLM-V to Model Doc (#21498) 2023-02-07 16:43:19 -05:00
xlm.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
xlnet.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
xls_r.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
xlsr_wav2vec2.mdx Enable doc in Spanish (#16518) 2022-04-04 10:25:46 -04:00
xmod.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
yolos.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00
yoso.mdx Adding task guides to resources (#21704) 2023-02-21 10:35:11 -05:00