transformers/docs/source/en/model_doc
eustlb 798f948e88
Add CSM model (#36719)
* draft structure

* depth decoder with forward pre hook

* full model forward draft

* draft update

* depth decoder update

* ConversationalSpeechModelForCausalLM udpates

* add generate

* max length criteria small fix

* udpate

* updates

* generation update

* update in loss compute

* conversion script

* update for correct input embeddings

* handle interleaved rope

* update

* update

* update

* support compile

* update training

* add doc

* update doc

* correct inits

* ConversationalSpeechModel -> Csm

* conf update

* name update

* tests CsmForCausalLMTest

* convert use cached_file

* conf + modeling updates

* generate utils handle third dim shape

* integration test

* modeling + conf updates

* common test handle more than 2 dims

* add nested audio list utils

* processing handle nested audio list

* csm processing draft

* mimi util

* init updates

* modular update

* convert modular

* processing update

* csm tests update

* generate tests handle third dim

* generate utils handle third dim

* propagate _get_initial_cache_position update

* tied_weight_keys update + convert correctly

* fix inputs_embeds

* revert audio nested list

* batch inference update + return audio

* audio_utils update

* processor update

* some more integration tests

* remove old test

* porcessing output labels

* improve

* fix

* update rope values with equivalent ones

* conversion update

* udpate tests

* handle depth decoder generation config

* remove default eos_token_id

* make style

* revert modeling_mimi

* add default generation_config

* remove sdpa since handled by default

* make

* fix conflict

* fix conflicts

* correct naming

* correct imports

* make

* causal -> conditional naming

* causal -> conditional naming

* auto update

* make

* make

* add doc

* test update

* fix weight init

* audio tokens offsets as buffer

* 4d mask in conditional class

* make

* doc update

* fix causal mask

* fix causal mask

* doc update

* doc update

* add processor doc

* update doc

* fix 4d causal mask

* update make_list_of_audio

* do not default to mutable

* remove duplicates

* remove useless reset_parameters

* use GradientCheckpointingLayer

* use can_return_tuple

* formatting

* prepend placeholder in _sample

* torch compile fix

* some more fixies

* convert modular

* fix

* default max_length in convert

* handle depth decoder generation config correctly

* clearer formulation

* handle output_loading_info

* handle softmax warning

* add doc

* propagate _get_initial_cache_position changes

* generation in its own module

* add processor tests

* fix compile witu cuda graphs

* fix compile with cuda graphs

* add csm.md

* include CSM loss

* doc nit

* doc nit

* doc nit

* Update docs/source/en/model_doc/csm.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* add save_audio to processor

* Update src/transformers/models/csm/modular_csm.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* doc update

* simplify audio_codes_mask computation

* doc update

* simplify loss computation

* fix static cache test

* fix

* remove comment

* simplify encoded length computation

* use hf-internal-testing

* doc update

* cast to float before numpy

* nit

* mem efficient codebook head

* nit

* cat input values with cutoffs

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2025-05-07 10:20:13 -04:00
..
albert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
align.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
altclip.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
aria.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
audio-spectrogram-transformer.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
auto.md Add auto model for image-text-to-text (#32472) 2024-10-08 14:26:43 +02:00
autoformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
aya_vision.md Add aya (#36521) 2025-03-04 12:24:33 +01:00
bamba.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bark.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bart.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
barthez.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bartpho.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
beit.md [Fast Processor] BEiT (#37005) 2025-05-06 17:40:28 -04:00
bert-generation.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bert-japanese.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bert.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
bertweet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
big_bird.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bigbird_pegasus.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
biogpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bit.md Add ImageProcessorFast to BiT processor (#37180) 2025-04-14 17:07:48 +02:00
bitnet.md Add Bitnet model (#37742) 2025-04-28 15:08:46 +02:00
blenderbot-small.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
blenderbot.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
blip-2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
blip.md Simplify soft dependencies and update the dummy-creation process (#36827) 2025-04-11 11:08:36 +02:00
bloom.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bort.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bridgetower.md Bridgetower fast image processor (#37373) 2025-04-16 22:39:18 +02:00
bros.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
byt5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
camembert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
canine.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
chameleon.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
chinese_clip.md Add Fast Chinese-CLIP Processor (#37012) 2025-04-15 18:31:20 +02:00
clap.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
clip.md Updated the model card for CLIP (#37040) 2025-04-02 14:57:38 -07:00
clipseg.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
clvp.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
code_llama.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
codegen.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
cohere.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
cohere2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
colpali.md Refactor ColPali model documentation (#37309) 2025-04-15 13:52:11 -07:00
conditional_detr.md Add Fast Conditional-DETR Processor (#37071) 2025-04-15 18:33:34 +02:00
convbert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
convnext.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
convnextv2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
cpm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
cpmant.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
csm.md Add CSM model (#36719) 2025-05-07 10:20:13 -04:00
ctrl.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
cvt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
d_fine.md Add D-FINE Model into Transformers (#36261) 2025-04-29 12:17:55 +01:00
dab-detr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dac.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
data2vec.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dbrx.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
deberta-v2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
deberta.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
decision_transformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
deepseek_v3.md [WIP] add deepseek-v3 (#35926) 2025-03-28 15:56:59 +01:00
deformable_detr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
deit.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
deplot.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
depth_anything_v2.md Add post_process_depth_estimation to image processors and support ZoeDepth's inference intricacies (#32550) 2024-10-22 15:50:54 +02:00
depth_anything.md Update model card for Depth Anything (#37065) 2025-04-04 11:36:05 -07:00
depth_pro.md fix typos in the docs directory (#36639) 2025-03-11 09:41:41 -07:00
deta.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
detr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dialogpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
diffllama.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dinat.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dinov2_with_registers.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
dinov2.md Add usage example for DINOv2 (#37398) 2025-05-01 08:54:22 -07:00
distilbert.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
dit.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
donut.md Add Fast Image Processor for Donut (#37081) 2025-04-14 16:24:01 +02:00
dpr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dpt.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
efficientformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
efficientnet.md Add EfficientNet Image PreProcessor (#37055) 2025-04-16 21:59:24 +02:00
electra.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
emu3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
encodec.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
encoder-decoder.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ernie_m.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ernie.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
esm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
falcon_mamba.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
falcon.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
falcon3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fastspeech2_conformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
flan-t5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
flan-ul2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
flaubert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
flava.md Add Fast Image Processor for Flava (#37135) 2025-04-14 15:05:31 +02:00
fnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
focalnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fsmt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
funnel.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fuyu.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gemma.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
gemma2.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
gemma3.md fixed gemma3 collection path pointing to llama 2 collection. (#37899) 2025-04-30 12:50:54 -07:00
git.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
glm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
glm4.md Add glm4 (#37388) 2025-04-09 14:02:04 +02:00
glpn.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
got_ocr2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt_bigcode.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt_neo.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt_neox_japanese.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt_neox.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt-sw3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt2.md Aligning modling code for GPT2 to work with vLLM (fallback) (#36934) 2025-05-02 09:55:16 +02:00
gptj.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gptsan-japanese.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
granite_speech.md Add Granite Speech Support (#36801) 2025-04-11 18:52:00 +02:00
granite.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
granitemoe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
granitemoehybrid.md Add GraniteMoeHybrid support for 4.0 (#37658) 2025-05-06 06:47:43 +02:00
granitemoeshared.md add shared experts for upcoming Granite 4.0 language models (#35894) 2025-02-14 16:55:28 +01:00
granitevision.md Update Granite Vision Model Path / Tests (#35998) 2025-02-03 20:06:03 +01:00
graphormer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
grounding-dino.md Add Fast Grounding-Dino Processor (#37108) 2025-04-16 12:26:08 +02:00
groupvit.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
helium.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
herbert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
hgnet_v2.md Add D-FINE Model into Transformers (#36261) 2025-04-29 12:17:55 +01:00
hiera.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
hubert.md [MINOR:TYPO] Update hubert.md (#36733) 2025-03-17 09:07:51 -07:00
ibert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
idefics.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
idefics2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
idefics3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ijepa.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
imagegpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
informer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
instructblip.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
instructblipvideo.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
internvl.md [internvl] fix chat template (#37656) 2025-04-23 16:56:36 +02:00
jamba.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
janus.md Add Janus model (#36053) 2025-04-17 09:18:51 +02:00
jetmoe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
jukebox.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
kosmos-2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
layoutlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
layoutlmv2.md Add Fast Image Processor for LayoutLMv2 (#37203) 2025-04-14 15:06:41 +02:00
layoutlmv3.md Add Fast Image Processor for LayoutLMv3 (#37201) 2025-04-14 15:42:11 +02:00
layoutxlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
led.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
levit.md Add Fast LeViT Processor (#37154) 2025-04-14 17:07:36 +02:00
lilt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
llama.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
llama2.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
llama3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
llama4.md Add llama4 (#37307) 2025-04-05 22:02:22 +02:00
llava_next_video.md fix typos in the docs directory (#36639) 2025-03-11 09:41:41 -07:00
llava_next.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
llava_onevision.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
llava.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
longformer.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
longt5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
luke.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
lxmert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
m2m_100.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
madlad-400.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mamba.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mamba2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
marian.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
markuplm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mask2former.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
maskformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
matcha.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mbart.md Updated model card for mbart and mbart50 (#37619) 2025-04-22 12:26:47 -07:00
mctct.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mega.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
megatron_gpt2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
megatron-bert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mgp-str.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mimi.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mistral.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
mistral3.md Fix Mistral3 tests (#36797) 2025-03-18 13:08:12 -04:00
mixtral.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mlcd.md Add MLCD model (#36182) 2025-04-15 11:33:09 +01:00
mllama.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mluke.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mms.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mobilebert.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
mobilenet_v1.md Add Fast Image Processor for MobileNetV1 (#37111) 2025-04-23 15:55:41 -04:00
mobilenet_v2.md Add Fast Mobilenet-V2 Processor (#37113) 2025-04-14 17:08:47 +02:00
mobilevit.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mobilevitv2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
modernbert.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
moonshine.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
moshi.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mpnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mra.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mt5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
musicgen_melody.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
musicgen.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mvp.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
myt5.md [WIP] Add Tokenizer for MyT5 Model (#31286) 2024-10-06 10:33:16 +02:00
nat.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nemotron.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nezha.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nllb-moe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nllb.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nougat.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nystromformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
olmo.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
olmo2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
olmoe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
omdet-turbo.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
oneformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
open-llama.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
openai-gpt.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
opt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
owlv2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
owlvit.md Add Fast owlvit Processor (#37164) 2025-04-14 17:58:09 +02:00
paligemma.md [docs] Attention mask image (#36970) 2025-03-26 10:11:34 -07:00
patchtsmixer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
patchtst.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pegasus_x.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pegasus.md Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
perceiver.md Add Fast Image Processor for Perceiver (#37176) 2025-04-14 13:49:13 +02:00
persimmon.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
phi.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
phi3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
phi4_multimodal.md Update Phi4 converter (#37594) 2025-04-17 23:08:24 +02:00
phimoe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
phobert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pix2struct.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pixtral.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
plbart.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
poolformer.md Add Fast Image Processor for PoolFormer (#37182) 2025-04-23 15:55:33 -04:00
pop2piano.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
prompt_depth_anything.md Add Prompt Depth Anything Model (#35401) 2025-03-20 16:12:44 +00:00
prophetnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pvt_v2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pvt.md Add Fast PVT Processor (#37204) 2025-04-23 15:55:20 -04:00
qdqbert.md Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
qwen2_5_omni.md [doc] fix the code examples in qwen doc (#37803) 2025-04-28 11:56:32 +01:00
qwen2_5_vl.md Fix qwen2-vl-docs. (#37879) 2025-04-30 13:32:21 +01:00
qwen2_audio.md [qwen2 audio] remove redundant code and update docs (#36282) 2025-03-20 10:54:51 +00:00
qwen2_moe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
qwen2_vl.md [qwen-vl] Standardize config (#37268) 2025-04-17 09:38:12 +02:00
qwen2.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
qwen3_moe.md Adding Qwen3 and Qwen3MoE (#36878) 2025-03-31 09:50:49 +02:00
qwen3.md Adding Qwen3 and Qwen3MoE (#36878) 2025-03-31 09:50:49 +02:00
rag.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
realm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
recurrent_gemma.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
reformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
regnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
rembert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
resnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
retribert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
roberta-prelayernorm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
roberta.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
roc_bert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
roformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
rt_detr_v2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
rt_detr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
rwkv.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
sam_hq.md Samhq model addition (#35147) 2025-04-28 19:07:09 +02:00
sam.md Create and Expose SamVisionModel as public for better accessibility (#36493) 2025-03-31 11:45:07 +02:00
seamless_m4t_v2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
seamless_m4t.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
segformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
seggpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
sew-d.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
sew.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
shieldgemma2.md Adding links to ShieldGemma 2 technical report (#37247) 2025-04-03 16:26:29 +01:00
siglip.md chore: update model card for SigLIP (#37585) 2025-04-18 13:30:41 -07:00
siglip2.md chore: update SigLIP2 model card (#37624) 2025-04-25 12:46:17 -07:00
smolvlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
speech_to_text_2.md Deprecate low use models (#30781) 2024-05-28 18:07:07 +01:00
speech_to_text.md chore: Fix typos in docs and examples (#36524) 2025-03-04 13:47:41 +00:00
speech-encoder-decoder.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
speecht5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
splinter.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
squeezebert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
stablelm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
starcoder2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
superglue.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
superpoint.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
swiftformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
swin.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
swin2sr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
swinv2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
switch_transformers.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
t5.md Transformers cli clean command (#37657) 2025-04-30 12:15:43 +01:00
t5v1.1.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
table-transformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
tapas.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
tapex.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
textnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
time_series_transformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
timesfm.md Add TimesFM Time Series Forecasting Model (#34082) 2025-04-16 15:00:53 +02:00
timesformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
timm_wrapper.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
trajectory_transformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
transfo-xl.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
trocr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
tvlt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
tvp.md chore: Fix typos in docs and examples (#36524) 2025-03-04 13:47:41 +00:00
udop.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ul2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
umt5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
unispeech-sat.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
unispeech.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
univnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
upernet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
van.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
video_llava.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
videomae.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
vilt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vipllava.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vision-encoder-decoder.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vision-text-dual-encoder.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
visual_bert.md Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
vit_hybrid.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vit_mae.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
vit_msn.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
vit.md [docs] Model docs (#36469) 2025-03-21 15:35:22 -07:00
vitdet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vitmatte.md Fast image processor for VitMatte added and bug in slow version fixed (#37616) 2025-04-28 14:51:50 -04:00
vitpose.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vits.md Update VITS model card (#37335) 2025-04-15 13:16:05 -07:00
vivit.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
wav2vec2_phoneme.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
wav2vec2-bert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
wav2vec2-conformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
wav2vec2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
wavlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
whisper.md [docs] Model docs (#36469) 2025-03-21 15:35:22 -07:00
xclip.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xglm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm-prophetnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm-roberta-xl.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm-roberta.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm-v.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xls_r.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlsr_wav2vec2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xmod.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
yolos.md Add Fast Yolos Processor (#37292) 2025-04-15 14:23:08 +02:00
yoso.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
zamba.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
zamba2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
zoedepth.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00