transformers/docs/source/en/model_doc
Yaswanth Gali a2ef3cf537
Add Janus model (#36053)
* Iterative generation using input embeds

* Add Janus model

* discard changes

* Janus imports

* Refactor config and processor

* Added Vision tower of Janus

* Import Janus Image processor

* Vision tower fixes

* Refactor code

* Added VQ Model

* Complete model integration

* temp conversion script

* processor refactor

* Adding files to facilitate pulling

* Fixes after debugging

* Skip test for these models

* Refactor to Text config

* Added generate function

* Saving intermediate convert file. Still need to read configs from the hub and convert them to our format.

* Adding version that reads from the JSON files. Still have to tweak some parameters manually.

* relative imports

* Initial tests

* Refactor image processor

* Seemingly working version of the conversion script, will need to test further.

* Adding command message

* Fixing conflicting JanusTextConfig class

* Incorporating some of the discussed changes.

* Small fix to create dir.

* Removing system from JINJA template

* Adding draft processor tests

* style fixes

* Minor fixes and enhancement

* added generation config

* Initial tests

* Small modifications, tests are now passing.

* Small changes I noticed while reading code.

* more fixes

* Added JanusModel class

* Small merge adaptations

* Image processing tests passing

* More tests and fixes

* Convert script updated and refactored

* Tests and cleanup

* make style

* Postprocessing for image generation

* generate refactor

* fixes

* - Passing tests that write a part of the model to cpu (e.g. test_cpu_offload)
- Passing tests of dispatching SDPA
- Only gradient checkpointing tests are left.

* Removing temporary code

* Changes

* Writing change to modular

* Added JanusVisionModel. SDPA dispatch tests pass more robustly. Gradient checkpoint tests are next

* Gradient checkpoint tests passing

* Removing debug code

* Major generate refactor 😮‍💨

* Temp changes for testing

* Green quality CI

* 2 out of 4 integration tests passing

* breadcrumbs

* Usage Examples

* Regenerate modeling after merge

* dirty code

* JanusIntegrationTest are passing

* breadcrumbs

* happy CI

* fixes

* Changing template

* nits

* Text generation logits matching original codebase at 100% precision

* Remove ./tmp from git tracking

* Checkpointing changes after reviewing

* Fixing code in docstrings

* Changing comments and small bug in convert file

* Fixing bug in image_token_id for 7B version

* Removing line that was added by both of us

* Pushing changes after discussion. Only one left is to change the key mapping for convert file.

* Updating module file

* New convert file using dict. Tested that it is equivalent to the old one by:
- comparing keys in a script
- comparing checksums of the output files between version generated with the current convert script and those generated with the old script. This is a more reliable test.

* revert changes

* mistake

* consistency change for CI

* make style

* doc fixes

* more fixes

* experimenting with masking out pad token

* checkpoint

* Batched generation with multi-images working for 1B models. Will test 7B next.

* Device fix.

* Writing changes to modular, previous ones were written to modeling just for quick testing.

* Using passed processor attention mask (only in modeling for now)

* Matching performance done in the non-standard way

* Working version of batched generation. Will change how some args are passed to make it more similar to language case

* More compliant version of the code

* Removed duplicated `_prepare_4d_causal_attention_mask_with_cache_position`

* Updating modular file, making masked filling with paddings more efficient

* Slightly more efficient version

* Modifying JanusVisionModel to be a wrapper

* Fixing test to comply with new names

* Modular overhaul

* More refactoring

* - Changing JanusVisionModel back
- Changing forward pass
- Adding boi token to the comparison

* - Removing whole context model_ids
- Using inherited implementation of prepare_inputs_for_generation

* Moving the way boi token is passed to the model

* Fixing sdpa test

* Minor changes

* testing changes

* Minor fix

* - Adding postprocessing test
- checking values of generated image on integration test

* changes

* Removing pooled attention vision module, fixing convert script as a consequence

* More changes

* Fixes

* Draft after merge

* Bug fixes

* More bug fixes

* Fixing docs

* Nits

* Refactor return dict

* Moving image post processing test to main processor post process

* Passing guidance_scale as kwarg

* make style

* 🔥 refactor

* make style

* Update and green CI

* Nits and tests update

* up

* Added MID block

* fix

* Dead code

* update testcase

* update

* model_id change

* init_weight changes

---------

Co-authored-by: hsilva664 <metallic-silver@hotmail.com>
2025-04-17 09:18:51 +02:00
albert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
align.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
altclip.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
aria.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
audio-spectrogram-transformer.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
auto.md Add auto model for image-text-to-text (#32472) 2024-10-08 14:26:43 +02:00
autoformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
aya_vision.md Add aya (#36521) 2025-03-04 12:24:33 +01:00
bamba.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bark.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bart.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
barthez.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bartpho.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
beit.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bert-generation.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bert-japanese.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bert.md [docs] Model docs (#36469) 2025-03-21 15:35:22 -07:00
bertweet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
big_bird.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bigbird_pegasus.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
biogpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bit.md Add ImageProcessorFast to BiT processor (#37180) 2025-04-14 17:07:48 +02:00
blenderbot-small.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
blenderbot.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
blip-2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
blip.md Simplify soft dependencies and update the dummy-creation process (#36827) 2025-04-11 11:08:36 +02:00
bloom.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bort.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
bridgetower.md Bridgetower fast image processor (#37373) 2025-04-16 22:39:18 +02:00
bros.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
byt5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
camembert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
canine.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
chameleon.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
chinese_clip.md Add Fast Chinese-CLIP Processor (#37012) 2025-04-15 18:31:20 +02:00
clap.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
clip.md Updated the model card for CLIP (#37040) 2025-04-02 14:57:38 -07:00
clipseg.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
clvp.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
code_llama.md chore: Update model doc for code_llama (#37115) 2025-04-03 10:09:41 -07:00
codegen.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
cohere.md Update model card for Cohere (#37056) 2025-04-03 09:51:40 -07:00
cohere2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
colpali.md Refactor ColPali model documentation (#37309) 2025-04-15 13:52:11 -07:00
conditional_detr.md Add Fast Conditional-DETR Processor (#37071) 2025-04-15 18:33:34 +02:00
convbert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
convnext.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
convnextv2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
cpm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
cpmant.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ctrl.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
cvt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dab-detr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dac.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
data2vec.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dbrx.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
deberta-v2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
deberta.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
decision_transformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
deepseek_v3.md [WIP] add deepseek-v3 (#35926) 2025-03-28 15:56:59 +01:00
deformable_detr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
deit.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
deplot.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
depth_anything_v2.md Add post_process_depth_estimation to image processors and support ZoeDepth's inference intricacies (#32550) 2024-10-22 15:50:54 +02:00
depth_anything.md Update model card for Depth Anything (#37065) 2025-04-04 11:36:05 -07:00
depth_pro.md fix typos in the docs directory (#36639) 2025-03-11 09:41:41 -07:00
deta.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
detr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dialogpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
diffllama.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dinat.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dinov2_with_registers.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
dinov2.md Update model-card for DINOv2 (#37104) 2025-04-07 10:11:08 -07:00
distilbert.md Updated model card for distilbert (#37157) 2025-04-04 15:22:46 -07:00
dit.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
donut.md Add Fast Image Processor for Donut (#37081) 2025-04-14 16:24:01 +02:00
dpr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
dpt.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
efficientformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
efficientnet.md Add EfficientNet Image PreProcessor (#37055) 2025-04-16 21:59:24 +02:00
electra.md Update model card for electra (#37063) 2025-04-03 10:45:35 -07:00
emu3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
encodec.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
encoder-decoder.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ernie_m.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ernie.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
esm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
falcon_mamba.md Update falcon mamba card (#37253) 2025-04-07 10:12:44 -07:00
falcon.md Update falcon model card (#37184) 2025-04-02 17:30:37 -07:00
falcon3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fastspeech2_conformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
flan-t5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
flan-ul2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
flaubert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
flava.md Add Fast Image Processor for Flava (#37135) 2025-04-14 15:05:31 +02:00
fnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
focalnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fsmt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
funnel.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
fuyu.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gemma.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gemma2.md Improvements in Gemma2 model card (#37076) 2025-04-07 10:51:26 -07:00
gemma3.md [docs] Attention mask image (#36970) 2025-03-26 10:11:34 -07:00
git.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
glm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
glm4.md Add glm4 (#37388) 2025-04-09 14:02:04 +02:00
glpn.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
got_ocr2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt_bigcode.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt_neo.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt_neox_japanese.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt_neox.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt-sw3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gpt2.md Update Model card for GPT2 (#37101) 2025-04-07 10:15:28 -07:00
gptj.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
gptsan-japanese.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
granite_speech.md Add Granite Speech Support (#36801) 2025-04-11 18:52:00 +02:00
granite.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
granitemoe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
granitemoeshared.md add shared experts for upcoming Granite 4.0 language models (#35894) 2025-02-14 16:55:28 +01:00
granitevision.md Update Granite Vision Model Path / Tests (#35998) 2025-02-03 20:06:03 +01:00
graphormer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
grounding-dino.md Add Fast Grounding-Dino Processor (#37108) 2025-04-16 12:26:08 +02:00
groupvit.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
helium.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
herbert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
hiera.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
hubert.md [MINOR:TYPO] Update hubert.md (#36733) 2025-03-17 09:07:51 -07:00
ibert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
idefics.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
idefics2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
idefics3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ijepa.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
imagegpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
informer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
instructblip.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
instructblipvideo.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
jamba.md Update Model Card for Jamba (#37152) 2025-04-07 11:02:59 -07:00
janus.md Add Janus model (#36053) 2025-04-17 09:18:51 +02:00
jetmoe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
jukebox.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
kosmos-2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
layoutlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
layoutlmv2.md Add Fast Image Processor for LayoutLMv2 (#37203) 2025-04-14 15:06:41 +02:00
layoutlmv3.md Add Fast Image Processor for LayoutLMv3 (#37201) 2025-04-14 15:42:11 +02:00
layoutxlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
led.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
levit.md Add Fast LeViT Processor (#37154) 2025-04-14 17:07:36 +02:00
lilt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
llama.md [docs] Attention mask image (#36970) 2025-03-26 10:11:34 -07:00
llama2.md [docs] Attention mask image (#36970) 2025-03-26 10:11:34 -07:00
llama3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
llama4.md Add llama4 (#37307) 2025-04-05 22:02:22 +02:00
llava_next_video.md fix typos in the docs directory (#36639) 2025-03-11 09:41:41 -07:00
llava_next.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
llava_onevision.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
llava.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
longformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
longt5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
luke.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
lxmert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
m2m_100.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
madlad-400.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mamba.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mamba2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
marian.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
markuplm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mask2former.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
maskformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
matcha.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mbart.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mctct.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mega.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
megatron_gpt2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
megatron-bert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mgp-str.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mimi.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mistral.md updated model card for Mistral (#37156) 2025-04-07 10:05:36 -07:00
mistral3.md Fix Mistral3 tests (#36797) 2025-03-18 13:08:12 -04:00
mixtral.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mlcd.md Add MLCD model (#36182) 2025-04-15 11:33:09 +01:00
mllama.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mluke.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mms.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mobilebert.md mobilebert model card update (#37256) 2025-04-04 14:28:35 -07:00
mobilenet_v1.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mobilenet_v2.md Add Fast Mobilenet-V2 Processor (#37113) 2025-04-14 17:08:47 +02:00
mobilevit.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mobilevitv2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
modernbert.md Update Model Card for ModernBERT (#37052) 2025-04-03 10:14:02 -07:00
moonshine.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
moshi.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mpnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mra.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mt5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
musicgen_melody.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
musicgen.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
mvp.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
myt5.md [WIP] Add Tokenizer for MyT5 Model (#31286) 2024-10-06 10:33:16 +02:00
nat.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nemotron.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nezha.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nllb-moe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nllb.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nougat.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
nystromformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
olmo.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
olmo2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
olmoe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
omdet-turbo.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
oneformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
open-llama.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
openai-gpt.md Update OpenAI GPT model card (#37255) 2025-04-04 15:25:16 -07:00
opt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
owlv2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
owlvit.md Add Fast owlvit Processor (#37164) 2025-04-14 17:58:09 +02:00
paligemma.md [docs] Attention mask image (#36970) 2025-03-26 10:11:34 -07:00
patchtsmixer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
patchtst.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pegasus_x.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pegasus.md Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
perceiver.md Add Fast Image Processor for Perceiver (#37176) 2025-04-14 13:49:13 +02:00
persimmon.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
phi.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
phi3.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
phi4_multimodal.md [Phi4] add multimodal chat template (#36996) 2025-04-03 09:52:09 +02:00
phimoe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
phobert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pix2struct.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pixtral.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
plbart.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
poolformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pop2piano.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
prompt_depth_anything.md Add Prompt Depth Anything Model (#35401) 2025-03-20 16:12:44 +00:00
prophetnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pvt_v2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
pvt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
qdqbert.md Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
qwen2_5_omni.md Add Qwen2.5-Omni (#36752) 2025-04-14 12:36:41 +02:00
qwen2_5_vl.md feat: updated model card for qwen_2.5_vl (#37099) 2025-04-03 09:13:26 -07:00
qwen2_audio.md [qwen2 audio] remove redundant code and update docs (#36282) 2025-03-20 10:54:51 +00:00
qwen2_moe.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
qwen2_vl.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
qwen2.md Updated model card for Qwen2 (#37192) 2025-04-02 18:10:41 -07:00
qwen3_moe.md Adding Qwen3 and Qwen3MoE (#36878) 2025-03-31 09:50:49 +02:00
qwen3.md Adding Qwen3 and Qwen3MoE (#36878) 2025-03-31 09:50:49 +02:00
rag.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
realm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
recurrent_gemma.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
reformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
regnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
rembert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
resnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
retribert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
roberta-prelayernorm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
roberta.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
roc_bert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
roformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
rt_detr_v2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
rt_detr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
rwkv.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
sam.md Create and Expose SamVisionModel as public for better accessibility (#36493) 2025-03-31 11:45:07 +02:00
seamless_m4t_v2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
seamless_m4t.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
segformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
seggpt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
sew-d.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
sew.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
shieldgemma2.md Adding links to ShieldGemma 2 technical report (#37247) 2025-04-03 16:26:29 +01:00
siglip.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
siglip2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
smolvlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
speech_to_text_2.md Deprecate low use models (#30781) 2024-05-28 18:07:07 +01:00
speech_to_text.md chore: Fix typos in docs and examples (#36524) 2025-03-04 13:47:41 +00:00
speech-encoder-decoder.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
speecht5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
splinter.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
squeezebert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
stablelm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
starcoder2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
superglue.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
superpoint.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
swiftformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
swin.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
swin2sr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
swinv2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
switch_transformers.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
t5.md Updated T5 model card with standardized format (#37261) 2025-04-04 15:23:09 -07:00
t5v1.1.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
table-transformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
tapas.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
tapex.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
textnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
time_series_transformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
timesfm.md Add TimesFM Time Series Forecasting Model (#34082) 2025-04-16 15:00:53 +02:00
timesformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
timm_wrapper.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
trajectory_transformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
transfo-xl.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
trocr.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
tvlt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
tvp.md chore: Fix typos in docs and examples (#36524) 2025-03-04 13:47:41 +00:00
udop.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
ul2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
umt5.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
unispeech-sat.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
unispeech.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
univnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
upernet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
van.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
video_llava.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
videomae.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
vilt.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vipllava.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vision-encoder-decoder.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vision-text-dual-encoder.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
visual_bert.md Remove research projects (#36645) 2025-03-11 13:47:38 +00:00
vit_hybrid.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vit_mae.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
vit_msn.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
vit.md [docs] Model docs (#36469) 2025-03-21 15:35:22 -07:00
vitdet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vitmatte.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vitpose.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
vits.md Update VITS model card (#37335) 2025-04-15 13:16:05 -07:00
vivit.md Refactor Attention implementation for ViT-based models (#36545) 2025-03-20 15:15:01 +00:00
wav2vec2_phoneme.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
wav2vec2-bert.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
wav2vec2-conformer.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
wav2vec2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
wavlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
whisper.md [docs] Model docs (#36469) 2025-03-21 15:35:22 -07:00
xclip.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xglm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm-prophetnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm-roberta-xl.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm-roberta.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm-v.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlm.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlnet.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xls_r.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xlsr_wav2vec2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
xmod.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
yolos.md Add Fast Yolos Processor (#37292) 2025-04-15 14:23:08 +02:00
yoso.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
zamba.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
zamba2.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00
zoedepth.md [docs] Redesign (#31757) 2025-03-03 10:33:46 -08:00