transformers

Mirror of https://github.com/huggingface/transformers.git, synced 2025-07-15 18:48:24 +06:00

Jinan Zhou a91020aed0 Add TimesFM Time Series Forecasting Model (#34082 ) * initial documentation * rename mask to attention_mask * smaller tests * fixup * fix copies * move to time series section * sort docs * isort fix * batch_size is not a configuration * rename to TimesFMModelForPrediction * initial script * add check_outputs * remove dropout_rate * works with torch.Tensor inputs * rename script * fix docstrings * fix freq when window_size is given * add loss * fix _quantile_loss * formatting * fix isort * add weight init * add support for sdpa and flash_attention_2 * fixes for flash_attention * formatting * remove flash_attention * fix tests * fix file name * fix quantile loss * added initial TimesFMModelIntegrationTests * fix formatting * fix import order * fix _quantile_loss * add doc for SDPA * use timesfm 2.0 * bug fix in timesfm decode function. * compare mean forecasts * refactor type hints, use CamelCase * consolidate decode func * more readable code for weight conversion * fix-copies * simpler init * renaem TimesFmMLP * use T5LayerNorm * fix tests * use initializer_range * TimesFmModel instead of TimesFmDecoder * TimesFmPositionalEmbedding takes config for its init * 2.0-500m-pytorch default configs * use TimesFmModel * fix formatting * ignore TimesFmModel for testing * fix docstring * override generate as its not needed * add doc strings * fix logging * add docstrings to output data classes * initial copy from t5 * added config and attention layers * add TimesFMPositionalEmbedding * calcuate scale_factor once * add more configs and TimesFMResidualBlock * fix input_dims * standardize code format with black * remove unneeded modules * TimesFM Model * order of imports * copy from Google official implementation * remove covariate forecasting * Adapting TimesFM to HF format * restructing in progress * adapted to HF convention * timesfm test * the model runs * fixing unit tests * fixing unit tests in progress * add post_init * do not change TimesFMOutput * 
fixing unit tests * all unit tests passed * remove timesfm_layers * add intermediate_size and initialize with config * add _CHECKPOINT_FOR_DOC * fix comments * Revert "fix comments" This reverts commit `8deeb3e191`.
* add _prepare_4d_attention_mask * we do not have generative model classes * use Cache * return past_key_values * modules initialized with config only * update year * Update docs/source/en/model_doc/timesfm.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * add layer_idx to cache * modular timesfm * fix test * unwrap sequential class * fix toctree * remove TimesFmOnnxConfig * fix modular * remove TimesFmStackedDecoder * split qkv layer into individual layers * rename projection layers * use ALL_ATTENTION_FUNCTIONS * is_causal is True * rename config * does not support flash_attn_2 * formatting * fix typo in docsstring * rename inputs * add time series mapping * Update src/transformers/models/olmo2/modeling_olmo2.py * Update src/transformers/models/moonshine/modeling_moonshine.py * use updated arguments * fix class name * add MODEL_FOR_TIME_SERIES_PREDICTION_MAPPING * isort * consolidate _preprocess into forward * fix a typo * fix a typo * fix toc * fix modular * remove aaserts * use self.config._attn_implementation * move to _postprocess_output * remove timesfm_get_large_negative_number * use view unstead of multiple unsqueeze * make helpers static methods of the Model * use to_tuple * use to_tuple if not return_dict * remove unused intitialization block as its incorporated in nn.Linear * remove unused num_key_value_groups * use the same convention as the masking method * update modular * do not use unsqueeze * use view instead of unsqueeze * use buffer for inv_timescales * formatting * modular conversion * remove unneeded intialization * add missing docstrings * remove cache * use simple_eager_attention_forward * support tp_plan * support for flex and flash attention masks * Revert "support for flex and flash attention masks" This reverts commit `def36c4fcf`. 
* fix device * fix tests on gpu * remove unsued large model test * removed unneeded comments * add example usage * fix style * add import * Update docs/source/en/model_doc/timesfm.md Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com> * inherit from LlamaRMSNorm * use can_return_tuple decorator * remvoe return_dict * fix year * Update docs/source/en/model_doc/timesfm.md Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com> * pretrained does not inherit from GenerationMixin * use model for integration test --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: Rajat Sen <rsen91@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com> Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>		2025-04-16 15:00:53 +02:00
albert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
align.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
altclip.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
aria.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
audio-spectrogram-transformer.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
auto.md	Add auto model for image-text-to-text (#32472 )	2024-10-08 14:26:43 +02:00
autoformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
aya_vision.md	Add aya (#36521 )	2025-03-04 12:24:33 +01:00
bamba.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bark.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bart.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
barthez.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bartpho.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
beit.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bert-generation.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bert-japanese.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bert.md	[docs] Model docs (#36469 )	2025-03-21 15:35:22 -07:00
bertweet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
big_bird.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bigbird_pegasus.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
biogpt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bit.md	Add ImageProcessorFast to BiT processor (#37180 )	2025-04-14 17:07:48 +02:00
blenderbot-small.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
blenderbot.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
blip-2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
blip.md	Simplify soft dependencies and update the dummy-creation process (#36827 )	2025-04-11 11:08:36 +02:00
bloom.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bort.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bridgetower.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
bros.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
byt5.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
camembert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
canine.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
chameleon.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
chinese_clip.md	Add Fast Chinese-CLIP Processor (#37012 )	2025-04-15 18:31:20 +02:00
clap.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
clip.md	Updated the model card for CLIP (#37040 )	2025-04-02 14:57:38 -07:00
clipseg.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
clvp.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
code_llama.md	chore: Update model doc for code_llama (#37115 )	2025-04-03 10:09:41 -07:00
codegen.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
cohere.md	Update model card for Cohere (#37056 )	2025-04-03 09:51:40 -07:00
cohere2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
colpali.md	Refactor ColPali model documentation (#37309 )	2025-04-15 13:52:11 -07:00
conditional_detr.md	Add Fast Conditional-DETR Processor (#37071 )	2025-04-15 18:33:34 +02:00
convbert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
convnext.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
convnextv2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
cpm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
cpmant.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
ctrl.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
cvt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
dab-detr.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
dac.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
data2vec.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
dbrx.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
deberta-v2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
deberta.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
decision_transformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
deepseek_v3.md	[WIP] add deepseek-v3 (#35926 )	2025-03-28 15:56:59 +01:00
deformable_detr.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
deit.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
deplot.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
depth_anything_v2.md	Add post_process_depth_estimation to image processors and support ZoeDepth's inference intricacies (#32550 )	2024-10-22 15:50:54 +02:00
depth_anything.md	Update model card for Depth Anything (#37065 )	2025-04-04 11:36:05 -07:00
depth_pro.md	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
deta.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
detr.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
dialogpt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
diffllama.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
dinat.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
dinov2_with_registers.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
dinov2.md	Update model-card for DINOv2 (#37104 )	2025-04-07 10:11:08 -07:00
distilbert.md	Updated model card for distilbert (#37157 )	2025-04-04 15:22:46 -07:00
dit.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
donut.md	Add Fast Image Processor for Donut (#37081 )	2025-04-14 16:24:01 +02:00
dpr.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
dpt.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
efficientformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
efficientnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
electra.md	Update model card for electra (#37063 )	2025-04-03 10:45:35 -07:00
emu3.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
encodec.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
encoder-decoder.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
ernie_m.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
ernie.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
esm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
falcon_mamba.md	Update falcon mamba card (#37253 )	2025-04-07 10:12:44 -07:00
falcon.md	Update falcon model card (#37184 )	2025-04-02 17:30:37 -07:00
falcon3.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
fastspeech2_conformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
flan-t5.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
flan-ul2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
flaubert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
flava.md	Add Fast Image Processor for Flava (#37135 )	2025-04-14 15:05:31 +02:00
fnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
focalnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
fsmt.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
funnel.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
fuyu.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gemma.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gemma2.md	Improvements in Gemma2 model card (#37076 )	2025-04-07 10:51:26 -07:00
gemma3.md	[docs] Attention mask image (#36970 )	2025-03-26 10:11:34 -07:00
git.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
glm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
glm4.md	Add glm4 (#37388 )	2025-04-09 14:02:04 +02:00
glpn.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
got_ocr2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gpt_bigcode.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gpt_neo.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gpt_neox_japanese.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gpt_neox.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gpt-sw3.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gpt2.md	Update Model card for GPT2 (#37101 )	2025-04-07 10:15:28 -07:00
gptj.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
gptsan-japanese.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
granite_speech.md	Add Granite Speech Support (#36801 )	2025-04-11 18:52:00 +02:00
granite.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
granitemoe.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
granitemoeshared.md	add shared experts for upcoming Granite 4.0 language models (#35894 )	2025-02-14 16:55:28 +01:00
granitevision.md	Update Granite Vision Model Path / Tests (#35998 )	2025-02-03 20:06:03 +01:00
graphormer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
grounding-dino.md	Add Fast Grounding-Dino Processor (#37108 )	2025-04-16 12:26:08 +02:00
groupvit.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
helium.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
herbert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
hiera.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
hubert.md	[MINOR:TYPO] Update hubert.md (#36733 )	2025-03-17 09:07:51 -07:00
ibert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
idefics.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
idefics2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
idefics3.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
ijepa.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
imagegpt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
informer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
instructblip.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
instructblipvideo.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
jamba.md	Update Model Card for Jamba (#37152 )	2025-04-07 11:02:59 -07:00
jetmoe.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
jukebox.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
kosmos-2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
layoutlm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
layoutlmv2.md	Add Fast Image Processor for LayoutLMv2 (#37203 )	2025-04-14 15:06:41 +02:00
layoutlmv3.md	Add Fast Image Processor for LayoutLMv3 (#37201 )	2025-04-14 15:42:11 +02:00
layoutxlm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
led.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
levit.md	Add Fast LeViT Processor (#37154 )	2025-04-14 17:07:36 +02:00
lilt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
llama.md	[docs] Attention mask image (#36970 )	2025-03-26 10:11:34 -07:00
llama2.md	[docs] Attention mask image (#36970 )	2025-03-26 10:11:34 -07:00
llama3.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
llama4.md	Add llama4 (#37307 )	2025-04-05 22:02:22 +02:00
llava_next_video.md	fix typos in the docs directory (#36639 )	2025-03-11 09:41:41 -07:00
llava_next.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
llava_onevision.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
llava.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
longformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
longt5.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
luke.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
lxmert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
m2m_100.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
madlad-400.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mamba.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mamba2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
marian.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
markuplm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mask2former.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
maskformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
matcha.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mbart.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mctct.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mega.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
megatron_gpt2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
megatron-bert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mgp-str.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mimi.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mistral.md	updated model card for Mistral (#37156 )	2025-04-07 10:05:36 -07:00
mistral3.md	Fix Mistral3 tests (#36797 )	2025-03-18 13:08:12 -04:00
mixtral.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mlcd.md	Add MLCD model (#36182 )	2025-04-15 11:33:09 +01:00
mllama.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mluke.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mms.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mobilebert.md	mobilebert model card update (#37256 )	2025-04-04 14:28:35 -07:00
mobilenet_v1.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mobilenet_v2.md	Add Fast Mobilenet-V2 Processor (#37113 )	2025-04-14 17:08:47 +02:00
mobilevit.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mobilevitv2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
modernbert.md	Update Model Card for ModernBERT (#37052 )	2025-04-03 10:14:02 -07:00
moonshine.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
moshi.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mpnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mpt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mra.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mt5.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
musicgen_melody.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
musicgen.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
mvp.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
myt5.md	[WIP] Add Tokenizer for MyT5 Model (#31286 )	2024-10-06 10:33:16 +02:00
nat.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
nemotron.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
nezha.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
nllb-moe.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
nllb.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
nougat.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
nystromformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
olmo.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
olmo2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
olmoe.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
omdet-turbo.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
oneformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
open-llama.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
openai-gpt.md	Update OpenAI GPT model card (#37255 )	2025-04-04 15:25:16 -07:00
opt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
owlv2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
owlvit.md	Add Fast owlvit Processor (#37164 )	2025-04-14 17:58:09 +02:00
paligemma.md	[docs] Attention mask image (#36970 )	2025-03-26 10:11:34 -07:00
patchtsmixer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
patchtst.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pegasus_x.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pegasus.md	Remove research projects (#36645 )	2025-03-11 13:47:38 +00:00
perceiver.md	Add Fast Image Processor for Perceiver (#37176 )	2025-04-14 13:49:13 +02:00
persimmon.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
phi.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
phi3.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
phi4_multimodal.md	[Phi4] add multimodal chat template (#36996 )	2025-04-03 09:52:09 +02:00
phimoe.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
phobert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pix2struct.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pixtral.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
plbart.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
poolformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pop2piano.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
prompt_depth_anything.md	Add Prompt Depth Anything Model (#35401 )	2025-03-20 16:12:44 +00:00
prophetnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pvt_v2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
pvt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
qdqbert.md	Remove research projects (#36645 )	2025-03-11 13:47:38 +00:00
qwen2_5_omni.md	Add Qwen2.5-Omni (#36752 )	2025-04-14 12:36:41 +02:00
qwen2_5_vl.md	feat: updated model card for qwen_2.5_vl (#37099 )	2025-04-03 09:13:26 -07:00
qwen2_audio.md	[qwen2 audio] remove redundant code and update docs (#36282 )	2025-03-20 10:54:51 +00:00
qwen2_moe.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
qwen2_vl.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
qwen2.md	Updated model card for Qwen2 (#37192 )	2025-04-02 18:10:41 -07:00
qwen3_moe.md	Adding Qwen3 and Qwen3MoE (#36878 )	2025-03-31 09:50:49 +02:00
qwen3.md	Adding Qwen3 and Qwen3MoE (#36878 )	2025-03-31 09:50:49 +02:00
rag.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
realm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
recurrent_gemma.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
reformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
regnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
rembert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
resnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
retribert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
roberta-prelayernorm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
roberta.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
roc_bert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
roformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
rt_detr_v2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
rt_detr.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
rwkv.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
sam.md	Create and Expose SamVisionModel as public for better accessibility (#36493 )	2025-03-31 11:45:07 +02:00
seamless_m4t_v2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
seamless_m4t.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
segformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
seggpt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
sew-d.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
sew.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
shieldgemma2.md	Adding links to ShieldGemma 2 technical report (#37247 )	2025-04-03 16:26:29 +01:00
siglip.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
siglip2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
smolvlm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
speech_to_text_2.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
speech_to_text.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
speech-encoder-decoder.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
speecht5.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
splinter.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
squeezebert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
stablelm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
starcoder2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
superglue.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
superpoint.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
swiftformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
swin.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
swin2sr.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
swinv2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
switch_transformers.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
t5.md	Updated T5 model card with standardized format (#37261 )	2025-04-04 15:23:09 -07:00
t5v1.1.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
table-transformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
tapas.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
tapex.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
textnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
time_series_transformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
timesfm.md	Add TimesFM Time Series Forecasting Model (#34082 )	2025-04-16 15:00:53 +02:00
timesformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
timm_wrapper.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
trajectory_transformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
transfo-xl.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
trocr.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
tvlt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
tvp.md	chore: Fix typos in docs and examples (#36524 )	2025-03-04 13:47:41 +00:00
udop.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
ul2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
umt5.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
unispeech-sat.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
unispeech.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
univnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
upernet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
van.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
video_llava.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
videomae.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
vilt.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
vipllava.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
vision-encoder-decoder.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
vision-text-dual-encoder.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
visual_bert.md	Remove research projects (#36645 )	2025-03-11 13:47:38 +00:00
vit_hybrid.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
vit_mae.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
vit_msn.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
vit.md	[docs] Model docs (#36469 )	2025-03-21 15:35:22 -07:00
vitdet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
vitmatte.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
vitpose.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
vits.md	Update VITS model card (#37335 )	2025-04-15 13:16:05 -07:00
vivit.md	Refactor Attention implementation for ViT-based models (#36545 )	2025-03-20 15:15:01 +00:00
wav2vec2_phoneme.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
wav2vec2-bert.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
wav2vec2-conformer.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
wav2vec2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
wavlm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
whisper.md	[docs] Model docs (#36469 )	2025-03-21 15:35:22 -07:00
xclip.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xglm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xlm-prophetnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xlm-roberta-xl.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xlm-roberta.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xlm-v.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xlm.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xlnet.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xls_r.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xlsr_wav2vec2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
xmod.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
yolos.md	Add Fast Yolos Processor (#37292 )	2025-04-15 14:23:08 +02:00
yoso.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
zamba.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
zamba2.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00
zoedepth.md	[docs] Redesign (#31757 )	2025-03-03 10:33:46 -08:00