transformers/docs/source/en/model_doc
Alazar 94306352f4
Port IDEFICS to tensorflow (#26870)
* Initial commit

* Just a copy of modeling_idefics.py that will be ported to TF

* - Prepend TF to the name of all classes
- Convert pytorch ops to TF (not all operations are converted yet)

* Add TF imports

* Add autotranslated files

* Add TF classes to model_tf_auto.py

* Add the TF classes in model_doc

* include auto-translated code

* Adopted from auto-translated version

* Add a forgotten super().build

* Add test code for TF version.

* Fix indentation and load pytorch weights for now

* Some fixes. Many tests are still failing but some are passing now.

- I have added TODO's for some of the hacks I made to unblock me
  and I will address them soon
- I have the processing_idefics.py hacked in my view to support TF temporarily

* Add ALL_LAYERNORM_LAYERS to match pytorch

* Revert "Add ALL_LAYERNORM_LAYERS to match pytorch"

This reverts commit 7e0a35119b4d7a6284d04d8c543fba1b29e573c9 as it
is not needed in the tf implementation.

* Fix freeze_relevant_params()

* Some more fixes

* Fix test_attention_outputs

* Add tf stuff to processing_idefics.py

processing_idefics.py supports both pytorch and tf now.

test_processor_idefics.py for pytorch is passing, so i didn't break anything
but still some issues with tf. I also need to add tf tests in
test_processor_idefics.py.

* Pass return_tensors to image processing code and fix test

* Pass return_tensors to the image processor __init__

* Fix several test cases

- Make input to some of the forward pass of type `TFModelInputType`
- Decorate main layer forward pass with `@unpack_inputs`
- Decorate main layer with `@keras_serializable`
- Pass `inputs` to TFIdeficsModel

* Some more fixes forgotten in last commit

* Fix processing code and vision_tf.py

* Fix perceiver bug

* Import from

* Auto-add build() methods + style pass

* Fix build() errors due to `None` being passed as shape to some layers

* Change name in TFIdeficsForVisionText2Text to attribute in IdeficsForVisionText2Text

* Fix pytorch weights load for tf2

There were a lot of `name=` missing in weight initialization code.

* Attempt to fix CI

* Add back accidently removed line

* Remove torch-specific stuff from the TF test file

* make fix-copies, make style, remove autotranslated files

* Fixes to imports/docstrings

* Let's try the from future import in desperation

* Fix the core random_attention_mask fn to match the torch/flax behaviour

* Clean random_attention_mask up correctly

* Remove torch-only test

* Fix loss shape, couple of nits

* make style

* Don't test for OOB embeddings because IDEFICS uses those deliberately

* Fix loss computation to handle masking

* Fix test failures when flattening

* Fix some test failures

- Add cross attention gate which was missing and wasn't being passed arround
- Fix overwriting of image_attention_mask due to hack I had for dummy inputs

* Add a proper stateless scaled_dot_product_attention

* make style

* Adding missing attribute from the PyTorch version

* Small cleanups to decoupledlinearlayer in case that helps

* Pass epsilon to LayerNormalization

* Attemp to fix pytorch weight cross-loading for TFIdeficsEmbedding

* Fix a bug in TFIdeficsGatedCrossAttentionLayer

* Patching up build() methods

* Constant self.inv_freq

* Constant self.inv_freq

* First working version

The TF implementation works now, there was a bug in the TFIdeficsDecoupledLinear
where the weights were mis-intialized (in_features,out_features)
when it should be: (out_features, in_features)

I have tested this so far with tiny-random and idefics-9b-instruct
and gives correct output.

I also dumped the final outputs for both pytorch and TF
and they are identical.

* Fix some test failures

* remove print statement

* Fix return_tensors

* Fix CI test failure check_code_quality

* Attempt to fix CI failures by running `make fixup`

The hardcoded IDs in test_modeling_tf_idefics.py are for the integration
test and makes that file unreadable and should probably be moved to a seperate file.

* Attempt to fix tests_pr_documentation_tests

* Fix a test failure in test_image_processing_idefics.py

* Fix test test_pt_tf_model_equivalence

* Fix a few failures

* Tiny fix

* Some minor fixes

* Remove a duplicate test

* Override a few test failures for IDEFICS

- `test_keras_save_load` is passing now
- `test_compile_tf_model` is still failing

* Fix processing_idefics.py after rebase

* Guard import keras with is_tf_available

* fix check code quality

* fix check code quality

* Minor fixes

* Skip test_save_load temporarily

This test passed on my local box but fails on the CI, skipping
for now to see if there are other remaining failures on the CI.

* Run `ruff format tests src utils`

* Fix last failing test, `test_compile_tf_model`

* Add fixes for vision_tf.py

I forgot to add this file in last commit.

* Minor fixes

* Replace "<<<" with "<<" for doc tests

IDEFICS-9B is too big for doctest runner, so don't run it there

* Make code more readable

* Fix bug after code review

I added a layer_norm_eps to IdeficsConfig but I don't even need it
since the vision config has a layer_norm_eps.

* Fix after code review

Use original code tokenizer.convert_tokens_to_ids

* Keep PyTorch as the default return_tensors

* Fixes to modeling_tf after code review

* Fixes from code review

- Remove all references of `TF_IDEFICS_PRETRAINED_MODEL_ARCHIVE_LIST`
- Pass 1e-5 to LayerNormalization in perceiver

* Run ruff

* Undo a change

* Refactor processing code after Matt's suggestion

* Remove TODO's that aren't needed anymore

* For pytorch, Use original pytorch processing code from main

Since this PR is a TF port it shouldn't make any modifications
to pytorch IDEFICS code. This changes undo's the pytorch processing
modifications I made and uses original code from main.

* Update tests/models/idefics/test_modeling_idefics.py

* Update tests/models/idefics/test_modeling_tf_idefics.py

* Add missing imports for is_pt_tf_cross_test

* [DO NOT MERGE]: This is a commit for debugging and will be reverted

The cross test `test_pt_tf_model_equivalence` passes locally but
fails when running on the CI. This commit is to help debug that
and will be reverted.

* Revert "[DO NOT MERGE]: This is a commit for debugging and will be reverted"

This reverts commit 8f0d709ec5bd46685fb0b4259d914ffee794875b.

* [DO NOT MERGE]: This commit is for debugging a CI failure and will be reverted

* [DO NOT MERGE]: This commit is for debugging a CI failure and will be reverted

* Revert "[DO NOT MERGE]: This commit is for debugging a CI failure and will be reverted"

This reverts commit 998cc38b8c3d313bf5e5eb55a7f5b7b881897b89.

* Revert "[DO NOT MERGE]: This commit is for debugging a CI failure and will be reverted"

This reverts commit 1c695ac4219c4ae4d39b330b01744dc27deb7dd4.

* Don't skip test_save_load

IIRC test_save_load was also failing on the CI but not on my local
box, it might be easier to debug that on the CI first than the cross tests

* Debugging commit, will be reverted

* Revert "Debugging commit, will be reverted"

This reverts commit 8eafc8e41e20c4e95a3a90834f06a6e9f445e2d5.

* Override `test_save_load` and push model to save

Maybe this will help me repro this weird bug

* pass my repo_id

* add endpoint

* Pass a temp (write) token just for this CI

* Undo last few commits, still pushing to hub for model debugging

The issue seems to be with save_pretrained(),  when I looked at the model saved
from the CI test failure it is basically empty and has no weights.
`self.save_weights(..)` seems to be failing in save_pretrained but needs
more debugging

* Add logging to modeling tf utils, will be reverted just for debugging

* Debugging, will revert

* Revert "Debugging, will revert"

This reverts commit 9d0d3075fb7c82d8cde3a5c76bc8f3876c5c55d3.

* Revert "Add logging to modeling tf utils, will be reverted just for debugging"

This reverts commit 774b6b7b1c17b3ce5d7634ade768f2f686cee617.

* Remove `test_save_load`

The CI failures are gone after my latest rebase, no idea why
but I was still saving the model to my hub on HF and the tf_model.h5
file now has everything.

* Run make fix-copies

* Run ruff format tests src utils

* Debugging commit, will be reverted

* Run ruff, also trigger CI run

* Run ruff again

* Undo debugging commit

---------

Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
2024-05-13 15:59:46 +01:00
..
albert.md Updated albert.md doc for ALBERT model (#27223) 2023-11-16 11:44:36 -08:00
align.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
altclip.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
audio-spectrogram-transformer.md Translating en/model_doc docs to Japanese. (#27401) 2023-11-15 10:13:52 -08:00
auto.md SuperPointModel -> SuperPointForKeypointDetection (#29757) 2024-03-20 15:41:03 +00:00
autoformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bark.md F.scaled_dot_product_attention support (#26572) 2023-12-09 05:38:14 +09:00
bart.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
barthez.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bartpho.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
beit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bert-generation.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
bert-japanese.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bert.md [BERT] Add support for sdpa (#28802) 2024-04-26 16:23:44 +01:00
bertweet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
big_bird.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bigbird_pegasus.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
biogpt.md Translating en/model_doc folder docs to Japanese(from blip to clap) 🇯🇵 (#27673) 2023-12-06 10:38:21 -08:00
bit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
blenderbot-small.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
blenderbot.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
blip-2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
blip.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bloom.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bort.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bridgetower.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
bros.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
byt5.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
camembert.md small doc update for CamemBERT (#28644) 2024-01-29 15:46:32 +01:00
canine.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
chinese_clip.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
clap.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
clip.md Add SiglipForImageClassification and CLIPForImageClassification (#28952) 2024-02-14 08:41:31 +01:00
clipseg.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
clvp.md Add CLVP (#24745) 2023-11-10 13:49:10 +00:00
code_llama.md Update CodeLlama references (#30218) 2024-05-09 22:57:52 +02:00
codegen.md Add token type ids to CodeGenTokenizer (#29265) 2024-04-17 12:19:18 +02:00
cohere.md Cohere Model Release (#29622) 2024-03-15 14:29:11 +01:00
conditional_detr.md Add examples for detection models finetuning (#30422) 2024-05-08 11:42:07 +01:00
convbert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
convnext.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
convnextv2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
cpm.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
cpmant.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
ctrl.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
cvt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
data2vec.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
dbrx.md Fix link in dbrx.md (#30509) 2024-04-26 20:52:24 +01:00
deberta-v2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
deberta.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
decision_transformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
deformable_detr.md Add examples for detection models finetuning (#30422) 2024-05-08 11:42:07 +01:00
deit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
deplot.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
depth_anything.md [Docs] Add resources (#28705) 2024-02-19 15:22:29 +01:00
deta.md Add examples for detection models finetuning (#30422) 2024-05-08 11:42:07 +01:00
detr.md Add examples for detection models finetuning (#30422) 2024-05-08 11:42:07 +01:00
dialogpt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
dinat.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
dinov2.md [DPT, Dinov2] Add resources (#27655) 2023-11-23 17:44:08 +00:00
distilbert.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
dit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
donut.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
dpr.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
dpt.md [DPT, Dinov2] Add resources (#27655) 2023-11-23 17:44:08 +00:00
efficientformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
efficientnet.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
electra.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
encodec.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
encoder-decoder.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
ernie_m.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
ernie.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
esm.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
falcon.md Add proper Falcon docs and conversion script (#25954) 2023-09-04 17:18:34 +01:00
fastspeech2_conformer.md Super tiny fix 12 typos about "with with" (#29926) 2024-03-29 14:31:31 +00:00
flan-t5.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
flan-ul2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
flaubert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
flava.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
fnet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
focalnet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
fsmt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
funnel.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
fuyu.md Fix Fuyu doc typos (#29601) 2024-03-12 10:16:21 +00:00
gemma.md [ gemma] Adds support for Gemma 💎 (#29167) 2024-02-21 14:21:28 +01:00
git.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
glpn.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
gpt_bigcode.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
gpt_neo.md F.scaled_dot_product_attention support (#26572) 2023-12-09 05:38:14 +09:00
gpt_neox_japanese.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
gpt_neox.md F.scaled_dot_product_attention support (#26572) 2023-12-09 05:38:14 +09:00
gpt-sw3.md Fix paths to AI Sweden Models reference and model loading (#28423) 2024-01-15 09:09:22 +01:00
gpt2.md Adding Flash Attention 2 Support for GPT2 (#29226) 2024-03-28 09:31:24 +00:00
gptj.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
gptsan-japanese.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
graphormer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
grounding-dino.md [Grounding DINO] Add resources (#30232) 2024-04-19 21:03:07 +02:00
groupvit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
herbert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
hubert.md Add sdpa and fa2 the Wav2vec2 family. (#30121) 2024-04-22 18:30:38 +01:00
ibert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
idefics.md Port IDEFICS to tensorflow (#26870) 2024-05-13 15:59:46 +01:00
idefics2.md Add Idefics2 (#30253) 2024-04-15 17:03:03 +01:00
imagegpt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
informer.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
instructblip.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
jamba.md Add jamba (#29943) 2024-04-18 11:04:02 +02:00
jukebox.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
kosmos-2.md [KOSMOS-2] Update docs (#27157) 2023-10-30 21:42:19 +01:00
layoutlm.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
layoutlmv2.md [Docs] Add language identifiers to fenced code blocks (#28955) 2024-02-12 10:48:31 -08:00
layoutlmv3.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
layoutxlm.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
led.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
levit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
lilt.md [Docs] Add language identifiers to fenced code blocks (#28955) 2024-02-12 10:48:31 -08:00
llama.md Adds LlamaForQuestionAnswering class in modeling_llama.py along with AutoModel Support (#28777) 2024-02-06 03:41:42 +01:00
llama2.md Fix FA2 integration (#28142) 2023-12-20 14:25:07 +05:30
llama3.md Update llama3.md, fix typo (#30739) 2024-05-10 12:40:57 +01:00
llava_next.md Nits for model docs (#29795) 2024-04-22 10:41:03 +01:00
llava.md Fix Llava chat template examples (#30130) 2024-04-11 10:38:24 +02:00
longformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
longt5.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
luke.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
lxmert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
m2m_100.md Add Flash Attention 2 to M2M100 model (#30256) 2024-04-18 10:27:58 +02:00
madlad-400.md Add madlad-400 MT models (#27471) 2023-11-28 13:19:50 +00:00
mamba.md [Mamba doc] Post merge updates (#29472) 2024-03-11 09:46:24 +01:00
marian.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
markuplm.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
mask2former.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
maskformer.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
matcha.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mbart.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mctct.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mega.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
megatron_gpt2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
megatron-bert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mgp-str.md [Docs] Fix broken links and syntax issues (#28918) 2024-02-08 14:13:35 -08:00
mistral.md [Mistral, Mixtral] Improve docs (#29084) 2024-02-22 11:48:01 +01:00
mixtral.md [Mistral, Mixtral] Improve docs (#29084) 2024-02-22 11:48:01 +01:00
mluke.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mms.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
mobilebert.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
mobilenet_v1.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mobilenet_v2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mobilevit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mobilevitv2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mpnet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mpt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mra.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
mt5.md Adding [T5/MT5/UMT5]ForTokenClassification (#28443) 2024-02-01 03:53:49 +01:00
musicgen_melody.md Add MusicGen Melody (#28819) 2024-03-18 13:06:12 +00:00
musicgen.md [Docs] Add language identifiers to fenced code blocks (#28955) 2024-02-12 10:48:31 -08:00
mvp.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
nat.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
nezha.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
nllb-moe.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
nllb.md Add Flash Attention 2 to M2M100 model (#30256) 2024-04-18 10:27:58 +02:00
nougat.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
nystromformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
olmo.md Add OLMo model family (#29890) 2024-04-17 17:59:07 +02:00
oneformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
open-llama.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
openai-gpt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
opt.md F.scaled_dot_product_attention support (#26572) 2023-12-09 05:38:14 +09:00
owlv2.md Update bounding box format everywhere (#27944) 2023-12-11 18:03:42 +00:00
owlvit.md Update bounding box format everywhere (#27944) 2023-12-11 18:03:42 +00:00
patchtsmixer.md [Docs] Add resources (#28705) 2024-02-19 15:22:29 +01:00
patchtst.md [Docs] Add resources (#28705) 2024-02-19 15:22:29 +01:00
pegasus_x.md [Docs] Fix broken links and syntax issues (#28918) 2024-02-08 14:13:35 -08:00
pegasus.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
perceiver.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
persimmon.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
phi.md Fix doctest more (for docs/source/en) (#30247) 2024-04-15 14:10:59 +02:00
phi3.md phi3 chat_template does not support system role (#30606) 2024-05-02 15:30:21 +02:00
phobert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
pix2struct.md 🌐 [i18n-ZH] Translate chat_templating.md into Chinese (#28790) 2024-02-26 08:42:24 -08:00
plbart.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
poolformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
pop2piano.md [Docs] Add language identifiers to fenced code blocks (#28955) 2024-02-12 10:48:31 -08:00
prophetnet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
pvt_v2.md Add PvT-v2 Model (#26812) 2024-03-13 19:05:20 +00:00
pvt.md [Docs] Fix broken links and syntax issues (#28918) 2024-02-08 14:13:35 -08:00
qdqbert.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
qwen2_moe.md Update model card and link of blog post. (#29928) 2024-03-30 17:49:03 +01:00
qwen2.md [Doc] update model doc qwen2 (#29238) 2024-02-23 10:43:31 +01:00
rag.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
realm.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
recurrent_gemma.md [Docs] Update recurrent_gemma.md for some minor nits (#30238) 2024-04-15 18:30:59 +02:00
reformer.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
regnet.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
rembert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
resnet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
retribert.md Deprecate models (#24787) 2023-07-13 11:46:54 -04:00
roberta-prelayernorm.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
roberta.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
roc_bert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
roformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
rwkv.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
sam.md [Grounding DINO] Add resources (#30232) 2024-04-19 21:03:07 +02:00
seamless_m4t_v2.md [Seamless] Fix links in docs (#27905) 2023-12-14 15:14:13 +00:00
seamless_m4t.md [Seamless] Fix links in docs (#27905) 2023-12-14 15:14:13 +00:00
segformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
seggpt.md [SegGPT] Fix seggpt image processor (#29550) 2024-04-26 19:40:12 +01:00
sew-d.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
sew.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
siglip.md [Docs] Add resources (#28705) 2024-02-19 15:22:29 +01:00
speech_to_text_2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
speech_to_text.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
speech-encoder-decoder.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
speecht5.md add generate method to SpeechT5ForTextToSpeech (#25233) 2023-08-03 14:12:07 +01:00
splinter.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
squeezebert.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
stablelm.md Fix doctest more (for docs/source/en) (#30247) 2024-04-15 14:10:59 +02:00
starcoder2.md Fix doctest more (for docs/source/en) (#30247) 2024-04-15 14:10:59 +02:00
superpoint.md SuperPointModel -> SuperPointForKeypointDetection (#29757) 2024-03-20 15:41:03 +00:00
swiftformer.md Add TF swiftformer (#23342) 2024-04-19 18:31:43 +01:00
swin.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
swin2sr.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
swinv2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
switch_transformers.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
t5.md Fix doctest more (for docs/source/en) (#30247) 2024-04-15 14:10:59 +02:00
t5v1.1.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
table-transformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
tapas.md [docs] fixed links with 404 (#27327) 2023-11-06 19:45:03 +00:00
tapex.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
time_series_transformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
timesformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
trajectory_transformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
transfo-xl.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
trocr.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
tvlt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
tvp.md Update TVP arxiv link (#27672) 2023-11-23 17:02:16 +00:00
udop.md [UDOP] Improve docs, add resources (#29571) 2024-04-10 16:02:50 +02:00
ul2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
umt5.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
unispeech-sat.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
unispeech.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
univnet.md Add UnivNet Vocoder Model for Tortoise TTS Diffusers Integration (#24799) 2023-11-22 17:21:36 +01:00
upernet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
van.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
videomae.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vilt.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vipllava.md Fix Vip-llava docs (#28085) 2023-12-15 20:16:47 +01:00
vision-encoder-decoder.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
vision-text-dual-encoder.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
visual_bert.md Update all references to canonical models (#29001) 2024-02-16 08:16:58 +01:00
vit_hybrid.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vit_mae.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vit_msn.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vitdet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vitmatte.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vits.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
vivit.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
wav2vec2_phoneme.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
wav2vec2-bert.md Add new meta w2v2-conformer BERT-like model (#28165) 2024-01-18 13:37:34 +00:00
wav2vec2-conformer.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
wav2vec2.md Add sdpa and fa2 the Wav2vec2 family. (#30121) 2024-04-22 18:30:38 +01:00
wavlm.md [Docs] Fix spelling and grammar mistakes (#28825) 2024-02-02 08:45:00 +01:00
whisper.md [Docs] Add resources (#28705) 2024-02-19 15:22:29 +01:00
xclip.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
xglm.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xlm-prophetnet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xlm-roberta-xl.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xlm-roberta.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xlm-v.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xlm.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xlnet.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xls_r.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xlsr_wav2vec2.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
xmod.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00
yolos.md Add examples for detection models finetuning (#30422) 2024-05-08 11:42:07 +01:00
yoso.md [Docs] Model_doc structure/clarity improvements (#26876) 2023-11-03 10:57:03 -04:00