transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-15 18:48:24 +06:00

Avishai Elmakies a265600c60 add sdpa to OPT (#33298)	2024-10-10 11:49:34 +02:00
albert.md	Add sdpa support for Albert (#32092 )	2024-09-03 14:01:00 +01:00
align.md	Uniformize kwargs for image-text-to-text processors (#32544 )	2024-09-24 21:28:19 -04:00
altclip.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
audio-spectrogram-transformer.md	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
auto.md	Add auto model for image-text-to-text (#32472 )	2024-10-08 14:26:43 +02:00
autoformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
bark.md	F.scaled_dot_product_attention support (#26572 )	2023-12-09 05:38:14 +09:00
bart.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
barthez.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
bartpho.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
beit.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
bert-generation.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
bert-japanese.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
bert.md	[`BERT`] Add support for sdpa (#28802 )	2024-04-26 16:23:44 +01:00
bertweet.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
big_bird.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
bigbird_pegasus.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
biogpt.md	Add sdpa for BioGpt (#33592 )	2024-09-20 14:27:32 +01:00
bit.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
blenderbot-small.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
blenderbot.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
blip-2.md	🚨 Add Blip2ForImageTextRetrieval (#29261 )	2024-08-27 18:50:27 +01:00
blip.md	Blip: Deprecate `BlipModel` (#31235 )	2024-06-04 18:29:45 +02:00
bloom.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
bort.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
bridgetower.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
bros.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
byt5.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
camembert.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
canine.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
chameleon.md	Uniformize kwargs for chameleon processor (#32181 )	2024-09-26 10:18:07 -04:00
chinese_clip.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
clap.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
clip.md	Add `sdpa` and FA2 for CLIP (#31940 )	2024-07-18 10:30:37 +05:30
clipseg.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
clvp.md	Add CLVP (#24745 )	2023-11-10 13:49:10 +00:00
code_llama.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
codegen.md	Add token type ids to CodeGenTokenizer (#29265 )	2024-04-17 12:19:18 +02:00
cohere.md	Cohere Model Release (#29622 )	2024-03-15 14:29:11 +01:00
conditional_detr.md	Add examples for detection models finetuning (#30422 )	2024-05-08 11:42:07 +01:00
convbert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
convnext.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
convnextv2.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
cpm.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
cpmant.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
ctrl.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
cvt.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
dac.md	Add Descript-Audio-Codec model (#31494 )	2024-08-19 10:21:51 +01:00
data2vec.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
dbrx.md	Follow up: Fix link in dbrx.md (#30514 )	2024-05-27 14:57:43 +02:00
deberta-v2.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
deberta.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
decision_transformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
deformable_detr.md	Add examples for detection models finetuning (#30422 )	2024-05-08 11:42:07 +01:00
deit.md	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
deplot.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
depth_anything_v2.md	Depth Anything: update conversion script for V2 (#31522 )	2024-07-05 19:28:41 +01:00
depth_anything.md	Depth Anything: update conversion script for V2 (#31522 )	2024-07-05 19:28:41 +01:00
deta.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
detr.md	Fixed typo repeated word in DETR docs (#33250 )	2024-09-02 17:19:18 +02:00
dialogpt.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
dinat.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
dinov2.md	Add Flax Dinov2 (#31960 )	2024-08-19 09:28:13 +01:00
distilbert.md	Add sdpa for DistilBert (#33724 )	2024-10-02 13:55:19 +01:00
dit.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
donut.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
dpr.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
dpt.md	[DPT, Dinov2] Add resources (#27655 )	2023-11-23 17:44:08 +00:00
efficientformer.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
efficientnet.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
electra.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
encodec.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
encoder-decoder.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
ernie_m.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
ernie.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
esm.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
falcon_mamba.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
falcon.md	Add proper Falcon docs and conversion script (#25954 )	2023-09-04 17:18:34 +01:00
fastspeech2_conformer.md	Super tiny fix 12 typos about "with with" (#29926 )	2024-03-29 14:31:31 +00:00
flan-t5.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
flan-ul2.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
flaubert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
flava.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
fnet.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
focalnet.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
fsmt.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
funnel.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
fuyu.md	Uniformize kwargs for image-text-to-text processors (#32544 )	2024-09-24 21:28:19 -04:00
gemma.md	Add TokenClassification for Mistral, Mixtral and Qwen2 (#29878 )	2024-05-20 10:06:57 +02:00
gemma2.md	Gemma2: add cache warning (#32279 )	2024-08-07 10:03:05 +05:00
git.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
glpn.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
gpt_bigcode.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
gpt_neo.md	F.scaled_dot_product_attention support (#26572 )	2023-12-09 05:38:14 +09:00
gpt_neox_japanese.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
gpt_neox.md	[`GPT-NeoX`] Add SDPA support (#31031 )	2024-06-26 13:56:36 +01:00
gpt-sw3.md	Fix paths to AI Sweden Models reference and model loading (#28423 )	2024-01-15 09:09:22 +01:00
gpt2.md	[`GPT2`] Add SDPA support (#31172 )	2024-06-19 09:40:57 +02:00
gptj.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
gptsan-japanese.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
granite.md	Granite language models (#31502 )	2024-08-27 21:27:21 +02:00
granitemoe.md	Granitemoe (#33207 )	2024-09-21 01:43:50 +02:00
graphormer.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
grounding-dino.md	Fix code snippet for Grounding DINO (#32229 )	2024-07-25 19:20:47 +01:00
groupvit.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
herbert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
hiera.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
hubert.md	Add sdpa and fa2 the Wav2vec2 family. (#30121 )	2024-04-22 18:30:38 +01:00
ibert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
idefics.md	Port IDEFICS to tensorflow (#26870 )	2024-05-13 15:59:46 +01:00
idefics2.md	🚨 [Idefics2] Update ignore index (#30898 )	2024-05-21 19:38:02 +02:00
idefics3.md	Add Idefics 3! (#32473 )	2024-09-25 21:28:49 +02:00
imagegpt.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
informer.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
instructblip.md	Add video modality for InstrucBLIP (#30182 )	2024-06-25 15:45:39 +05:00
instructblipvideo.md	Add video modality for InstrucBLIP (#30182 )	2024-06-25 15:45:39 +05:00
jamba.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
jetmoe.md	Add JetMoE model (#30005 )	2024-05-14 16:32:01 +02:00
jukebox.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
kosmos-2.md	[KOSMOS-2] Update docs (#27157 )	2023-10-30 21:42:19 +01:00
layoutlm.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
layoutlmv2.md	[Docs] Add language identifiers to fenced code blocks (#28955 )	2024-02-12 10:48:31 -08:00
layoutlmv3.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
layoutxlm.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
led.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
levit.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
lilt.md	[Docs] Add language identifiers to fenced code blocks (#28955 )	2024-02-12 10:48:31 -08:00
llama.md	Add TokenClassification for Mistral, Mixtral and Qwen2 (#29878 )	2024-05-20 10:06:57 +02:00
llama2.md	Fix FA2 integration (#28142 )	2023-12-20 14:25:07 +05:30
llama3.md	Docs - update formatting of llama3 model card (#33438 )	2024-09-12 11:24:56 +02:00
llava_next_video.md	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
llava_next.md	Add auto model for image-text-to-text (#32472 )	2024-10-08 14:26:43 +02:00
llava_onevision.md	[Docs] Improve VLM docs (#33393 )	2024-10-07 09:54:07 +02:00
llava.md	[Docs] Improve VLM docs (#33393 )	2024-10-07 09:54:07 +02:00
longformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
longt5.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
luke.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
lxmert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
m2m_100.md	Add SDPA support for M2M100 (#33309 )	2024-09-25 18:04:42 +01:00
madlad-400.md	Add madlad-400 MT models (#27471 )	2023-11-28 13:19:50 +00:00
mamba.md	Trainer - deprecate tokenizer for processing_class (#32385 )	2024-10-02 14:08:46 +01:00
mamba2.md	quickfix documentation (#32566 )	2024-08-26 17:49:44 +02:00
marian.md	Mention model_info.id instead of model_info.modelId (#32106 )	2024-07-22 14:14:47 +01:00
markuplm.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
mask2former.md	Instance segmentation examples (#31084 )	2024-05-31 16:56:17 +01:00
maskformer.md	Instance segmentation examples (#31084 )	2024-05-31 16:56:17 +01:00
matcha.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
mbart.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
mctct.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mega.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
megatron_gpt2.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
megatron-bert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mgp-str.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
mimi.md	Codec integration (#33565 )	2024-09-18 19:23:44 +02:00
mistral.md	[Port] TensorFlow implementation of Mistral (#29708 )	2024-05-23 17:48:49 +01:00
mixtral.md	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
mllama.md	Add MLLama (#33703 )	2024-09-25 19:56:25 +02:00
mluke.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mms.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
mobilebert.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
mobilenet_v1.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mobilenet_v2.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mobilevit.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mobilevitv2.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mpnet.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mpt.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
mra.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
mt5.md	Adding [T5/MT5/UMT5]ForTokenClassification (#28443 )	2024-02-01 03:53:49 +01:00
musicgen_melody.md	Add MusicGen Melody (#28819 )	2024-03-18 13:06:12 +00:00
musicgen.md	[Docs] Add language identifiers to fenced code blocks (#28955 )	2024-02-12 10:48:31 -08:00
mvp.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
myt5.md	[WIP] Add Tokenizer for MyT5 Model (#31286 )	2024-10-06 10:33:16 +02:00
nat.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
nemotron.md	Add Nemotron HF Support (#31699 )	2024-08-06 15:42:05 +02:00
nezha.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
nllb-moe.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
nllb.md	Add SDPA support for M2M100 (#33309 )	2024-09-25 18:04:42 +01:00
nougat.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
nystromformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
olmo.md	Add OLMo model family (#29890 )	2024-04-17 17:59:07 +02:00
olmoe.md	Add paper link (#33305 )	2024-09-05 15:49:28 +02:00
omdet-turbo.md	Fix docs and docstrings Omdet-Turbo (#33726 )	2024-09-26 12:18:23 -04:00
oneformer.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
open-llama.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
openai-gpt.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
opt.md	add sdpa to OPT (#33298 )	2024-10-10 11:49:34 +02:00
owlv2.md	Fix OWLv2 Doc (#30794 )	2024-05-14 08:36:11 +02:00
owlvit.md	Update bounding box format everywhere (#27944 )	2023-12-11 18:03:42 +00:00
paligemma.md	Paligemma support for multi-image (#33447 )	2024-09-27 11:23:14 +02:00
patchtsmixer.md	[Docs] Add resources (#28705 )	2024-02-19 15:22:29 +01:00
patchtst.md	[Docs] Add resources (#28705 )	2024-02-19 15:22:29 +01:00
pegasus_x.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
pegasus.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
perceiver.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
persimmon.md	Add TokenClassification for Mistral, Mixtral and Qwen2 (#29878 )	2024-05-20 10:06:57 +02:00
phi.md	Fix doctest more (for `docs/source/en`) (#30247 )	2024-04-15 14:10:59 +02:00
phi3.md	phi3 chat_template does not support system role (#30606 )	2024-05-02 15:30:21 +02:00
phimoe.md	PhiMoE (#33363 )	2024-10-04 21:39:45 +02:00
phobert.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
pix2struct.md	🌐 [i18n-ZH] Translate chat_templating.md into Chinese (#28790 )	2024-02-26 08:42:24 -08:00
pixtral.md	[Docs] Improve VLM docs (#33393 )	2024-10-07 09:54:07 +02:00
plbart.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
poolformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
pop2piano.md	[Docs] Add language identifiers to fenced code blocks (#28955 )	2024-02-12 10:48:31 -08:00
prophetnet.md	chore: remove duplicate words (#31853 )	2024-07-09 10:38:29 +01:00
pvt_v2.md	Add PvT-v2 Model (#26812 )	2024-03-13 19:05:20 +00:00
pvt.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
qdqbert.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
qwen2_audio.md	Add Qwen2-Audio (#32137 )	2024-08-08 15:47:24 +02:00
qwen2_moe.md	Add TokenClassification for Mistral, Mixtral and Qwen2 (#29878 )	2024-05-20 10:06:57 +02:00
qwen2_vl.md	[Docs] Improve VLM docs (#33393 )	2024-10-07 09:54:07 +02:00
qwen2.md	Update qwen2.md (#32108 )	2024-07-24 11:54:41 +01:00
rag.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
realm.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
recurrent_gemma.md	[Docs] Update recurrent_gemma.md for some minor nits (#30238 )	2024-04-15 18:30:59 +02:00
reformer.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
regnet.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
rembert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
resnet.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
retribert.md	Deprecate models (#24787 )	2023-07-13 11:46:54 -04:00
roberta-prelayernorm.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
roberta.md	[RoBERTa] Minor clarifications to model doc (#31949 )	2024-07-22 10:08:27 -07:00
roc_bert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
roformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
rt_detr.md	[RT-DETR] Add resources (#31815 )	2024-07-10 16:34:53 +01:00
rwkv.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
sam.md	[Docs] Add Developer Guide: How to Hack Any Transformers Model (#33979 )	2024-10-07 10:08:20 +02:00
seamless_m4t_v2.md	[Seamless] Fix links in docs (#27905 )	2023-12-14 15:14:13 +00:00
seamless_m4t.md	[Seamless] Fix links in docs (#27905 )	2023-12-14 15:14:13 +00:00
segformer.md	Decorators for deprecation and named arguments validation (#30799 )	2024-06-10 12:35:10 +01:00
seggpt.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
sew-d.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
sew.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
siglip.md	Make siglip examples clearer and error free (#33667 )	2024-09-27 10:33:55 +02:00
speech_to_text_2.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
speech_to_text.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
speech-encoder-decoder.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
speecht5.md	add generate method to SpeechT5ForTextToSpeech (#25233 )	2023-08-03 14:12:07 +01:00
splinter.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
squeezebert.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
stablelm.md	Add TokenClassification for Mistral, Mixtral and Qwen2 (#29878 )	2024-05-20 10:06:57 +02:00
starcoder2.md	Add TokenClassification for Mistral, Mixtral and Qwen2 (#29878 )	2024-05-20 10:06:57 +02:00
superpoint.md	[SuperPoint, PaliGemma] Update docs (#31025 )	2024-05-28 13:22:06 +02:00
swiftformer.md	Add TF swiftformer (#23342 )	2024-04-19 18:31:43 +01:00
swin.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
swin2sr.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
swinv2.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
switch_transformers.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
t5.md	Fix doctest more (for `docs/source/en`) (#30247 )	2024-04-15 14:10:59 +02:00
t5v1.1.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
table-transformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
tapas.md	[docs] fixed links with 404 (#27327 )	2023-11-06 19:45:03 +00:00
tapex.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
time_series_transformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
timesformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
trajectory_transformer.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
transfo-xl.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
trocr.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
tvlt.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
tvp.md	Update TVP arxiv link (#27672 )	2023-11-23 17:02:16 +00:00
udop.md	[UDOP] Improve docs, add resources (#29571 )	2024-04-10 16:02:50 +02:00
ul2.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
umt5.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
unispeech-sat.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
unispeech.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
univnet.md	Add UnivNet Vocoder Model for Tortoise TTS Diffusers Integration (#24799 )	2023-11-22 17:21:36 +01:00
upernet.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
van.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
video_llava.md	Enable BNB multi-backend support (#31098 )	2024-09-24 03:40:56 -06:00
videomae.md	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
vilt.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
vipllava.md	Llava: add default chat templates (#31691 )	2024-07-19 10:08:56 +05:00
vision-encoder-decoder.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
vision-text-dual-encoder.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
visual_bert.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
vit_hybrid.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
vit_mae.md	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
vit_msn.md	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
vit.md	Fast image processor (#28847 )	2024-06-11 15:47:38 +01:00
vitdet.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
vitmatte.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
vits.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
vivit.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
wav2vec2_phoneme.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
wav2vec2-bert.md	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
wav2vec2-conformer.md	doc: add info about wav2vec2 bert in older wav2vec2 models. (#31120 )	2024-06-05 11:56:11 +01:00
wav2vec2.md	doc: add info about wav2vec2 bert in older wav2vec2 models. (#31120 )	2024-06-05 11:56:11 +01:00
wavlm.md	[Docs] Fix spelling and grammar mistakes (#28825 )	2024-02-02 08:45:00 +01:00
whisper.md	[docs] add quick usage snippet to Whisper. (#31289 )	2024-08-27 14:11:52 +02:00
xclip.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
xglm.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
xlm-prophetnet.md	Deprecate low use models (#30781 )	2024-05-28 18:07:07 +01:00
xlm-roberta-xl.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
xlm-roberta.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
xlm-v.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
xlm.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
xlnet.md	Fixed Majority of the Typos in `transformers[en]` Documentation (#33350 )	2024-09-09 10:47:24 +02:00
xls_r.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
xlsr_wav2vec2.md	doc: add info about wav2vec2 bert in older wav2vec2 models. (#31120 )	2024-06-05 11:56:11 +01:00
xmod.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
yolos.md	add sdpa to ViT [follow up of #29325 ] (#30555 )	2024-05-16 10:56:11 +01:00
yoso.md	[Docs] Model_doc structure/clarity improvements (#26876 )	2023-11-03 10:57:03 -04:00
zamba.md	Add Zamba (#30950 )	2024-10-04 22:28:05 +02:00
zoedepth.md	Add ZoeDepth (#30136 )	2024-07-08 11:43:33 +02:00