transformers/docs/source/en/tasks
Yoach Lacombe 29f1aee3b6
Add SeamlessM4T v2 (#27779)
* add working convertion script

* first non-working version of modeling code

* update modeling code (working)

* make style

* make fix-copies

* add config docstrings

* add config to ignore docstrings formatage due to unconventional markdown

* fix copies

* fix generation num_return_sequences

* enrich docs

* add and fix tests beside integration tests

* update integration tests

* update repo id

* add tie weights and make style

* correct naming in .md

* fix imports and so on

* correct docstrings

* fix fp16 speech forward

* fix speechencoder attention

* make style

* fix copied from

* rename SeamlessM4Tv2-v2 to SeamlessM4Tv2

* Apply suggestions on configuration

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* remove useless public models

* fix private models + better naming for T2U models

* clean speech encoder relative position embeddings

* refactor chunk attention

* add docstrings to chunk attention method

* improve naming and docstrings

* rename some attention variables + add temperature sampling in T2U model

* rename DOCSTRINGS variable names

* make style + remove 2 useless config parameters

* enrich model card

* remove any attention_head reference + fix temperature in T2U

* new fmt and make style

* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* rename spkr_id->speaker_id and change docstrings of get_char_input_ids

* simplify v2attention

* make style

* Update seamless_m4t_v2.md

* update code and tests with last update

* update repo ids

* fill article name, abstract andauthors

* update not_doctested and slow_doc tests

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
2023-11-30 20:24:43 +01:00
..
asr.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
audio_classification.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
document_question_answering.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
idefics.md fix typos in idefics.md (#26648) 2023-10-09 12:18:02 +02:00
image_captioning.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
image_classification.md Pvt model (#24720) 2023-07-24 15:34:19 +01:00
image_to_image.md Image-to-Image Task Guide (#26595) 2023-10-16 15:12:03 +02:00
knowledge_distillation_for_image_classification.md Knowledge distillation for vision guide (#25619) 2023-10-18 04:42:32 -07:00
language_modeling.md fixed broken link (#27560) 2023-11-17 08:20:42 -08:00
masked_language_modeling.md fixed broken link (#27560) 2023-11-17 08:20:42 -08:00
monocular_depth_estimation.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
multiple_choice.md Add Multi Resolution Analysis (MRA) (New PR) (#24513) 2023-07-10 10:50:43 +01:00
object_detection.md [doc] fixed indices in obj detection example (#26343) 2023-09-22 10:29:27 -04:00
prompting.md [docs] LLM prompting guide (#26274) 2023-10-12 08:48:01 -04:00
question_answering.md [MPT] Add MosaicML's MPT model to transformers (#24629) 2023-07-25 14:32:40 +02:00
semantic_segmentation.md Extended semantic segmentation to image segmentation (#27039) 2023-11-23 15:58:21 +00:00
sequence_classification.md Add Phi-1 and Phi-1_5 (#26170) 2023-11-10 15:28:30 +00:00
summarization.md Add SeamlessM4T v2 (#27779) 2023-11-30 20:24:43 +01:00
text-to-speech.md Reflect RoCm support in the documentation (#27636) 2023-11-25 00:59:17 +09:00
token_classification.md Add Phi-1 and Phi-1_5 (#26170) 2023-11-10 15:28:30 +00:00
translation.md Add SeamlessM4T v2 (#27779) 2023-11-30 20:24:43 +01:00
video_classification.md Add ViViT (#22518) 2023-07-11 14:04:04 +01:00
visual_question_answering.md VQA task guide (#25244) 2023-08-09 08:29:06 -04:00
zero_shot_image_classification.md [docs] Fix model reference in zero shot image classification example (#26206) 2023-09-19 00:45:12 +02:00
zero_shot_object_detection.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00