transformers/docs/source/en/tasks
Yoach Lacombe d2cdefb9ec
Add new meta w2v2-conformer BERT-like model (#28165)
* first commit

* correct default value non causal

* update config and modeling code

* update converting checkpoint

* clean modeling and fix tests

* make style

* add new config parameters to docstring

* fix copied from statements

* Apply suggestions from code review

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* make position_embeddings_type docstrings clearer

* clean converting script

* remove function not used

* clean modeling file

* apply suggestion for test file + add convert script to not_doctested

* modify tests according to review - cleaner logic and more tests

* Apply nit suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add checker of valid position embeddings type

* instantiate new layer norm layer with the right eps

* fix freeze_feature_encoder since it can be None in some cases

* add test same output in convert script

* restore wav2vec2conformer and add new model

* create processor and FE + clean

* add new model code

* fix convert script and set default config parameters

* correct model id paths

* make style

* make fix-copies and cleaning files

* fix copied from statements

* complete .md and fixe copies

* clean convert script argument defaults

* fix config parameters docstrings

* fix config docstring

* add copied from and enrich FE tests

* fix copied from and repo-consistency

* add autotokenizer

* make test input length shorter and change docstring code

* fix docstrings and copied from

* add add_adapter to ASR training example

* make testing of adapters more robust

* adapt to multi adapter layers

* refactor input_values->input_features and remove w2v2-bert feature extractor

* remove pretraining model

* remove depreciated features and useless lines

* add copied from and ignore statements to modeling tests

* remove pretraining model #2

* change import in convert script

* change default in convert script

* update readme and remove useless line

* Update tests/models/wav2vec2_bert/test_processor_wav2vec2_bert.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* refactor BERT to Bert for consistency

* remove useless ignore copy statement

* add persistent to buffer in rotary

* add eps in LayerNorm init and remove copied from

* add adapter activation parameters and add copied from statements

* Fix copied statements and add unitest.skip reasons

* add copied statement in test_processor

* refactor processor

* make style

* replace numpy random by torch rand

* remove expected output CTC

* improve converting script with processor class

* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* remove gumbel class

* remove tests related to previously deleted class

* Update src/transformers/models/wav2vec2_bert/configuration_wav2vec2_bert.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* correct typos

* remove uused parameters

* update processor to takes both text and audio

* update checkpoints

* update expected output and add ctc expected output

* add label_attention_mask

* replace pt with np in processor tests

* fix typo

* revert to behaviour with labels_attention_mask

---------

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
2024-01-18 13:37:34 +00:00
..
asr.md Add new meta w2v2-conformer BERT-like model (#28165) 2024-01-18 13:37:34 +00:00
audio_classification.md Add new meta w2v2-conformer BERT-like model (#28165) 2024-01-18 13:37:34 +00:00
document_question_answering.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
idefics.md Translate en/tasks folder docs to Japanese 🇯🇵 (#27098) 2023-12-04 14:10:54 -08:00
image_captioning.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
image_classification.md Pvt model (#24720) 2023-07-24 15:34:19 +01:00
image_to_image.md Image-to-Image Task Guide (#26595) 2023-10-16 15:12:03 +02:00
knowledge_distillation_for_image_classification.md fixed typos (issue 27919) (#27920) 2023-12-11 18:44:23 -05:00
language_modeling.md Add qwen2 (#28436) 2024-01-17 16:02:22 +01:00
masked_language_modeling.md fixed broken link (#27560) 2023-11-17 08:20:42 -08:00
monocular_depth_estimation.md Migrate doc files to Markdown. (#24376) 2023-06-20 18:07:47 -04:00
multiple_choice.md Add Multi Resolution Analysis (MRA) (New PR) (#24513) 2023-07-10 10:50:43 +01:00
object_detection.md Fixing visualization code for object detection to support both types of bounding box. (#27842) 2023-12-22 13:24:40 +00:00
prompting.md [docs] LLM prompting guide (#26274) 2023-10-12 08:48:01 -04:00
question_answering.md [MPT] Add MosaicML's MPT model to transformers (#24629) 2023-07-25 14:32:40 +02:00
semantic_segmentation.md Fix indentation error - semantic_segmentation.md (#28117) 2023-12-18 12:47:54 -05:00
sequence_classification.md Add qwen2 (#28436) 2024-01-17 16:02:22 +01:00
summarization.md Translate en/tasks folder docs to Japanese 🇯🇵 (#27098) 2023-12-04 14:10:54 -08:00
text-to-speech.md Add FastSpeech2Conformer (#23439) 2024-01-03 18:01:06 +00:00
token_classification.md Add Phi-1 and Phi-1_5 (#26170) 2023-11-10 15:28:30 +00:00
translation.md Translate en/tasks folder docs to Japanese 🇯🇵 (#27098) 2023-12-04 14:10:54 -08:00
video_classification.md Add ViViT (#22518) 2023-07-11 14:04:04 +01:00
visual_question_answering.md VQA task guide (#25244) 2023-08-09 08:29:06 -04:00
zero_shot_image_classification.md [docs] Fix model reference in zero shot image classification example (#26206) 2023-09-19 00:45:12 +02:00
zero_shot_object_detection.md fix documentation for zero_shot_object_detection (#28267) 2024-01-03 09:20:34 -08:00