transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-03 12:50:06 +06:00

History

Yoach Lacombe d2cdefb9ec Add new meta w2v2-conformer BERT-like model (#28165 ) * first commit * correct default value non causal * update config and modeling code * update converting checkpoint * clean modeling and fix tests * make style * add new config parameters to docstring * fix copied from statements * Apply suggestions from code review Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * make position_embeddings_type docstrings clearer * clean converting script * remove function not used * clean modeling file * apply suggestion for test file + add convert script to not_doctested * modify tests according to review - cleaner logic and more tests * Apply nit suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add checker of valid position embeddings type * instantiate new layer norm layer with the right eps * fix freeze_feature_encoder since it can be None in some cases * add test same output in convert script * restore wav2vec2conformer and add new model * create processor and FE + clean * add new model code * fix convert script and set default config parameters * correct model id paths * make style * make fix-copies and cleaning files * fix copied from statements * complete .md and fixe copies * clean convert script argument defaults * fix config parameters docstrings * fix config docstring * add copied from and enrich FE tests * fix copied from and repo-consistency * add autotokenizer * make test input length shorter and change docstring code * fix docstrings and copied from * add add_adapter to ASR training example * make testing of adapters more robust * adapt to multi adapter layers * refactor input_values->input_features and remove w2v2-bert feature extractor * remove pretraining model * remove depreciated features and useless lines * add copied from and ignore statements to modeling tests * remove pretraining model #2 * change import in convert script * change default in convert script * update readme and remove useless line * Update tests/models/wav2vec2_bert/test_processor_wav2vec2_bert.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * refactor BERT to Bert for consistency * remove useless ignore copy statement * add persistent to buffer in rotary * add eps in LayerNorm init and remove copied from * add adapter activation parameters and add copied from statements * Fix copied statements and add unitest.skip reasons * add copied statement in test_processor * refactor processor * make style * replace numpy random by torch rand * remove expected output CTC * improve converting script with processor class * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * remove gumbel class * remove tests related to previously deleted class * Update src/transformers/models/wav2vec2_bert/configuration_wav2vec2_bert.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * correct typos * remove uused parameters * update processor to takes both text and audio * update checkpoints * update expected output and add ctc expected output * add label_attention_mask * replace pt with np in processor tests * fix typo * revert to behaviour with labels_attention_mask --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>		2024-01-18 13:37:34 +00:00
..
asr.md	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
audio_classification.md	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
document_question_answering.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
idefics.md	Translate `en/tasks` folder docs to Japanese 🇯🇵 (#27098 )	2023-12-04 14:10:54 -08:00
image_captioning.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
image_classification.md	Pvt model (#24720 )	2023-07-24 15:34:19 +01:00
image_to_image.md	Image-to-Image Task Guide (#26595 )	2023-10-16 15:12:03 +02:00
knowledge_distillation_for_image_classification.md	fixed typos (issue 27919) (#27920 )	2023-12-11 18:44:23 -05:00
language_modeling.md	Add qwen2 (#28436 )	2024-01-17 16:02:22 +01:00
masked_language_modeling.md	fixed broken link (#27560 )	2023-11-17 08:20:42 -08:00
monocular_depth_estimation.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
multiple_choice.md	Add Multi Resolution Analysis (MRA) (New PR) (#24513 )	2023-07-10 10:50:43 +01:00
object_detection.md	Fixing visualization code for object detection to support both types of bounding box. (#27842 )	2023-12-22 13:24:40 +00:00
prompting.md	[docs] LLM prompting guide (#26274 )	2023-10-12 08:48:01 -04:00
question_answering.md	[`MPT`] Add MosaicML's `MPT` model to transformers (#24629 )	2023-07-25 14:32:40 +02:00
semantic_segmentation.md	Fix indentation error - semantic_segmentation.md (#28117 )	2023-12-18 12:47:54 -05:00
sequence_classification.md	Add qwen2 (#28436 )	2024-01-17 16:02:22 +01:00
summarization.md	Translate `en/tasks` folder docs to Japanese 🇯🇵 (#27098 )	2023-12-04 14:10:54 -08:00
text-to-speech.md	Add FastSpeech2Conformer (#23439 )	2024-01-03 18:01:06 +00:00
token_classification.md	Add Phi-1 and Phi-1_5 (#26170 )	2023-11-10 15:28:30 +00:00
translation.md	Translate `en/tasks` folder docs to Japanese 🇯🇵 (#27098 )	2023-12-04 14:10:54 -08:00
video_classification.md	Add ViViT (#22518 )	2023-07-11 14:04:04 +01:00
visual_question_answering.md	VQA task guide (#25244 )	2023-08-09 08:29:06 -04:00
zero_shot_image_classification.md	[docs] Fix model reference in zero shot image classification example (#26206 )	2023-09-19 00:45:12 +02:00
zero_shot_object_detection.md	fix documentation for zero_shot_object_detection (#28267 )	2024-01-03 09:20:34 -08:00