transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-04 13:20:12 +06:00

History

Yoach Lacombe c43b380e70 Add MusicGen Melody (#28819 ) * first modeling code * make repository * still WIP * update model * add tests * add latest change * clean docstrings and copied from * update docstrings md and readme * correct chroma function * correct copied from and remove unreleated test * add doc to toctree * correct imports * add convert script to notdoctested * Add suggestion from Sanchit Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * correct get_uncoditional_inputs docstrings * modify README according to SANCHIT feedback * add chroma to audio utils * clean librosa and torchaudio hard dependencies * fix FE * refactor audio decoder -> audio encoder for consistency with previous musicgen * refactor conditional -> encoder * modify sampling rate logics * modify license at the beginning * refactor all_self_attns->all_attentions * remove ignore copy from causallm generate * add copied from for from_sub_models * fix make copies * add warning if audio is truncated * add copied from where relevant * remove artefact * fix convert script * fix torchaudio and FE * modify chroma method according to feedback-> better naming * refactor input_values->input_features * refactor input_values->input_features and fix import fe * add input_features to docstrigs * correct inputs_embeds logics * remove dtype conversion * refactor _prepare_conditional_hidden_states_kwargs_for_generation ->_prepare_encoder_hidden_states_kwargs_for_generation * change warning for chroma length * Update src/transformers/models/musicgen_melody/convert_musicgen_melody_transformers.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * change way to save wav, using soundfile * correct docs and change to soundfile * fix import * fix init proj layers * remove line breaks from md * fix issue with docstrings * add FE suggestions * improve is in logics and remove useless imports * remove custom from_pretrained * simplify docstring code * add suggestions for modeling tests * make style * update converting script with sanity check * remove encoder attention mask from conditional generation * replace musicgen melody checkpoints with official orga * rename ylacombe->facebook in checkpoints * fix copies * remove unecessary warning * add shape in code docstrings * add files to slow doc tests * fix md bug and add md to not_tested * make fix-copies * fix hidden states test and batching --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>		2024-03-18 13:06:12 +00:00
..
asr.md	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
audio_classification.md	Add new meta w2v2-conformer BERT-like model (#28165 )	2024-01-18 13:37:34 +00:00
document_question_answering.md	Migrate doc files to Markdown. (#24376 )	2023-06-20 18:07:47 -04:00
idefics.md	[Docs] Fix broken links and syntax issues (#28918 )	2024-02-08 14:13:35 -08:00
image_captioning.md	[Docs] Fix backticks in inline code and documentation links (#28875 )	2024-02-06 11:15:44 -08:00
image_classification.md	Add PvT-v2 Model (#26812 )	2024-03-13 19:05:20 +00:00
image_feature_extraction.md	Image Feature Extraction docs (#28973 )	2024-02-27 09:39:58 +00:00
image_to_image.md	Image-to-Image Task Guide (#26595 )	2023-10-16 15:12:03 +02:00
knowledge_distillation_for_image_classification.md	fixed typos (issue 27919) (#27920 )	2023-12-11 18:44:23 -05:00
language_modeling.md	Add MusicGen Melody (#28819 )	2024-03-18 13:06:12 +00:00
mask_generation.md	Mask Generation Task Guide (#28897 )	2024-02-14 18:29:49 +00:00
masked_language_modeling.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
monocular_depth_estimation.md	Add Depth Anything (#28654 )	2024-01-25 09:34:50 +01:00
multiple_choice.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
object_detection.md	Fixing visualization code for object detection to support both types of bounding box. (#27842 )	2023-12-22 13:24:40 +00:00
prompting.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
question_answering.md	fix the post-processing link (#29091 )	2024-02-19 10:15:58 +00:00
semantic_segmentation.md	Fix indentation error - semantic_segmentation.md (#28117 )	2023-12-18 12:47:54 -05:00
sequence_classification.md	Starcoder2 model - bis (#29215 )	2024-02-28 01:24:34 +01:00
summarization.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
text-to-speech.md	Add FastSpeech2Conformer (#23439 )	2024-01-03 18:01:06 +00:00
token_classification.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
translation.md	Update all references to canonical models (#29001 )	2024-02-16 08:16:58 +01:00
video_classification.md	[Docs] Add language identifiers to fenced code blocks (#28955 )	2024-02-12 10:48:31 -08:00
visual_question_answering.md	VQA task guide (#25244 )	2023-08-09 08:29:06 -04:00
zero_shot_image_classification.md	[docs] Fix model reference in zero shot image classification example (#26206 )	2023-09-19 00:45:12 +02:00
zero_shot_object_detection.md	[Docs] Update README and default pipelines (#28864 )	2024-02-12 10:21:36 +01:00