transformers

mirror of https://github.com/huggingface/transformers.git synced 2025-07-07 14:50:07 +06:00

Author	SHA1	Message	Date
Patrick von Platen	d72343d2b8	[Wav2Vec2 Speech Event] Add speech event v2 (#15083 ) * up * up * up * up * up * up * improve * up * up * Update src/transformers/trainer.py * up * up * up	2022-01-10 10:46:21 +01:00
Stas Bekman	77262ef750	fix --gradient_checkpointing (#13964 )	2021-11-11 17:50:21 +01:00
Antonio Carlos Falcão Petri	05a2afc252	Add missing --validation_split_percentage data args (#14119 )	2021-10-22 19:04:54 +02:00
Patrick von Platen	7fb2a8b3d9	up (#14008 )	2021-10-14 15:46:22 +02:00
Patrick von Platen	24cbf6bc5a	Update README.md	2021-08-08 17:11:19 +02:00
21jun	5c673efad7	fix typo in gradient_checkpointing arg (#12855 ) help for `ModelArguments.gradient_checkpointing` should be "If True, use gradient checkpointing to save memory at the expense of slower backward pass." not "Whether to freeze the feature extractor layers of the model." (which is duplicated from `freeze_feature_extractor` arg)	2021-07-30 15:06:33 +08:00
Stas Bekman	98364ea74f	[tests] fix logging_steps requirements (#12860 )	2021-07-23 08:05:48 -07:00
Patrick von Platen	2e9fb13fb1	[Wav2Vec2] Correctly pad mask indices for PreTraining (#12748 ) * fix_torch_device_generate_test * remove @ * start adding tests * correct wav2vec2 pretraining * up * up Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-07-15 21:40:25 +01:00
Stas Bekman	4a872caef4	remove extra white space from log format (#12360 )	2021-06-25 13:20:14 -07:00
Stas Bekman	88e84186e5	[style] consistent nn. and nn.functional: part 4 `examples` (#12156 ) * consistent nn. and nn.functional: p4 examples * restore	2021-06-14 12:28:24 -07:00
Anton Lozhkov	d472bd7b18	Wav2Vec2 Pretraining (#11306 ) * Working quantizer forward * Working quantizer forward * Clean up unused model parts, test reproducibility * Working quantizer forward * Clean up unused model parts, test reproducibility * Remove custom outputs from the shared ones * correct conversion * correct bug * add first pretrain script * save intermediate * static shapes * save intermediate * finish first pretrain script version * more refactor * remove wanddb * refactor more * improve test * correct perplexity compute bug * finish model implementation * add to docs * finish docs * finish pretraining script * finish pretraining script * remove wandb * finish PR for merge * finish config * finish * make deepspeed work * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply suggestions * fix flaky test Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-09 18:40:56 +01:00
Stas Bekman	d14e0af274	sync LayerDrop for Wav2Vec2Encoder + tests (#12076 )	2021-06-09 13:21:03 +01:00
Stas Bekman	11d86d3de4	[Deepspeed Wav2vec2] integration (#11638 ) * wip * wip - but working with https://github.com/microsoft/DeepSpeed/pull/1044 * cleanup * workaround * working 5/8 modes * solve fp32 distributed zero3 * style * sync * sync * rework * deprecation * cleanup * https://github.com/microsoft/DeepSpeed/pull/1044 pr was merged * clean up * add a guide * more prose * more prose * fix * more prose * sub_group_size was too big * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * refactor * bug fix * make the true check explicit * new deepspeed release Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-06-08 12:32:03 -07:00
Philip May	77f4c46b50	remove defaults to None if optional (#11703 )	2021-05-12 09:11:10 -04:00
Nithin Holla	653076ca30	Save the Wav2Vec2 processor before training starts (#10910 ) Co-authored-by: nithin19 <nithin@amberscript.com>	2021-04-14 14:52:06 +03:00
Yih-Dar	e031162a6b	fix md file to avoid evaluation crash (#10962 )	2021-03-30 21:26:22 +03:00
Qiushi Pan	29904a967b	Update FINE_TUNE_XLSR_WAV2VEC2.md (#10849 ) Fix typo.	2021-03-22 07:58:59 -04:00
Patrick von Platen	0f226f78ce	push (#10846 )	2021-03-22 10:32:21 +03:00
Suraj Patil	82b8d8c7b0	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-21 22:47:09 +05:30
Patrick von Platen	af6125ffdb	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-21 12:31:33 +03:00
Patrick von Platen	5aaf6e1460	small improvements for wav2vec2 info script (#10829 )	2021-03-21 11:41:44 +03:00
Suraj Patil	68b55885ed	add doc for Local machine (#10828 )	2021-03-21 13:25:34 +05:30
Julien Chaumond	1438c487df	wav2vec doc tweaks (#10808 ) * wording/typos tweaks * Make model upload instructions simpler	2021-03-19 12:48:54 -04:00
Patrick von Platen	b9570a813c	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-19 19:45:28 +03:00
Patrick von Platen	e8968bd03a	[XLSR-Wav2Vec2 Info doc] Add a couple of lines (#10806 ) * finish * fix * fix * fix * fix	2021-03-19 12:52:54 +03:00
Patrick von Platen	2ae678229f	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-19 00:29:20 +03:00
Patrick von Platen	68a3215949	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-19 00:27:40 +03:00
Patrick von Platen	03df3fbcb4	Update FINE_TUNE_XLSR_WAV2VEC2.md	2021-03-19 00:26:49 +03:00
Patrick von Platen	e84adbed40	Add XLSR-Wav2Vec2 Fine-Tuning README.md (#10786 ) * upload * upload fine-tuning script * improve * adapt * Apply suggestions from code review * correct * upload * finalize * remove @ * correct typos	2021-03-19 00:22:43 +03:00
Suraj Patil	5f19c07a70	add run_common_voice script (#10767 ) * add initial script * finish script * add shell script example * accept chars_to_ignor as cl arg * align the script with other example scripts * add torchaudio dep	2021-03-18 17:21:16 +05:30
Mohamed El-Geish	af8afdc88d	wav2vec2: support datasets other than LibriSpeech (#10581 ) * wav2vec2: support datasets other than LibriSpeech * Formatting run_asr.py to pass code quality test * bundled orthography options and added verbose logs * fixing a typo in timit fine-tuning script * update comment for clarity * resize_lm_head and load custom vocab from file * adding a max_duration_in_seconds filter * do not assign `duration_filter` lambda, use a def * log untransliterated text as well * fix base model for arabic * fix duration filter when target_sr is not set * drop duration_in_seconds when unneeded * script for wav2vec2-large-lv60-timit-asr * fix for "tha" in arabic corpus (huggingface#10581) * adding more options to work with common_voice * PR feedback (huggingface#10581) * small README change	2021-03-18 10:20:26 +03:00
Patrick von Platen	395ffcd757	fix run seq2seq (#10547 )	2021-03-05 18:17:12 +03:00
Patrick von Platen	0234de8418	Add Fine-Tuning for Wav2Vec2 (#10145 ) * add encode labels function to tokenizer * start adding finetuning * init dropout * upload * correct convert script * apply changes * fix second typo * make first dummy training run * adapt convert script * push confg for comparison * remove conf * finish training * adapt data collator * add research folder * update according to fairseq feedback * some minor corrections * refactor masking indices a bit * some minor changes * clean tokenizer * finish clean-up * remove previous logic * update run script * correct training * finish changes * finish model * correct bug * fix training a bit more * add some tests * finish gradient checkpointing * finish example * correct gradient checkpointing * improve tokenization method * revert changes in tokenizer * revert general change * adapt fine-tuning * update * save intermediate test * Update README.md * finish finetuning * delete conversion script * Update src/transformers/models/wav2vec2/configuration_wav2vec2.py * Update src/transformers/models/wav2vec2/processing_wav2vec2.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * finish wav2vec2 script * finish wav2vec2 fine-tuning * finalize test * correct test * adapt tests * finish * remove test file Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2021-03-01 12:13:17 +03:00

33 Commits