mirror of
https://github.com/huggingface/transformers.git
synced 2025-07-05 22:00:09 +06:00

* add encode labels function to tokenizer * start adding finetuning * init dropout * upload * correct convert script * apply changes * fix second typo * make first dummy training run * adapt convert script * push confg for comparison * remove conf * finish training * adapt data collator * add research folder * update according to fairseq feedback * some minor corrections * refactor masking indices a bit * some minor changes * clean tokenizer * finish clean-up * remove previous logic * update run script * correct training * finish changes * finish model * correct bug * fix training a bit more * add some tests * finish gradient checkpointing * finish example * correct gradient checkpointing * improve tokenization method * revert changes in tokenizer * revert general change * adapt fine-tuning * update * save intermediate test * Update README.md * finish finetuning * delete conversion script * Update src/transformers/models/wav2vec2/configuration_wav2vec2.py * Update src/transformers/models/wav2vec2/processing_wav2vec2.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * finish wav2vec2 script * finish wav2vec2 fine-tuning * finalize test * correct test * adapt tests * finish * remove test file Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
22 lines
587 B
Bash
Executable File
#!/usr/bin/env bash
python run_asr.py \
--output_dir="./wav2vec2-base-100h" \
--num_train_epochs="30" \
--per_device_train_batch_size="32" \
--per_device_eval_batch_size="32" \
--evaluation_strategy="steps" \
--save_total_limit="3" \
--save_steps="500" \
--eval_steps="100" \
--logging_steps="50" \
--learning_rate="5e-4" \
--warmup_steps="3000" \
--model_name_or_path="facebook/wav2vec2-base" \
--fp16 \
--dataset_name="librispeech_asr" \
--dataset_config_name="clean" \
--train_split_name="train.100" \
--preprocessing_num_workers="32" \
--group_by_length \
--freeze_feature_extractor
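A long chain of backslash continuations is easy to break with one stray character. The sketch below shows one alternative layout for the same kind of invocation: the flags are collected in a Bash array, and the command is printed before it is run. It is an illustration, not part of the upstream script. `DRY_RUN` is a hypothetical environment variable introduced here (it is not read by `run_asr.py`), and only a subset of the flags above is repeated for brevity.

```bash
#!/usr/bin/env bash
# Sketch only: same style of invocation as the script above, with the flags
# held in an array. DRY_RUN is a hypothetical switch invented for this example.

args=(
  --output_dir="./wav2vec2-base-100h"
  --model_name_or_path="facebook/wav2vec2-base"
  --fp16
  --freeze_feature_extractor
)

if [[ "${DRY_RUN:-1}" == "1" ]]; then
  # Print the command with shell quoting instead of executing it.
  printf 'python run_asr.py'
  printf ' %q' "${args[@]}"
  printf '\n'
else
  # Each array element is passed to the script as exactly one argument.
  python run_asr.py "${args[@]}"
fi
```

With the default `DRY_RUN=1` the block only prints the assembled command. Setting `DRY_RUN=0` would launch training, assuming `run_asr.py` is in the current directory as the original script expects.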