Patrick von Platen
d72343d2b8
[Wav2Vec2 Speech Event] Add speech event v2 ( #15083 )
...
* up
* up
* up
* up
* up
* up
* improve
* up
* up
* Update src/transformers/trainer.py
* up
* up
* up
2022-01-10 10:46:21 +01:00
Patrick von Platen
600496fa50
[Wav2Vec2] Rename model's feature extractor to feature encoder ( #14959 )
...
* rename classes
* clean up more namings
* remove bogus file
* Apply suggestions from code review
* Apply suggestions from code review
* replace more names
* more regex replace
* make style
* correct
* correct more
* make style
* finish
* correct more in wav2vec2
* make style
* improve freeze_extractor
* add aliases
* add tf aliases
2021-12-28 20:33:23 +01:00
Patrick von Platen
1c121916f3
Add Speech Seq2Seq Training script ( #14792 )
...
* start
* add gradient checkpointing and feature extractor freezing
* Apply suggestions from code review
* up
* up
* up
* correct
* up
* more changes
* up
* up
* up
* remove rst
2021-12-28 10:20:51 +01:00
Patrick von Platen
fa39ff9fc4
Docs for v4.16.0dev0
2021-12-22 20:39:44 +01:00
Patrick von Platen
05fa1a7ac1
Release: v4.15.0
2021-12-22 18:43:15 +01:00
Patrick von Platen
7ae6f07004
[ASR example] Improve example + add more examples ( #14848 )
...
* up
* load up
* up
2021-12-21 13:12:22 +01:00
Patrick von Platen
c4a96cecbc
Wav2Vec2 meets phonemes ( #14353 )
...
* up
* add tokenizer
* improve more
* finish tokenizer
* finish
* adapt speech recognition script
* adapt convert
* more fixes
* more fixes
* update phonemizer wav2vec2
* better naming
* fix more tests
* more fixes swedish
* correct tests
* finish
* improve script
* remove file
* up
* lets get those 100 model architectures until the end of the month
* make fix-copies
* correct more
* correct script
* more fixes
* more fixes
* add to docs
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* replace assert
* fix copies
* fix docs
* new try docs
* boom boom
* update
* add phonemizer to audio tests
* make fix-copies
* up
* upload models
* some changes
* Update tests/test_tokenization_wav2vec2_phoneme.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
* more fixes
* remove @
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
2021-12-17 19:56:44 +01:00
Lysandre
7c9c41f43c
Docs for v4.14.0
2021-12-15 18:29:53 +01:00
Lysandre
960d8cb41d
Release: v4.14.0
2021-12-15 18:20:35 +01:00
Lysandre
ab31b3e41b
Docs for v4.14.0dev0
2021-12-09 17:09:23 +01:00
Lysandre
4da3a696e4
Release: v4.13.0
2021-12-09 16:55:21 +01:00
Patrick von Platen
55f49c5f4b
[Wav2Vec2 Example] Improve fine-tuning script ( #14373 )
...
* improve some stuff
* finish
* correct last
2021-11-12 16:35:57 +01:00
Lysandre
b8fad022a0
v4.13.0.dev0
2021-10-28 12:56:46 -04:00
Lysandre
62bf536631
Release v4.12.0
2021-10-28 12:09:49 -04:00
Matthew Goldey
42bfb83d74
fix typos in error messages in speech recognition example and modelcard.py ( #14166 )
...
* specify the text column name in the error message
* pluralize the word fields
2021-10-26 16:36:26 -04:00
Patrick von Platen
f5ed19f57d
[Speech Recognition] - Distributed training: Make sure vocab file removal and creation don't interfer ( #14161 )
...
* up
* better
2021-10-26 15:59:33 +02:00
Patrick von Platen
e248e9b042
up ( #14154 )
2021-10-26 13:08:18 +02:00
Patrick von Platen
e9d2a639f4
up ( #14093 )
2021-10-21 10:30:02 +02:00
Patrick von Platen
53dc39d821
up ( #14079 )
2021-10-20 13:01:42 +02:00
Anton Lozhkov
dbaf49203e
[Examples] Use Audio feature in speech classification ( #14052 )
...
* Update SEW integration test tolerance
* Update audio classification
* Update test
* Remove torchaudio
* Add dataset revision
* Hub branch naming
* Revert dataset revisions
* Update datasets
2021-10-20 12:22:43 +03:00
Patrick von Platen
bdf31d6e0a
[Speech] Move all examples to new audio feature ( #14045 )
...
* up
* up
* up
* finish
2021-10-18 12:52:40 +02:00
Patrick von Platen
37c5759cbe
[Speech Examples] Add new audio feature ( #14027 )
...
* finish
* up
* finish all
* up
2021-10-17 23:01:03 +02:00
Lysandre
11c69b8045
Docs for version v4.11.0
2021-09-27 14:19:38 -04:00
Lysandre
dc193c906d
Release: v4.11.0
2021-09-27 14:14:09 -04:00
Patrick von Platen
493643fff8
up ( #13733 )
2021-09-24 18:32:35 +02:00
Patrick von Platen
4a320f6c9a
[ASR] Add official ASR CTC example to examples/pytorch/speech-recognition
( #13620 )
...
* up
* rename
* add asr example
* add auto feature extractor
* some more fixes
* correct layerdrop
* correct for multi-gpu dist
* clean up
* refactor
* refactor
* more fixes
* more fixes
* clean-up
* finish
* up
* Apply suggestions from code review
* fix isort
* update
* up
* add note
* apply surajs suggestions
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
* isort
* small change
* Apply suggestions from code review
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
* Apply suggestions from code review
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
* add hubert
* Update examples/pytorch/speech-recognition/run_speech_recognition_ctc.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
2021-09-24 07:01:11 +02:00