transformers/tests/models/speech_to_text
Matthijs Hollemans 7f91950901
audio_utils improvements (#21998)
* silly change to allow making a PR

* clean up doc comments

* simplify hertz_to_mel and mel_to_hertz

* fixup

* clean up power_to_db

* also add amplitude_to_db

* move functions

* clean up mel_filter_bank

* fixup

* credit librosa & torchaudio authors

* add unit tests

* tests for power_to_db and amplitude_to_db

* add mel_filter_bank tests

* rewrite STFT

* add convenience spectrogram function

* missing transpose

* fewer transposes

* add integration test to M-CTC-T

* frame length can be either window or FFT length

* rewrite stft API

* add preemphasis coefficient

* move argument

* add log option to spectrogram

* replace M-CTC-T feature extractor

* fix api thing

* replace whisper STFT

* replace whisper mel filters

* replace tvlt's stft

* allow alternate window names

* replace speecht5 stft

* fixup

* fix integration tests

* fix doc comments

* remove manual FFT length calculation

* fix docs

* go away, deprecation warnings

* combine everything into spectrogram function

* add deprecated functions back

* fixup
2023-05-09 09:10:17 -04:00
..
__init__.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00
test_feature_extraction_speech_to_text.py audio_utils improvements (#21998) 2023-05-09 09:10:17 -04:00
test_modeling_speech_to_text.py 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
test_modeling_tf_speech_to_text.py 🔥Rework pipeline testing by removing PipelineTestCaseMeta 🚀 (#21516) 2023-02-28 19:40:57 +01:00
test_processor_speech_to_text.py [processor] Add 'model input names' property (#20117) 2022-11-10 19:29:20 +00:00
test_tokenization_speech_to_text.py Move test model folders (#17034) 2022-05-03 14:42:02 +02:00