transformers/tests/models/whisper
Karel Vesely 1a81d774b1
Add dithering to the Speech2TextFeatureExtractor API. (#34638)
* Add dithering to the `Speech2TextFeatureExtractor` API.

- in kaldi : 4a8b7f6732/src/feat/feature-window.cc (L145)
- with dithering without a seed, the features become non-deterministic due
  to small Gaussian noise added to the audio (i.e. 2 runs lead to little
  different outputs)

* update the PR

- add dithering also for WhisperFeatureExtractor
- not adding to Wav2Vec2FeatureExtractor (no FBANK computation)

* add unit-tests for dithering, fix docstrings

* ruff

* utils/check_copies.py --fix_and_overwrite

* update code, add seed to unit-test

* adding explanation of dithering
2025-02-19 11:50:02 +01:00
..
__init__.py
test_feature_extraction_whisper.py Add dithering to the Speech2TextFeatureExtractor API. (#34638) 2025-02-19 11:50:02 +01:00
test_modeling_flax_whisper.py CI: avoid human error, automatically infer generative models (#33212) 2025-02-13 16:27:11 +01:00
test_modeling_tf_whisper.py [tests] remove tf/flax tests in /generation (#36235) 2025-02-17 14:59:22 +00:00
test_modeling_whisper.py CI: avoid human error, automatically infer generative models (#33212) 2025-02-13 16:27:11 +01:00
test_processor_whisper.py feat: Whisper prompting (#22496) 2023-05-19 09:33:11 +01:00
test_tokenization_whisper.py Fix flax whisper tokenizer bug (#33151) 2024-09-12 12:21:59 +01:00