Mirror of https://github.com/huggingface/transformers.git (synced 2025-07-29 09:12:21 +06:00)
* add: NumberNormalizer works for integers, floats, common currencies, negative numbers and percentages
* fix: renamed number normalizer class and added normalization to SpeechT5Processor
* fix: restyled with black and ruff, should pass code quality tests
* fix: moved normalization to tokenizer and other small changes to normalizer
* add: test for normalization and changed the existing full tokenizer test
* fix: tokenization tests now pass, made changes to existing tokenization where normalization is covered; added normalize arg to func signature
* fix: changed default normalize setting to False, modified the tests a bit
* fix: added support for comma separated numbers, tokenization on the fly with kwargs and normalizer getter setter funcs

| Name |
|---|
| __init__.py |
| test_feature_extraction_speecht5.py |
| test_modeling_speecht5.py |
| test_processor_speecht5.py |
| test_tokenization_speecht5.py |