Yih-Dar
|
a7cb92aa03
|
fix / skip (for now) some tests before switch to torch 2.2 (#28838)
* fix / skip some tests before we can switch to torch 2.2
* style
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
|
2024-02-02 14:11:50 +01:00 |
|
Lysandre Debut
|
5c66378cea
|
[Tokenizers] Skip tests temporarily (#26574)
* Skip tests temporarily
* style
* Add additional test
|
2023-10-03 19:43:42 +02:00 |
|
Yih-Dar
|
d9e4bc2895
|
Update tiny model information and pipeline tests (#26285)
* Update tiny model summary file
* add to pipeline tests
* revert
* fix import
* fix import
* fix
* fix
* update
* update
* update
* fix
* remove BarkModelTest
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
|
2023-09-25 18:08:12 +02:00 |
|
Sanchit Gandhi
|
d70fab8b20
|
[TTA Pipeline] Test MusicGen and VITS (#26146)
|
2023-09-15 10:00:36 +01:00 |
|
Sanchit Gandhi
|
2af87d018e
|
[VITS] Fix nightly tests (#25986)
* fix tokenizer
* make bs even
* fix multi gpu test
* style
* model forward
* fix torch import
* revert tok pin
|
2023-09-07 17:49:14 +01:00 |
|
Sanchit Gandhi
|
d750eff627
|
[VITS] Fix init test (#25945)
* [VITS] Fix init test
* add flaky decorator
* style
* max attempts
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
* style
---------
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
|
2023-09-04 17:09:26 +01:00 |
|
ydshieh
|
ab8cba824e
|
CI: hotfix (skip VitsModelTest::test_initialization)
|
2023-09-04 09:06:11 +02:00 |
|
Matthijs Hollemans
|
4ece3b9433
|
add VITS model (#24085)
* add VITS model
* let's vits
* finish TextEncoder (mostly)
* rename VITS to Vits
* add StochasticDurationPredictor
* ads flow model
* add generator
* correctly set vocab size
* add tokenizer
* remove processor & feature extractor
* add PosteriorEncoder
* add missing weights to SDP
* also convert LJSpeech and VCTK checkpoints
* add training stuff in forward
* add placeholder tests for tokenizer
* add placeholder tests for model
* starting cleanup
* let the great renaming begin!
* use config
* global_conditioning
* more cleaning
* renaming variables
* more renaming
* more renaming
* it never ends
* reticulating the splines
* more renaming
* HiFi-GAN
* doc strings for main model
* fixup
* fix-copies
* don't make it a PreTrainedModel
* fixup
* rename config options
* remove training logic from forward pass
* simplify relative position
* use actual checkpoint
* style
* PR review fixes
* more review changes
* fixup
* more unit tests
* fixup
* fix doc test
* add integration test
* improve tokenizer tests
* add tokenizer integration test
* fix tests on GPU (gave OOM)
* conversion script can handle repos from hub
* add conversion script for all MMS-TTS checkpoints
* automatically create a README for the converted checkpoint
* small changes to config
* push README to hub
* only show uroman note for checkpoints that need it
* remove conversion script because code formatting breaks the readme
* make WaveNet layers configurable
* rename variables
* simplifying the math
* output attentions and hidden states
* remove VitsFlip in flow model
* also got rid of the other flip
* fix tests
* rename more variables
* rename tokenizer, add phonemization
* raise error when phonemizer missing
* re-order config docstrings to match method
* change config naming
* remove redundant str -> list
* fix copyright: vits authors -> kakao enterprise
* (mean, log_variances) -> (prior_mean, prior_log_variances)
* if return dict -> if not return dict
* speed -> speaking rate
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* update fused tanh sigmoid
* reduce dims in tester
* audio -> output_values
* audio -> output_values in tuple out
* fix return type
* fix return type
* make _unconstrained_rational_quadratic_spline a function
* all nn's to accept a config
* add spectro to output
* move {speaking rate, noise scale, noise scale duration} to config
* path -> attn_path
* idxs -> valid idxs -> padded idxs
* output values -> waveform
* use config for attention
* make generation work
* harden integration test
* add spectrogram to dict output
* tokenizer refactor
* make style
* remove 'fake' padding token
* harden tokenizer tests
* ron norm test
* fprop / save tests deterministic
* move uroman to tokenizer as much as possible
* better logger message
* fix vivit imports
* add uroman integration test
* make style
* up
* matthijs -> sanchit-gandhi
* fix tokenizer test
* make fix-copies
* fix dict comprehension
* fix config tests
* fix model tests
* make outputs consistent with reverse/not reverse
* fix key concat
* more model details
* add author
* return dict
* speaker error
* labels error
* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update src/transformers/models/vits/convert_original_checkpoint.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* remove uromanize
* add docstrings
* add docstrings for tokenizer
* upper-case skip messages
* fix return dict
* style
* finish tests
* update checkpoints
* make style
* remove doctest file
* revert
* fix docstring
* fix tokenizer
* remove uroman integration test
* add sampling rate
* fix docs / docstrings
* style
* add sr to model output
* fix outputs
* style / copies
* fix docstring
* fix copies
* remove sr from model outputs
* Update utils/documentation_tests.txt
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add sr as allowed attr
---------
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
|
2023-09-01 10:50:06 +01:00 |
|