Comments

mamafied t1_j3w66d9 wrote

Check out Coqui TTS. They have all kinds of models, and they own YourTTS, the model compared against in the paper. It is also way faster than Tortoise TTS.

3

CeFurkan OP t1_j3w9hu6 wrote

Are you able to generate speech based on given timings, for example by providing an SRT or VTT file, or to convert speech audio into equivalently timed speech?

Thank you so much for the answers.

1

sayoonarachu t1_j3z22kh wrote

Other than Tortoise TTS, as mentioned above, it's probably best to watch the Microsoft GitHub page. They have a section for VALL-E, and they do tend to release some of the source code for their other models.

It might take a while, as the paper was published only about a week ago and the page still says "work in progress."

https://github.com/microsoft/unilm/blob/master/valle/README.md

3