Submitted by CeFurkan t3_10z6ke2 in MachineLearning

Greetings everyone.

I am looking for the best text to speech AI model out there for english

I am looking for links to the models you know as best

If the model supports subtitle file to speech that would be even more awesome

Like providing .srt or .vtt to generate speech - speeding up the necessary parts of speech to fit into durations

Thank you very much again

I will use this to replace audio of my older lecture recordings by providing a time generated manually corrected subtitle file like srt or vtt

I am looking for any male sounding model that sounds natural

​

I have found this

They colab and looks very easy to generate. I think I can automate it. But is this one the best?

https://www.reddit.com/r/MachineLearning/comments/v9rigf/p_silero_tts_full_v3_release/

found this too but only female voice :/

https://www.reddit.com/r/MachineLearning/comments/ttgsr4/r_nixtts_an_incredibly_lightweight_texttospeech/

I need a male voice

any other good ones?

​

11

Comments

You must log in or register to comment.

tetelestia_ t1_j83o6nh wrote

Google's API is pretty cheap. Might be free depending on how much you need.

2

CeFurkan OP t1_j8466db wrote

I have been spending time with Tortoise TTS since yesterday

couldn't produce my voice yet but i am understanding :/

it is also super slow - damn slow on rtx 3060 - cuda running

1