Viewing a single comment thread. View all comments

RemarkableSavings13 t1_ja0wqn2 wrote

You’re already committed to paying for the custom voice? Honestly if you’re already paying for that they might just offer the option to buy an offline model you can run on prem.

2

Brunt__ OP t1_ja0xlvw wrote

I haven't found that option on their site.

1

RemarkableSavings13 t1_ja12aii wrote

I know, but if you're paying for a custom voice that can't be cheap. I'd guess you'll be paying 5 figures at least for something like this, since you can't buy it without "contacting a sales rep". Your sales rep will be able to tell you if offline models are available, they often are but just aren't advertised.

To be honest though it sounds like you may be out of your depth, the Google custom voice product is expecting you to be a company with a deep pocketbook and a professional voice actor doing the reading. Is that who you are? If you're just some person who wants to use your own voice to read books, look into some of the zero shot TTS tools other people have posted.

1

Brunt__ OP t1_ja176wx wrote

I apologize---the custom voice is myself and any other local people in my project. It's not a new voice by itself. My apologies.

0

RemarkableSavings13 t1_ja18scl wrote

Oh in that case then forget trying to distill the Google model, you'll need an ML expert and that will be expensive. As a reference, I have a decade of ML experience and for me to take on a project like this would probably cost you 10 grand at least. And that's not even counting the fact that Google could be unhappy with what you're doing and you risk getting banned from the service for attempting to distill their internal models.

Instead, just use Firefox's open-source TTS model: https://github.com/mozilla/TTS

It might be slightly lower quality, but you can definitely pay a random coder on Fiverr to just integrate that into a website. No ML experience required, just Python.

2

Brunt__ OP t1_ja1abo3 wrote

Thank you. Does https://beta.elevenlabs.io use their own proprietary model? I couldn't find anything on their site. This is the model I'm after.

1

RemarkableSavings13 t1_ja49dzu wrote

Side note but that model is absolutely exceptional if it's actually as they claim. The "Great Gatsby" reading is phenomenal, with the different voices for different characters. If they did that without specifically annotating they wanted a different voice I'm super impressed.

1