RemarkableSavings13 t1_ja18scl wrote

Oh, in that case forget trying to distill the Google model: you'd need an ML expert, and that will be expensive. For reference, I have a decade of ML experience, and taking on a project like this would probably cost you at least 10 grand. And that's not even counting the risk that Google is unhappy with what you're doing and bans you from the service for attempting to distill their internal models.

Instead, just use Mozilla's open-source TTS model: https://github.com/mozilla/TTS

It might be slightly lower quality, but you can definitely pay a random coder on Fiverr to just integrate that into a website. No ML experience required, just Python.
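
To give a rough idea of how small the website side is, here's a minimal sketch using only the Python standard library. It is not the Mozilla TTS API: `placeholder_synthesize` is a made-up stand-in that just returns a tone, and the `?text=` parameter, `make_app`, `to_wav_bytes`, and the 22050 Hz sample rate are all assumptions for illustration. Whoever does the integration would swap the placeholder for a real call into the TTS library they install.

```python
import io
import math
import struct
import wave
from urllib.parse import parse_qs

SAMPLE_RATE = 22050  # assumed rate; use whatever the real TTS model outputs


def placeholder_synthesize(text):
    """Hypothetical stand-in for a real TTS call.

    Returns a 440 Hz tone (list of floats in [-1, 1]) whose length grows
    with the text. Replace with a call into the TTS library you install.
    """
    duration = min(0.05 * max(len(text), 1), 2.0)  # seconds, capped at 2
    n = int(SAMPLE_RATE * duration)
    return [0.3 * math.sin(2 * math.pi * 440 * i / SAMPLE_RATE) for i in range(n)]


def to_wav_bytes(samples, sample_rate=SAMPLE_RATE):
    """Pack float samples in [-1, 1] into 16-bit mono WAV bytes."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        w.setframerate(sample_rate)
        clipped = (max(-1.0, min(1.0, s)) for s in samples)
        w.writeframes(b"".join(struct.pack("<h", int(s * 32767)) for s in clipped))
    return buf.getvalue()


def make_app(synthesize=placeholder_synthesize):
    """Build a WSGI app that answers GET /?text=... with audio/wav."""
    def app(environ, start_response):
        query = parse_qs(environ.get("QUERY_STRING", ""))
        text = query.get("text", [""])[0].strip()
        if not text:
            start_response("400 Bad Request", [("Content-Type", "text/plain")])
            return [b"missing ?text= parameter"]
        body = to_wav_bytes(synthesize(text))
        start_response("200 OK", [("Content-Type", "audio/wav"),
                                  ("Content-Length", str(len(body)))])
        return [body]
    return app

# To try it locally (blocks until interrupted):
#   from wsgiref.simple_server import make_server
#   make_server("127.0.0.1", 8000, make_app()).serve_forever()
```

The synthesizer is passed in as a plain function, so the web part doesn't care which model sits behind it. That's the bit a non-ML coder can build and test on its own.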

2

Brunt__ OP t1_ja1abo3 wrote

Thank you. Does https://beta.elevenlabs.io use their own proprietary model? I couldn't find anything on their site. This is the model I'm after.

1

RemarkableSavings13 t1_ja49dzu wrote

Side note, but that model is absolutely exceptional if it actually works as they claim. The "Great Gatsby" reading is phenomenal, with different voices for different characters. If they did that without specifically annotating that they wanted a different voice, I'm super impressed.

1