Comments

You must log in or register to comment.

aidenr t1_ja0igb4 wrote

“I’m not a mechanic but I’d like a custom motorcycle. Seems easy enough, anyone up for the task? Or recommend me a commodity worker who can do it for nearly zero. Thanks!”

16

schludy t1_ja14r22 wrote

"Toyota sells entire engines, they seem pretty straight forward to use, so all you would have to do is plug a few things in and we're good to go, probably just 1 or 2 days of work."

2

aidenr t1_ja18i9z wrote

Sure, but we aren’t shopping for a supplier; we want an engineer.

1

NotARedditUser3 t1_ja0in8n wrote

You should paste this into chatgpt. You might get some useful resources on where to go. Short answer.... You expect way too much for a budget of almost nothing

12

Brunt__ OP t1_ja0jcfh wrote

I don't mind paying for the task. I was not expecting it to be inexpensive.

1

junetwentyfirst2020 t1_ja0wjzs wrote

Most people in this field who are able to get jobs in this field have an undergrad in computer science, and a masters degree. It’s applied math + computer science, which is different from being a web developer. There are no people with these degrees who are struggling to find work currently, and they command relatively high salaries at their jobs (>150k USD guaranteed).

You might be able to find a regular dev who could put this together, but if something doesn’t work out of the box the chances that they’ll know how to address the problem is pretty much zero because it’s not just a coding issue. We don’t even look at resumes that don’t have a masters degree because it really is important that the candidate can do all kinds of math, knows the family of algorithms, how to train DL models well, can explain why something did or didn’t work, can do analysis of data and results, and can also write efficient code. LOL it’s a stressful field 😝

2

RemarkableSavings13 t1_ja0kg0w wrote

So you want to use Google Custom Voice service to create a model of your own voice, then distill that voice into a custom on-device model?

1

Brunt__ OP t1_ja0kj6v wrote

Yes

1

RemarkableSavings13 t1_ja0wqn2 wrote

You’re already committed to paying for the custom voice? Honestly if you’re already paying for that they might just offer the option to buy an offline model you can run on prem.

2

Brunt__ OP t1_ja0xlvw wrote

I haven't found that option on their site.

1

RemarkableSavings13 t1_ja12aii wrote

I know, but if you're paying for a custom voice that can't be cheap. I'd guess you'll be paying 5 figures at least for something like this, since you can't buy it without "contacting a sales rep". Your sales rep will be able to tell you if offline models are available, they often are but just aren't advertised.

To be honest though it sounds like you may be out of your depth, the Google custom voice product is expecting you to be a company with a deep pocketbook and a professional voice actor doing the reading. Is that who you are? If you're just some person who wants to use your own voice to read books, look into some of the zero shot TTS tools other people have posted.

1

Brunt__ OP t1_ja176wx wrote

I apologize---the custom voice is myself and any other local people in my project. It's not a new voice by itself. My apologies.

0

RemarkableSavings13 t1_ja18scl wrote

Oh in that case then forget trying to distill the Google model, you'll need an ML expert and that will be expensive. As a reference, I have a decade of ML experience and for me to take on a project like this would probably cost you 10 grand at least. And that's not even counting the fact that Google could be unhappy with what you're doing and you risk getting banned from the service for attempting to distill their internal models.

Instead, just use Firefox's open-source TTS model: https://github.com/mozilla/TTS

It might be slightly lower quality, but you can definitely pay a random coder on Fiverr to just integrate that into a website. No ML experience required, just Python.

2

Brunt__ OP t1_ja1abo3 wrote

Thank you. Does https://beta.elevenlabs.io use their own proprietary model? I couldn't find anything on their site. This is the model I'm after.

1

RemarkableSavings13 t1_ja49dzu wrote

Side note but that model is absolutely exceptional if it's actually as they claim. The "Great Gatsby" reading is phenomenal, with the different voices for different characters. If they did that without specifically annotating they wanted a different voice I'm super impressed.

1

doctorjuice t1_ja1gdif wrote

Expect to pay $100-$200 an hour, will probably take at least months

1

jobeta t1_ja1jdgc wrote

You don’t need code. You can use a service for that. Check Descript overdub for instance. Or whatever other similar thing you can find. I’m not affiliated with them but saw a demo. It will be done overnight after you spend 20 min reading some text.

1