Submitted by Long8D t3_zz0u5a in MachineLearning

Not sure where else to ask this, as I can't find another subreddit for it. I've heard this AI voice plenty of times. It sounds pretty good and I've seen it used in some videos, but I just can't find it anywhere. Play.ht has a similar voice, but it doesn't flow as well as this one and makes lots of mistakes. I figured maybe someone here has experience with TTS and ran into this one at some point. Below is a sample; it's only 50 seconds long. I need this voice specifically, and I've been searching for days without finding it.

https://sndup.net/x628/

45

Comments

sayoonarachu t1_j2dytbo wrote

Not sure if it's tortoise tts but you should take a look at their examples.

https://nonint.com/static/tortoise_v2_examples.html

3

brucebay t1_j2ew4x6 wrote

Wow, I didn't know about this project, but the samples are very good. Clearly they need some post-processing (I'm guessing he used TED talks, which may explain the slight microphone tone); then they will be even more realistic.

I wonder if he used in-sample text or cherry-picked the best (he mentioned they were some of the better ones).

I will definitely check that out.

Update: listening to the long sentences, some of them clearly used an audiobook library, but some of the short sentences seem to be from TED talks. Update 2: double wow, I just listened to tom and weaver without checking their names. They definitely sound a lot like the real actors, as I immediately recognized them.

3

Long8D OP t1_j2e1rwr wrote

I did check these yesterday but I don't think it's any of them. Maybe it could be william or snakes, but the delivery is just so different.

1

SkinnyJoshPeck t1_j2bspqi wrote

Murf.ai has some that seem really, really close (like Clint) but with the pitch and stuff adjusted. I don't know if you noticed, but where you place commas and periods makes a huge difference in their flow.

"It was September 2005, and their anniversary was coming up. "

gets read differently than

"It was September 2005 and their anniversary was coming up. "

which is still different than

"It was September, 2005, and their anniversary was coming up. "

So I would try different punctuation to make sure you're not missing it just from that alone.
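If you want to try this systematically, here is a minimal sketch in Python that builds every comma/no-comma combination of a sentence so each one can be pasted into whatever TTS tool you're testing. The function name `punctuation_variants` and the way the sentence is split into fragments are just my own illustration, not anything Murf.ai or another service provides.

```python
from itertools import product


def punctuation_variants(parts):
    """Join text fragments with every combination of "" or "," at each gap."""
    gaps = len(parts) - 1
    variants = []
    for commas in product(["", ","], repeat=gaps):
        text = parts[0]
        # Attach each following fragment, with or without a comma before it.
        for sep, part in zip(commas, parts[1:]):
            text += sep + " " + part
        variants.append(text)
    return variants


# Fragments split at the places where a comma might go (hypothetical split).
parts = ["It was September", "2005", "and their anniversary was coming up."]
for sentence in punctuation_variants(parts):
    print(sentence)
```

With two possible comma positions this prints four versions of the sentence, including the three quoted above.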

2

Long8D OP t1_j2crcpd wrote

I've looked at all of them and also tried many variations of the pitch, but it seems like it's none of those voices. Maybe it's just me, but all the Murf.ai voices sound robotic to me.

1

Glycerine t1_j2ch0mm wrote

This sounds like the new neural AI from Google or OpenAI. However, it's hard to place - where did you find the reference?

Microsoft voices are good but I can't hear this one: https://azure.microsoft.com/en-us/products/cognitive-services/text-to-speech/#features


Potentially it could be one of these https://mycroft.ai/mimic-3/

The mimic AI voices sound great and are also offline: https://github.com/MycroftAI/mimic3-voices There are a lot of voices to choose from - some are more human than others https://mycroftai.github.io/mimic3-voices/

I understand it's not the specific voice you're after, but the female English (US) vctk_low -> p225 is phenomenal.

2

Long8D OP t1_j2djlpu wrote

Did a quick look but couldn’t find anything.

1

dodomaze t1_j2d0enm wrote

Sounds like the voice of the elevator news in Mass Effect.

1