Viewing a single comment thread. View all comments

sincle354 t1_jcv7v93 wrote

Elevenlabs' AI voice only needs 5 minutes for maximum effect now. Hell, someone took a very minor character with 23 seconds of dialogue (Morshu) and created a convincing meme with it.

18

GonWithTheNen t1_jcvaxrq wrote

Interesting info, thanks! I figured back then that the required length of audio samples would be greatly reduced over time. Honestly, with tech's fast advancements, it's less of a surprise and more of what's expected.

2

notsocoolnow t1_jcximnv wrote

Right now people are using Elevenlabs to emulate the voice of Dagoth Ur, a character in the Elder Scrolls IV Morrowind with only a handful of voice lines, to make funny meme videos.

The capability of the AI tool is quite impressive even with small samples, though the small number of available samples does mean people are also depending on that small sample to get an impression of what the character sounds like.

I imagine it would be different for a character with a large number of samples (which viewers are familiar with) that you only fed a handful of to the AI.

3

GonWithTheNen t1_jd05kbs wrote

Being able to do voice samples "even with small samples" is mind-blowing. Frightening in terms of the possibilities, but mind-blowing.

What I hope all these emulators include is some kind of digital signature (like what Adobe said they'd use) that would detect manipulated audio and the real person's voice.

I mean, emulation seems benign for game-related memes and similar media where any player can open their copy and verify what a voice actor actually said. Other scenarios? Maybe not so much...

1