sincle354 t1_jcv7v93 wrote on March 19, 2023 at 8:32 PM

Reply to comment by GonWithTheNen in TIL that in 2020, to educate the public about misinformation, researchers used AI to have Richard Nixon deliver the never-used "In Event of Moon Disaster" Apollo 11 speech. by [deleted]

Elevenlabs' AI voice only needs 5 minutes for maximum effect now. Hell, someone took a very minor character with 23 seconds of dialogue (Morshu) and created a convincing meme with it.

GonWithTheNen t1_jcvaxrq wrote on March 19, 2023 at 8:54 PM

Interesting info, thanks! I figured back then that the required length of audio samples would be greatly reduced over time. Honestly, with tech's fast advancements, it's less of a surprise and more of what's expected.

notsocoolnow t1_jcximnv wrote on March 20, 2023 at 9:35 AM

Right now people are using Elevenlabs to emulate the voice of Dagoth Ur, a character in the Elder Scrolls IV Morrowind with only a handful of voice lines, to make funny meme videos.

The capability of the AI tool is quite impressive even with small samples, though the small number of available samples does mean people are also depending on that small sample to get an impression of what the character sounds like.

I imagine it would be different for a character with a large number of samples (which viewers are familiar with) that you only fed a handful of to the AI.

GonWithTheNen t1_jd05kbs wrote on March 20, 2023 at 9:40 PM

Being able to do voice samples "even with small samples" is mind-blowing. Frightening in terms of the possibilities, but mind-blowing.

What I hope all these emulators include is some kind of digital signature (like what Adobe said they'd use) that would detect manipulated audio and the real person's voice.

I mean, emulation seems benign for game-related memes and similar media where any player can open their copy and verify what a voice actor actually said. Other scenarios? Maybe not so much...