enkae7317 t1_itnrpt2 wrote on October 25, 2022 at 12:18 AM

#211,034

We can do it now, albeit very poorly. I'd imagine in 5 years we will have close to perfect mimicry of anybody's voice given enough sampling.

Sashinii t1_itns1pv wrote on October 25, 2022 at 12:21 AM

#211,064

2024, which is also my answer for most synthetic media predictions, as that's the year I think AI will become so advanced that people will use AI to create their own personalized entertainment.

FranciscoJ1618 t1_itnubrt wrote on October 25, 2022 at 12:37 AM

#211,310

I think it will be possible in 1 year or less.

RemindMe! 1 year

RemindMeBot t1_itnufye wrote on October 25, 2022 at 12:38 AM

#211,324

Replying to FranciscoJ1618 (#211,310)

I will be messaging you in 1 year on 2023-10-25 00:37:55 UTC to remind you of this link

6 OTHERS CLICKED THIS LINK to send a PM to also be reminded and to reduce spam.

^(Parent commenter can ) ^(delete this message to hide from others.)

^(Info)	^(Custom)	^(Your Reminders)	^(Feedback)

Desperate_Donut8582 t1_itnuo30 wrote on October 25, 2022 at 12:40 AM

#211,338

You already can wasn’t there text to sound websites for the past years SpongeBob, trump etc?

Sandbar101 t1_itnuyak wrote on October 25, 2022 at 12:42 AM

#211,369

Today my guy just not public tech yet

[deleted] t1_itnwmcp wrote on October 25, 2022 at 12:54 AM

#211,540

[removed]

HelloGoodbyeFriend OP t1_itnwsus wrote on October 25, 2022 at 12:55 AM

#211,557

Replying to Desperate_Donut8582 (#211,338)

That’s speech though. I’m curious about re-creating a singers voice.

sonderlingg t1_itnzuno wrote on October 25, 2022 at 1:16 AM

#211,810

Replying to enkae7317 (#211,034)

You highly underestimate pace of progress

Verzingetorix t1_ito0fs6 wrote on October 25, 2022 at 1:20 AM

#211,866

The voice or the singing?

Like others said, computer generated speech mimicry has been demonstrated already.

HelloGoodbyeFriend OP t1_ito2qe3 wrote on October 25, 2022 at 1:37 AM

#212,056

Replying to Verzingetorix (#211,866)

Singing. For example, I have an instrumental track of Fleetwood Mac’s - Landslide and I want Freddy Mercury to sing it.

Found this after I made this post.

https://youtu.be/S2eYaCclnU0

Jmsvrg t1_ito3o0i wrote on October 25, 2022 at 1:43 AM

#212,125

Descript has a consumer product you can train with less than 1 hr of audio… i use a digital version of both my podcast hosts with this tool (sparingly - single words and short phrases ) the intonation needs work but its pretty solid

Desperate_Donut8582 t1_ito5ks7 wrote on October 25, 2022 at 1:57 AM

#212,310

Replying to HelloGoodbyeFriend (#211,557)

Wouldn’t be that harder tho I think we can right now

ihateshadylandlords t1_ito65i7 wrote on October 25, 2022 at 2:02 AM

#212,366

I’m thinking a product that’s available for the masses will be available in 10 years. I wouldn’t be surprised if a proof-of-concept product hasn’t already been created.

Sonic_TertuL t1_ito6611 wrote on October 25, 2022 at 2:02 AM

#212,369

Check out the composer Holly Herndon and Holly+.

quasi_aesthetic t1_ito8o28 wrote on October 25, 2022 at 2:21 AM

#212,591

Replying to Sandbar101 (#211,369)

Here is a Ted talk about it.

quasi_aesthetic t1_ito8qaq wrote on October 25, 2022 at 2:21 AM

#212,601

It's already being done. Here is a Ted talk about it.

Reasonable-Room-307 t1_ito9ni9 wrote on October 25, 2022 at 2:28 AM

#212,674

It’s not possible already?

ishizako t1_itobyvg wrote on October 25, 2022 at 2:46 AM

#212,861

Replying to HelloGoodbyeFriend (#211,557)

Singing is just speech that's modulated by the vocal cords to harmonize.

When we speak normally all that happens is also just notes sounding out in sequences that we are familiar with and have memorized those sequences as words. With singing the notes are just chosen in such an order that they sound good with each other.

Sampling singing voice is not different than sampling speaking voice. Although with current ai models the same person's singing voice and speaking voice would need to be trained on separately. As the common database of both spoken and sung voice would confuse the ai in terms of how to "pronounce" things. As it cannot naturally tell a difference between sung and spoken word.

Primus_Pilus1 t1_itomevr wrote on October 25, 2022 at 4:16 AM

#213,788

A few years from now music composers will be able to make completely virtual works of music using anyone's voices (it's just a particular timbre of the human throat instrument) and instruments to create virtual studio performances.

HelloGoodbyeFriend OP t1_itoo6os wrote on October 25, 2022 at 4:33 AM

#213,941

Replying to quasi_aesthetic (#212,601)

Holy shit. Thank you!!

HelloGoodbyeFriend OP t1_itoo8vu wrote on October 25, 2022 at 4:34 AM

#213,947

Replying to Sonic_TertuL (#212,369)

Just saw the Ted Talk. Spot on. Thank you

Sonic_TertuL t1_itoov7w wrote on October 25, 2022 at 4:40 AM

#214,007

Replying to HelloGoodbyeFriend (#213,947)

No problem. Cheers!

clusterstage t1_itp1aad wrote on October 25, 2022 at 7:13 AM

#215,160

Well, I came across respeecher. You should really try it out.

-ZeroRelevance- t1_itpa60a wrote on October 25, 2022 at 9:23 AM

#215,867

From what I’ve seen, NVIDIA’s Tacotron2 can already be used to create some pretty convincing singing voices, though the examples I’ve seen are mostly rap, so I’m not sure how good they are at more complex singing styles.

DeviMon1 t1_itpbtm9 wrote on October 25, 2022 at 9:46 AM

#216,008

Replying to -ZeroRelevance- (#215,867)

Damn thats pretty good. I've heard a few decent Juice WRLD ones too.

I think it'll take someone to make a convincing MJ cover of a popular new pop song to make this thing blow up and get everyone talking about it.

TheBestOnTheCitadel t1_itpdhkx wrote on October 25, 2022 at 10:09 AM

#216,125

Jukebox AI has done it pretty well

swiggidyswooner t1_itpehah wrote on October 25, 2022 at 10:23 AM

#216,209

Disney’s doing to Darth Vader’s voice so not far off

quasi_aesthetic t1_itpws1p wrote on October 25, 2022 at 1:21 PM

#218,294

Replying to HelloGoodbyeFriend (#213,941)

I just happened to hear her on TED radio hour a few months back. I'm still amazed they could change his voice in real time!

techhouseliving t1_itq3647 wrote on October 25, 2022 at 2:08 PM

#219,062

Replying to ihateshadylandlords (#212,366)

10 years.. No way. Competition is a powerful driver. If it's not already available in beta it'll be available by a dozen vendors in 6 months to a year max

ihateshadylandlords t1_itq5wqr wrote on October 25, 2022 at 2:27 PM

#219,379

Replying to techhouseliving (#219,062)

I hope you’re right…

Cold-Ad2729 t1_itq67vs wrote on October 25, 2022 at 2:30 PM

#219,405

Replying to swiggidyswooner (#216,209)

Can’t wait to hear that song 😉

swampshark19 t1_itq90jp wrote on October 25, 2022 at 2:48 PM

#219,771

Can a singer claim copyright over the sound of their voice?

modestLife1 t1_itqrqxu wrote on October 25, 2022 at 4:51 PM

#222,023

Replying to quasi_aesthetic (#218,294)

i saw holly herndon at a show in austin in 2015 and shook her hand afterwards lol, i was shy. i tried to get through the ted talk but it was too unnerving. the singularity is coming.

Recent-Fish-9233 t1_itr4q1m wrote on October 25, 2022 at 6:14 PM

#223,645

Replying to swampshark19 (#219,771)

Hopefully, and probably yeah Music Industry is very strict with this sort of thing.

HuemanInstrument t1_itrfr7u wrote on October 25, 2022 at 7:25 PM

#224,837

u/HelloGoodbyeFriend

look up TorToiSe Text-to-Speech
Perhaps you could train it one someones singing voice

HelloGoodbyeFriend OP t1_itrh31a wrote on October 25, 2022 at 7:33 PM

#224,959

Replying to -ZeroRelevance- (#215,867)

Exactly what I was looking for thank you. Found this tutorial, gonna give it a try.

https://youtu.be/gVqSEIr2PD4

Additional-Cap-7110 t1_its5u3e wrote on October 25, 2022 at 10:16 PM

#227,644

We haven’t even done perfect regular speech yet

Additional-Cap-7110 t1_its5who wrote on October 25, 2022 at 10:16 PM

#227,653

Replying to ishizako (#212,861)

Singing is going to be much harder.There’s so much variation. Plus words just requires it to sound natural, singing requires much more of a performance and we have all kinds of other aspects. Like singing softly, loudly, vibrato, portamento, rhythm, not it mention notes themselves,

This might make it clear. We can do synthesized percussion much better than we can do synthesized tonal instruments like violins, flutes etc. sampling percussion has always been the easiest thing to get realistic and 100% synthesized instruments are no different.

If you want to sample percussion all you really need aside from recording quality is sampling multiple repetitions and a shit-ton of dynamic layers. The best percussion sample libraries today will have like maybe 10-20 dynamic layers and 5+ to 10+ repetition samples sometimes. You don’t even need that many to make it sound convincing. But with instruments like vocals, violins flutes etc that’s not scratching the surface. These are complex on much higher dimensions and you need completely different techniques to capture them, and even then it’s still not quite right or it’s highly limited in it’s use

Lawjarp2 t1_ituow68 wrote on October 26, 2022 at 1:23 PM

#236,989

Hope they don't copyright voices. Something that can be easily achieved should never be copyrighted. If we can create artificial singer voices and enjoy creating our own music, it will be a mini musical revolution.

challengethegods t1_iu15ui7 wrote on October 27, 2022 at 7:58 PM

#266,176

Replying to ihateshadylandlords (#212,366)

That sounds a lot like: "yea, dalle-mini is neat but it'll be at least 20 years until it can make anything close to what a real artist can do"

challengethegods t1_iu16dgd wrote on October 27, 2022 at 8:01 PM

#266,240

Replying to Lawjarp2 (#236,989)

I can tell you now the AI overlords are not going to like all this copyright/trademark/patent bullshit that people are so obsessed with.

ihateshadylandlords t1_iu1dutf wrote on October 27, 2022 at 8:49 PM

#267,292

Replying to challengethegods (#266,176)

lol if you say so

styxboa t1_iub3y00 wrote on October 29, 2022 at 11:37 PM

#327,678

Replying to sonderlingg (#211,810)

Can you explain this to me? Persuade me that it'll happen in less than 5 years. It seems insane to me, but I'm not well educated enough on it.

sonderlingg t1_iuc4r9e wrote on October 30, 2022 at 5:04 AM

#335,016

Replying to styxboa (#327,678)

Just imagine how many people work on it. How they want to be the first to create AGI. And their number quickly increase.

Imagine how right now many new models are being trained on GPUs. Moore's law still works. Hardware becomes better and better.

We've already recreated many brain's algorithms (art, speech, face recognition, driving and many more). All that's left is to teach a machine how to learn by itself.

And by the way, we already can copy singer's voice, read other comments

styxboa t1_iuccxvj wrote on October 30, 2022 at 6:48 AM

#336,470

Replying to sonderlingg (#335,016)

That makes sense. Thanks.

Do you think it'll rapidly help with things like CRISPR gene editing as well?

sonderlingg t1_iucdjtg wrote on October 30, 2022 at 6:56 AM

#336,559

Replying to styxboa (#336,470)

If AGI is benevolent, it will help with everything.

All things that may increase intelligence, like neurointerfaces, gene editing and maybe unknown drugs, are other ways to singularity. Everything is connected. That's why the progress is exponential. Thought AI way seems the most possible to me

Comments