Viewing a single comment thread. View all comments

Akimbo333 t1_j1tlalt wrote

Star Wars the Force Awakens Remake, where (Finn) portrayed by (John Boyega) is more capable, with the potential to become a great (Jedi Knight). Rey is important but not much more than Finn. Luke Skywalker shows up in the end to save both Finn and Rey from Kylo Ren. (Good story), (Award Winning), (Beloved by Fans) 4khd Written by: Timothy Zahn

negative prompts: (bad movie), (weak plot), (pathetic Kylo Ren), (((overpowered Rey)))

But I don't think that such a prompt program, generated by AI would be available until another 15 years.

8

Nintell OP t1_j1tm5d9 wrote

I'm aware that near perfect ai movies are decades away, Most ai image makers can't even get hands right (as of now) I imagined that you'd be thinking of prompts to write in the year 2040 - 2050

7

coumineol t1_j1tp2yi wrote

>I'm aware that near perfect ai movies are decades away

No, it will most probably be possible in 2025 at the latest. Don't miss the incredible achievements of this year and our place at the exponential curve by obsessing over small problems.

6

Akimbo333 t1_j1tr7ls wrote

Yeah. I respect your optimism, but Movies require sound and voices. They really can't do that much especially for over 2 hours. But you might be right and I wrong.

8

coumineol t1_j1tzzkw wrote

Actually I'll say sound and voices are one of the easiest parts, to the degree that it's almost possible even today.

5

Akimbo333 t1_j1u0l47 wrote

Oh right!!!

1

coumineol t1_j1u129c wrote

Have you ever heard of something called... text-to-speech?

2

Akimbo333 t1_j1u21j1 wrote

Yeah, actually, I've seen aspects of it. Though tts is pretty expensive. Like Murf.AI . Hopefully eventually it can be open sourced like Stable Diffusion. There is also this thing called Coqui, but I couldn't figure it out, and the sound quality was utterly terrible lol! But hey, who knows it might very well improve!

3

Nintell OP t1_j1tpfpu wrote

While that is fair... do you know how hard making a movie might actually be? With the plot having to be coherent and the sound also having to be coherent? For an hour aswell? As much as this technology is improving very quickly a full movie being generated in just 2 years from now is a bit quick no?

5

coumineol t1_j1tzpht wrote

Take a pen and paper (or open your favorite drawing software). With a top-down approach progressively identify the steps needed for fully automated movie generation, until you have the smallest ingredients. Think about what kind of tech is needed for each of them. You will notice that no paradigm-breaking discovery is needed - all of them are just advanced versions of existing tools and technology. Now extrapolate for when we can actually get to that level, using the recent speed of development in AI. I'm quite sure you won't be thinking it will take decades.

4

banuk_sickness_eater t1_j1ulf90 wrote

Exactly. People throw out "decades" like it's a salient point when it's really just un-thought through, kneejerk, and "safe" prediction.

2

Nintell OP t1_j1upn8h wrote

Never thought about it like that🤔

1

Antique-Bus-7787 t1_j1upz6y wrote

I don't think the story being coherent is a problem. As you said you just need a bigger LLM that can hold a lot more data in its "cache" to create the overall scenario and then create every scene one after the other.

What I'm more skeptical of are the images and the voices. TTS are good but it's extremely complicated to add the right "emotions" and "ponctuations" to the generated voices for now. Voice conversions are better but you still need a starting voice.

The temporal coherence of videos are the second biggest problem I think.
(Cost to produce that also)
We'll see ! But 2025 seems way too soon for me!

1

coumineol t1_j1urjx2 wrote

I'll give it to you that emotions in TTS are difficult, though I still say what is needed is not a novel algorithm but an enhanced version of today's algorithms. For example the AI that will generate the movie's scenario can also add marks at the text that will indicate the correct emotion or punctuation, and I'm pretty convinced we can have a TTS algorithm that can reasonably abide by those in three years.

1

goldygnome t1_j1uf2h8 wrote

I've seen estimate of about 10 years. It'll definitely be less than 20 years.

1