Submitted by NotANumber13 t3_zyo7mg in singularity

I was thinking about how an entire story can be written by an AI from a short prompt, then I extrapolated that thought to how long it would take to turn that writing prompt into a movie. I do not think it is too crazy of a thought. Here is why. You tell the AI what your'e in the mood for then the script would be written within a few minutes. From there it could generate graphics that are almost lifelike with a soundtrack based on something like your like on your favorite music app that you are currently logged into and sharing data. I think the bottleneck would be processing power to generate all of the graphics.

​

Your thoughts?

12

Comments

You must log in or register to comment.

Sashinii t1_j2709i2 wrote

AI such as Phenaki already does this to some extent, but most people can't use the tech yet.

I think publically available text-to-video synthesis in 2023 and it'll be perfected in 2026.

7

reconditedreams t1_j27x8hs wrote

Saying text to video will be perfected in 2026 seems wildly optimistic to me.

Maybe text to image will be nearly perfected by then, but text to video is an entirely different ballpark.

6

Gotisdabest t1_j28f6a8 wrote

Basic text to video already exists. Considering how fast text to image has grown, it's not unreasonable to have an optimistic timeline. As for perfect text to image, i really don't see it taking more than a year or two from now given the current rate of advancement (compare something like midjourney v1 to V4, which is around 8-9 months of progress). Obviously they could hit a major snag somewhere but i would not bet on it.

7

Nintell t1_j276qti wrote

2026 for perfection? The basic version is already here but won't perfection take long since it's video we're talking about here? Like this stuff can't be easy to make man idk

2

Sashinii t1_j277ay6 wrote

Yes, 2026, because of exponential growth.

Most of the technology that exists today wouldn't exist yet if not for exponential growth.

8

venix124 t1_j27xab2 wrote

There are hardly any big research org working on training AI to learn how movies work, even though we have enough data resource to train, we have huge liberary of movies with scripts as prompt, since every frame of a movie is to be analysed and broken down to details, it will be a huge workload for training, much bigger than what Big data companies compute today,

TO ANSWER THE QUESTION -

We can possibly see a incomplete version of AI short movie generation that looks realistic In 2025-2027 it will take few more years to perfect the fine creation

Last remarks , as you know even with stable diffusion the video that are created are not constant with character and placement, so yeah we have long way to go, rightful constant image generation will be a major pillar for such projects

Edit : dont forget about how charactar generations with behavioral charactarstics and distinct sound of charactars are important,

5

666vampiric t1_j27jng9 wrote

LLM's struggle with both long form content and anything requiring a creative element (and not boring predictable text summary-like behavior). Combining in a multi-media format is 20 years away.

4

Nintell t1_j276iif wrote

Publically available 2023

Perfected 2045 - 2055

2

Akimbo333 t1_j27u545 wrote

The processing power could be intense, though. Which is why a lot of people give it 20 years, lol!

2