Sasbe93 t1_itg6e6s wrote on October 23, 2022 at 12:19 PM

Reply to Given the exponential rate of improvement to prompt based image/video generation, in how many years do you think we'll see entire movies generated from a prompt? by yea_okay_dude

This question is actually one of my favorite topics right now. At the beginning of this year, I was still guessing about 10 years, and even that estimate got skeptical looks.

Given the rapid progress in text-to-video in recent months, I now assume a maximum of 5 years. With Google's Imagen Video you can make high-resolution clips, and with Meta's Make-A-Video you can even make short clips out of images. The latter can even be fed two images, and it creates a clip that bridges them. With these tools you can already create coherent movies with a little effort.

But making a coherent movie from a single prompt is a different task. Really, though, we only have to wait for two things: an AI that converts a whole script into individual, coordinated video scenes, and an AI that creates a whole script from a prompt. Once these two can work together, a full movie could be generated from a single prompt. I expect the former in 4-5 years and the latter in two years at most. And sometimes I even think I am overestimating how long it will take.

I think the only thing that could delay this prediction is if something happens with Taiwan.