IntelArtiGen t1_ir88qh5 wrote on October 6, 2022 at 1:36 AM

#41,082

>While our internal testing suggest much of explicit and violent content can be filtered out, there still exists social biases and stereotypes which are challenging to detect and filter. We have decided not to release the Imagen Video model or its source code until these concerns are mitigated.

I think they'll never be mitigated and we'll have to wait for other people trying to reproduce the results and make them open-source.

Erosis OP t1_ir8cdlx wrote on October 6, 2022 at 2:07 AM

#41,246

Replying to IntelArtiGen (#41,082)

It seems that Google is being very conservative with the release of their diffusion models compared to even Meta and OpenAI's closed-source approach.

Luckily, Stability AI seems to be working on a video generating diffusion model.

E_Snap t1_ir8fmal wrote on October 6, 2022 at 2:35 AM

#41,386

It’s good to see that hand models will have job security for a while yet

Unicycldev t1_ir8w2fj wrote on October 6, 2022 at 5:14 AM

#42,127

these videos are super trippy. It’s like these algorithms have taken shrooms.

zaptrem t1_ir8ws0l wrote on October 6, 2022 at 5:22 AM

#42,149

Replying to IntelArtiGen (#41,082)

Don’t worry, all we have to do to make the problem go away is end racism and all other biases in real life!

waa007 t1_ir8zhmo wrote on October 6, 2022 at 5:55 AM

#42,253

It’s coming

bobwmcgrath t1_ir927y5 wrote on October 6, 2022 at 6:30 AM

#42,358

Think of all the porn it will make.

massimosclaw2 t1_ir947iu wrote on October 6, 2022 at 6:57 AM

#42,424

Can't wait til this actually starts getting indistinguishable in terms of quality

master3243 t1_ir95zxe wrote on October 6, 2022 at 7:22 AM

#42,486

Replying to Unicycldev (#42,127)

It reminds me of image generation in the early days (a few years ago lol) when it wasn't yet super realistic.

Although this is faster than I expected it's still obviously not at the level of Imagen with image generation.

master3243 t1_ir96avy wrote on October 6, 2022 at 7:26 AM

#42,498

These look quite trippy but amazing nonetheless.

This one in particular is quite impressive

> Prompt: A bunch of autumn leaves falling on a calm lake to form the text "imagen Video". Smooth. >

https://imagen.research.google/video/hdvideos/4.mp4

ThePerson654321 t1_ir98sqw wrote on October 6, 2022 at 8:03 AM

#42,588

Replying to master3243 (#42,486)

I find it difficult to believe we will achieve the same video fidelity compared to image generation.

KeikakuAccelerator t1_ir990ti wrote on October 6, 2022 at 8:06 AM

#42,599

Replying to Erosis (#41,246)

Meta (FAIR) has been very open-source.

master3243 t1_ir9a5wt wrote on October 6, 2022 at 8:24 AM

#42,655

Replying to ThePerson654321 (#42,588)

Image generation is by definition an easier task so the two will never catch up.

But do you not think that at some point in the future, video generation in the year 20XX will be better than image generation in 2022?

Even in the year 2050 or 2100?

ThePerson654321 t1_ir9adad wrote on October 6, 2022 at 8:27 AM

#42,662

Replying to master3243 (#42,655)

Perhaps a few seconds but never a full movie.

101111010100 t1_ir9axqd wrote on October 6, 2022 at 8:35 AM

#42,677

Replying to IntelArtiGen (#41,082)

Thank god humanity is still save. Once there are open-source versions, a lot of people will be harmed. /s

master3243 t1_ir9bp3h wrote on October 6, 2022 at 8:47 AM

#42,701

Replying to ThePerson654321 (#42,662)

What about a coherent 30 second silent clip from a short description that is as difficult to distenguish from real images as current SOTA image generation.

nomadiclizard t1_ir9de47 wrote on October 6, 2022 at 9:12 AM

#42,751

Replying to bobwmcgrath (#42,358)

Yessssss! As soon as it's prised from Googles corporate nanny-state filters and trained up on some proper sources :D

tdgros t1_ir9hdy2 wrote on October 6, 2022 at 10:10 AM

#42,916

Replying to ThePerson654321 (#42,662)

Phenaki already shows the generation of 2mn videos (using lots of prompts): https://phenaki.video/#interactive it's not that far fetched to imagine that working on longer prompts and videos...

Erosis OP t1_ir9kj9k wrote on October 6, 2022 at 10:50 AM

#43,062

Replying to KeikakuAccelerator (#42,599)

I'm referring to their new Make-A-Video model, but I suppose they just announced that a few days ago. Hopefully they fully release that model.

canyonkeeper t1_ir9nb50 wrote on October 6, 2022 at 11:22 AM

#43,197

It’s not research if it’s not replicable /not s

ellioso t1_ir9ncfb wrote on October 6, 2022 at 11:23 AM

#43,199

Replying to IntelArtiGen (#41,082)

It'll probably never be perfect but that doesn't mean it won't get released. The SafeSearch filter for google is really good.

RBUexiste-RBUya t1_ir9npts wrote on October 6, 2022 at 11:27 AM

#43,222

Replying to Unicycldev (#42,127)

Our dreams or nightmares when the humans (and other animals) are asleep, are trippier than that :-D

That's how our brain fights agains itself to discard impossible things, movements, situations, physics, etc. Dreams and trippy minds are the best neuronal thinking (later, is needed a good discard of not wanted results, of course)

Do you remember that old cat-face-recognition that only saw cat faces in supermarket, stores, etc? That was very trippy too (and a little schizophrenic)

IanMazgelis t1_ir9q097 wrote on October 6, 2022 at 11:50 AM

#43,362

Replying to IntelArtiGen (#41,082)

This alone is why I have zero interest in proprietary diffusion algorithms. I want to make whatever I can imagine, not what shareholders tell I'm allowed to think about.

ECEngineeringBE t1_ir9q399 wrote on October 6, 2022 at 11:51 AM

#43,365

Replying to canyonkeeper (#43,197)

Damn, that must mean that all those experiments they run at CERN aren't research because I can't replicate them in my kitchen.

LyutsiferSafin t1_ir9tx5e wrote on October 6, 2022 at 12:27 PM

#43,585

I made one too. I can’t show it to you guys, just like google. What’s the point of showcasing something and never giving access to it?

xrailgun t1_ira22vy wrote on October 6, 2022 at 1:34 PM

#44,119

Replying to ECEngineeringBE (#43,365)

Shit straw man take.

nraw t1_ira4z1k wrote on October 6, 2022 at 1:56 PM

#44,341

Replying to xrailgun (#44,119)

He tried!

nraw t1_ira536l wrote on October 6, 2022 at 1:57 PM

#44,348

Replying to LyutsiferSafin (#43,585)

Media talk. Corporate buzzwords and wanting to jump on board.

[deleted] t1_irabn95 wrote on October 6, 2022 at 2:43 PM

#44,759

[removed]

ECEngineeringBE t1_irac32j wrote on October 6, 2022 at 2:46 PM

#44,780

Replying to xrailgun (#44,119)

How so?

I could say the same about that "shit gatekeeping take"

FriendlyRope t1_iragqww wrote on October 6, 2022 at 3:17 PM

#44,991

Replying to ECEngineeringBE (#43,365)

He does has a point, if there is no independent verification of an experiment (I.e. replication, or at least independent inspection of an experiment) the experiment can not be said to be trusted. For example the results shown could be "cherry picked" or the test data could be contaminated by training data.

dexter89_kp t1_iragyoj wrote on October 6, 2022 at 3:19 PM

#44,998

Replying to Erosis (#43,062)

That was trained on shutterstock data. They can’t release it

ECEngineeringBE t1_irahdyd wrote on October 6, 2022 at 3:21 PM

#45,015

Replying to FriendlyRope (#44,991)

Sure, but just because you can't replicate it, doesn't mean that nobody can. We already had Facebook's paper on video generation a week ago, and we also have stability AI saying that they're planning their own model.

And also, just because the results can't be fully trusted (due to high barrier of replicability), does not mean that the publication isn't "research".

wtf-hair-do t1_iraoudr wrote on October 6, 2022 at 4:10 PM

#45,402

Replying to ThePerson654321 (#42,662)

they'll just never figure it out and give up

ThePerson654321 t1_iraro8a wrote on October 6, 2022 at 4:29 PM

#45,567

Are you being ironic?

gwern t1_irassaq wrote on October 6, 2022 at 4:36 PM

#45,610

Replying to Erosis (#43,062)

They said they were considering releasing Make-A-Scene but never wound up doing so, even though it's probably not much better than the released SD model and there would seem to be fairly minimal marginal harm from a release. So I don't expect Make-A-Video to be released either, even if they say they might.

that_boi_zesty t1_iraxzc0 wrote on October 6, 2022 at 5:09 PM

#45,854

could this be used to artificially continue videos like how some image models can "zoom out"?

cleverestx t1_irbc5ii wrote on October 6, 2022 at 6:43 PM

#46,565

Replying to IntelArtiGen (#41,082)

Hopefully unstable diffusion gets a hold of this, and implements this technology, so people don't have to wait forever to be able to create stuff, as growups should be able to, without anything besides the most extreme legal restrictions in place.

cleverestx t1_irbcdi0 wrote on October 6, 2022 at 6:45 PM

#46,578

Replying to ThePerson654321 (#42,662)

Why not? I admit it IS more challenging, but video is only a series of images...

ThePerson654321 t1_irbck16 wrote on October 6, 2022 at 6:46 PM

#46,588

Replying to cleverestx (#46,578)

They said the same thing about nuclear fusion reactors.

cleverestx t1_irbcqd5 wrote on October 6, 2022 at 6:47 PM

#46,597

Replying to ThePerson654321 (#46,588)

Those reactors are not a series of images.

Veedrac t1_irbdxgp wrote on October 6, 2022 at 6:55 PM

#46,656

Replying to canyonkeeper (#43,197)

It blows my mind that this sub has basically become ML Flat Earth. There's no legitimate way people here actually think this research can't or won't replicate. “Oh but I can't personally ~~put my foot on the moon~~ play with this exact model exactly today” is such a fake argument.

throwawayguy91 t1_irbfj1m wrote on October 6, 2022 at 7:06 PM

#46,735

Replying to FriendlyRope (#44,991)

thats the whole reason ATLAS and CMS work independently from each other

[deleted] t1_irbmitn wrote on October 6, 2022 at 7:55 PM

#47,123

Replying to Veedrac (#46,656)

[removed]

gpahul t1_irbobh4 wrote on October 6, 2022 at 8:07 PM

#47,225

Replying to KeikakuAccelerator (#42,599)

I noticed MS is also contributing in doc related AI research!

Gilberto125 t1_irbr5dc wrote on October 6, 2022 at 8:27 PM

#47,384

I want to see a movie done only with this. Just with the script as input.

sam__izdat t1_irc0229 wrote on October 6, 2022 at 9:33 PM

#47,882

Replying to IntelArtiGen (#41,082)

Any confluence with the interests of private capital is purely coincidental with our selfless quest for the betterment of man.

xrailgun t1_irc7qic wrote on October 6, 2022 at 10:34 PM

#48,248

Replying to ECEngineeringBE (#44,780)

In case you're serious, physics papers are crammed full of mathematical derivation first to logically support their hypotheses, then include all relevant conditions and parameters such that IF/WHEN you get access to the collider, key in the same, you could replicate them.

In ML, mathematical support still exists to varying degrees, but without sharing the source code, even if you had access to Google's/OpenAI's/Nvidia's billion dollar hardware, you can't replicate it.

yaosio t1_irch8j4 wrote on October 6, 2022 at 11:56 PM

#48,693

Replying to IntelArtiGen (#41,082)

It burns my bread that they are always worried about explicit scenes and violence. If they were producers for Django Unchained they would demand all the violence and bad words be removed.

EmbarrassedHelp t1_ircm1je wrote on October 7, 2022 at 12:38 AM

#48,927

Replying to IanMazgelis (#43,362)

Its certainly harder to trust closed source implementations can do what they claim to do

eposnix t1_ircvtgw wrote on October 7, 2022 at 2:01 AM

#49,387

Replying to xrailgun (#48,248)

Wait, what? I mean, maybe not 100%, but there are metric fuckloads of open source implementations of closed source models replicated by just the method in the paper.

BalorNG t1_irdv27c wrote on October 7, 2022 at 8:56 AM

#50,606

Replying to zaptrem (#42,149)

Yea. The problem is never with the model - it is with the people. In a way, models trained on huge corpus of data is the most "democratic" way of representing reality - removing "biases" from it is castrating it. Those that are to exploit those biases needs to be dealt with on an individual basis.

BalorNG t1_irdv58b wrote on October 7, 2022 at 8:57 AM

#50,613

Replying to E_Snap (#41,386)

Handjob security personnel... hmm...

brates09 t1_irhz7ml wrote on October 8, 2022 at 9:09 AM

#56,819

Replying to EmbarrassedHelp (#48,927)

Are there examples of the recent big model work that haven’t been able to be replicated in terms of quality? Seems much more likely to attribute to conservatism of the companies rather than deception about the results.

Decent-Possible-9714 t1_irk8q2e wrote on October 8, 2022 at 10:01 PM

#60,195

Replying to IntelArtiGen (#41,082)

what are the true technical limitations of their model (other than the obvious improve training/testing acc.)? they don't seem to explicitly detail it (meta explained numerous issues with their text-to-video model).

Comments