Text-Based Real Image Editing

Code: https://github.com/ShivamShrirao/diffusers/tree/main/examples/imagic

Colab: https://colab.research.google.com/github/ShivamShrirao/diffusers/blob/main/examples/imagic/Imagic_Stable_Diffusion.ipynb

I still need to play around and tune the parameters a bit; it may not work as-is on every subject. Hopefully everyone can try it out now.

Input Image

A photo of Barack Obama smiling with a big grin.

Comments

advertisementeconomy t1_iswsi3r wrote on October 19, 2022 at 7:37 AM

#140,427

Wow. The pace is exciting. Is that the Barack from the original tweet or was it run through this implementation?

Here's the README for anyone interested: https://github.com/ShivamShrirao/diffusers/blob/main/examples/imagic/README.md

Is the .ipynb file a Jupyter Notebook that could be run locally on a card with 12 GB of VRAM? (Forgive me if this is a stupid question; using Colab and Jupyter is new to me.)

0x00groot OP t1_iswtu7x wrote on October 19, 2022 at 7:57 AM

#140,472

Replying to advertisementeconomy (#140,427)

This was produced with this implementation.

Yes, you can run it locally with 12 GB of VRAM.

Roarexe t1_isx0dpe wrote on October 19, 2022 at 9:36 AM

#140,710

Awesome, thanks for sharing!

ThatInternetGuy t1_isxfzu2 wrote on October 19, 2022 at 12:33 PM

#141,600

This Shivam Shrirao guy is super fast! Took him two days to make Dreambooth scripts and now just one day to make Imagic scripts.

0x00groot OP t1_isxgvlo wrote on October 19, 2022 at 12:41 PM

#141,677

Replying to ThatInternetGuy (#141,600)

Haha. Thanks

deep-yearning t1_isxlgw1 wrote on October 19, 2022 at 1:18 PM

#142,046

paging Automatic1111

pls implement in webui

nmkd t1_isxlyh2 wrote on October 19, 2022 at 1:22 PM

#142,091

Replying to deep-yearning (#142,046)

This is not Windows compatible as far as I know.

0x00groot OP t1_isxnm43 wrote on October 19, 2022 at 1:35 PM

#142,204

Replying to nmkd (#142,091)

Some people have been able to run xformers on Windows.

https://github.com/huggingface/diffusers/pull/532#issuecomment-1273656447

deep-yearning t1_isxpazd wrote on October 19, 2022 at 1:48 PM

#142,314

Replying to nmkd (#142,091)

Automatic1111's webui runs on Linux or Windows.

thelastpizzaslice t1_isxw4bb wrote on October 19, 2022 at 2:37 PM

#142,801

What is the value of having a ckpt output? Is it like dreambooth?

0x00groot OP t1_isxwwp7 wrote on October 19, 2022 at 2:43 PM

#142,854

Replying to thelastpizzaslice (#142,801)

Not right now. You need the model weights along with the optimised embeddings to get the results.
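
To illustrate why both pieces are needed: as described in the Imagic paper, the edited image is generated by conditioning the fine-tuned model on a linear blend of the target-prompt embedding and the optimised embedding. Below is a minimal sketch of that blend only, not the repo's actual code. The function name `interpolate_embeddings`, the parameter name `alpha`, and the toy 3-element vectors are assumptions made for illustration; real embeddings are tensors produced by the text encoder.

```python
from typing import List


def interpolate_embeddings(
    target: List[float], optimized: List[float], alpha: float
) -> List[float]:
    """Return alpha * target + (1 - alpha) * optimized, element-wise.

    Hypothetical helper: alpha = 0 gives the optimised embedding
    (closest to the input image), alpha = 1 gives the target-prompt
    embedding (closest to the requested edit).
    """
    if len(target) != len(optimized):
        raise ValueError("embeddings must have the same length")
    return [alpha * t + (1.0 - alpha) * o for t, o in zip(target, optimized)]


# Toy vectors standing in for real text embeddings (assumption).
target_emb = [1.0, 0.0, 2.0]
optimized_emb = [0.0, 1.0, 2.0]

# Sweep the blend factor; each blended embedding would be passed to the
# fine-tuned model in place of a normal prompt embedding.
for alpha in (0.0, 0.5, 1.0):
    print(alpha, interpolate_embeddings(target_emb, optimized_emb, alpha))
```

The blended embedding only makes sense together with the weights that were fine-tuned against the optimised embedding, which is why a ckpt on its own does not carry the full result.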

thelastpizzaslice t1_isxxsok wrote on October 19, 2022 at 2:49 PM

#142,913

Replying to 0x00groot (#142,854)

So, to use this, I run the colab, take the ckpt and also a pt that exists somewhere presumably, drop them into AUTOMATIC1111, and then I can pose a specific photo like it's a doll/restyle it at will in AUTOMATIC1111? Am I correct in this description?

0x00groot OP t1_isy024x wrote on October 19, 2022 at 3:04 PM

#143,069

Replying to thelastpizzaslice (#142,913)

Currently AUTOMATIC1111 doesn't support it. For now, you can use the inference code at the end of the Colab notebook to generate images.

LargeSackOfNuts t1_isy24fv wrote on October 19, 2022 at 3:18 PM

#143,221

Obamna

nmkd t1_isy2xtz wrote on October 19, 2022 at 3:24 PM

#143,280

Replying to 0x00groot (#142,204)

But not bitsandbytes, as far as I know.

danquandt t1_isy30dt wrote on October 19, 2022 at 3:24 PM

#143,289

How different is this in practice from running img2img on regular SD? The examples shown in the paper look very similar to what you would get from img2img, as far as I can tell.

(PS: great work on your repos! I still can't run Dreambooth on my 3080 10 GB, but I have played around with it in Colab and it's fantastic.)

thelastpizzaslice t1_isy86ca wrote on October 19, 2022 at 3:58 PM

#143,603

Replying to 0x00groot (#143,069)

I decided to copy and paste the model into AUTOMATIC1111 anyway. I made one based on a photo of Atul from Spiritfarer with a loose description of him as "uncle frog spirit person", and it's actually the single best cartoon generator I've ever worked with. I've spent dozens of hours trying to make these things, and this paper beat all of them by accident. What a time to be alive!

The author of this paper is apparently a genius who has built something better than TI or Dreambooth, and is massively understating his accomplishment.

Here are the three photos: #1 is standard, #2 is Dreambooth, #3 is Imagic.

This is Atul

thelastpizzaslice t1_isy9jey wrote on October 19, 2022 at 4:07 PM

#143,683

Replying to deep-yearning (#142,046)

Just copy the ckpt output and use similar terms. It worked for me.

0x00groot OP t1_isycsor wrote on October 19, 2022 at 4:29 PM

#143,907

Replying to thelastpizzaslice (#143,603)

Oh wow. That's really interesting. I'll have to look into it.

histin116 t1_iszecx6 wrote on October 19, 2022 at 8:29 PM

#146,515

Replying to danquandt (#143,289)

https://twitter.com/andrewb10687674/status/1582479603129466881

In this tweet, the author also claims that cycle diffusion takes about 1 minute, unlike Imagic, which takes at least 5 minutes.

thelastpizzaslice t1_it4lynp wrote on October 20, 2022 at 10:03 PM

#158,409

Does this use model v1.5 or is it still running on v1.4?

0x00groot OP t1_it5u08r wrote on October 21, 2022 at 3:37 AM

#161,615

Replying to thelastpizzaslice (#158,409)

You can specify which model to use with the MODEL_NAME variable.

campfirecrucifix t1_it9sqxa wrote on October 22, 2022 at 12:00 AM

#172,553

Oh my Obama. What large teeth you have.

HuWasHere t1_itafuvj wrote on October 22, 2022 at 3:11 AM

#174,472

Replying to danquandt (#143,289)

img2img, even at a high init image setting, doesn't necessarily respect the init image; this is far more precise. It's limited (to my knowledge) because it uses one input image, but the results are pretty incredible.

readyourSICP t1_itga0um wrote on October 23, 2022 at 12:55 PM

#188,177

Does this give the exact same output as running with 24 GB of VRAM?
