pm_me_your_pay_slips t1_j7l6icx wrote

Note that the VQ-VAE part of the SD model alone can encode and decode arbitrary natural or human-made images pretty well, with very few artifacts. The diffusion model part of SD learns a distribution over images in that encoded (latent) space.

orbital_lemon t1_j7lel1d wrote

The diffusion model weights are the part at issue, no? The question is whether you can squeeze infringing content out of those weights to feed to the VAE.
