Submitted by OkOkPlayer t3_zdfrnw in MachineLearning
I have a fine-tuned Stable Diffusion model and would like to host it to make it publicly available. Both options (GPU and CPU) seem problematic:
- I can't find a "cheap" GPU hosting platform. AWS etc. are all > $200 per month and have no serverless option (the only serverless offering I found is banana.dev, which seems relatively inflexible).
- CPU seems too slow for inference.
I am currently running the model on my notebook's CPU at 35 s/it, which is way too slow: at ~100 inference steps that's nearly an hour per image, while my target of < 60 s for ~100 steps would need roughly 0.6 s/it. Is it possible to host Stable Diffusion on CPU with close-to-real-time responses (< 60 s for ~100 inference steps), or is there a "cheap" GPU hosting platform I haven't found yet?
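For concreteness, here is a minimal sketch of what my CPU inference looks like, assuming the fine-tune is in Hugging Face diffusers format (`my-finetuned-sd` is a placeholder id, not my actual checkpoint):

```python
# Minimal CPU inference sketch with Hugging Face diffusers.
import torch
from diffusers import StableDiffusionPipeline, DPMSolverMultistepScheduler

pipe = StableDiffusionPipeline.from_pretrained(
    "my-finetuned-sd",          # placeholder for the actual fine-tuned checkpoint
    torch_dtype=torch.float32,  # CPUs generally need float32, not float16
)
# A faster scheduler (DPM-Solver++) reaches comparable quality in ~20-25 steps
# instead of ~100, which matters when every step costs seconds on CPU.
pipe.scheduler = DPMSolverMultistepScheduler.from_config(pipe.scheduler.config)
pipe = pipe.to("cpu")

image = pipe("a photo of an astronaut riding a horse",
             num_inference_steps=25).images[0]
image.save("out.png")
```

Even with the step count cut to 25, at my measured 35 s/it that's still ~15 minutes per image, so the step-count trick alone doesn't get me anywhere near the < 60 s target.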
MarkusDeNeutoy t1_iz1dpfj wrote
You may like https://replicate.com/.
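You push your model there and it runs per-second-billed on their GPUs, so you only pay for actual inference. As a rough sketch of what calling a pushed model looks like with their Python client (the model slug and version hash below are placeholders; it requires `pip install replicate` and a `REPLICATE_API_TOKEN` in the environment):

```python
# Hedged sketch of invoking a model hosted on Replicate via its Python client.
import replicate

output = replicate.run(
    "your-username/your-finetuned-sd:version-hash",  # placeholder identifier
    input={"prompt": "a photo of an astronaut riding a horse"},
)
print(output)  # for Stable Diffusion models, typically a list of image URLs
```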