
wywywywy t1_j9apjs3 wrote

I had a 3070 with 8GB and I managed to run these locally through KoboldAI.

Meta OPT 2.7B
EleutherAI GPT-Neo 2.7B
BigScience Bloom 1.7B
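
For reference, here's a rough sketch of what loading one of those looks like outside the KoboldAI UI, using Hugging Face transformers in half precision (the model ID and generation settings are just examples, not exactly what I ran):

```python
# Minimal sketch: load a ~2.7B model in fp16 so it fits in ~8 GB of VRAM.
# Model name and sampling settings below are illustrative, not a prescription.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "facebook/opt-2.7b"  # or "EleutherAI/gpt-neo-2.7B", "bigscience/bloom-1b7"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.float16,  # half precision roughly halves memory vs. fp32
).to("cuda")

prompt = "Write the opening paragraph of a short story about a lighthouse keeper."
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=100, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```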

32

xrailgun t1_j9aq903 wrote

Did you test anything larger that wouldn't run?

Also, any comments on those so far? Good? Bad? Easy? Etc.?

4

wywywywy t1_j9ar2tk wrote

I did test larger models, but they didn't run. I can't remember which ones, probably GPT-J. I recently got a 3090, so I can load larger models now.

As for quality, my use case is simple (writing prompts to help with stories & articles), nothing sophisticated, and they worked well, until ChatGPT came along. I use ChatGPT instead now.

6

xrailgun t1_j9avboh wrote

Thanks!

I wish model publishers would indicate rough (V)RAM requirements...

4

wywywywy t1_j9b2kqu wrote

So, not scientific at all, but I've noticed that checkpoint file size * 0.6 is pretty close to the actual VRAM requirement for LLMs.

But you're right, it'd be nice to have a table handy.
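
In the meantime, the rule of thumb is trivial to script. Quick sketch below; it's just the same size-on-disk * 0.6 guess, nothing smarter, and the checkpoint paths are whatever you pass in:

```python
# Throwaway sketch of the heuristic: estimated VRAM = checkpoint size on disk * 0.6.
import os
import sys

def estimate_vram_gb(checkpoint_path: str, factor: float = 0.6) -> float:
    """Estimate VRAM (GB) needed to load a checkpoint, using size-on-disk * factor."""
    return os.path.getsize(checkpoint_path) / (1024 ** 3) * factor

if __name__ == "__main__":
    # Usage: python estimate_vram.py pytorch_model.bin another_model.bin ...
    print(f"{'checkpoint':<40} {'size (GB)':>10} {'est. VRAM (GB)':>15}")
    for path in sys.argv[1:]:
        size_gb = os.path.getsize(path) / (1024 ** 3)
        print(f"{os.path.basename(path):<40} {size_gb:>10.1f} {estimate_vram_gb(path):>15.1f}")
```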

11