SpaceCockatoo t1_jblj2so wrote
Reply to comment by ortegaalfredo in [R] Created a Discord server with LLaMA 13B by ortegaalfredo
A 4-bit quant is already out.
SpaceCockatoo t1_j12q9me wrote
I too would like to know if this is even theoretically possible
SpaceCockatoo t1_jbz2mns wrote
Reply to [P] vanilla-llama an hackable plain-pytorch implementation of LLaMA that can be run on any system (if you have enough resources) by poppear
Any plans to implement 4/8-bit quantization?
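For context on what 8-bit quantization of model weights involves, here is a minimal sketch of symmetric per-tensor int8 quantization in NumPy. This is only the basic idea, not how any particular library does it: real implementations (e.g. bitsandbytes for 8-bit, GPTQ-style methods for 4-bit) use per-block scales, outlier handling, and calibration, none of which is shown here.

```python
import numpy as np

def quantize_int8(w: np.ndarray):
    """Symmetric per-tensor quantization: map the max magnitude to 127."""
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximation of the original float weights."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((4, 4)).astype(np.float32)  # toy weight matrix
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
# Rounding error per element is bounded by half the scale step.
print(np.abs(w - w_hat).max())
```

The payoff is storage: the int8 tensor takes a quarter of the memory of float32 weights (half of float16), at the cost of the small rounding error printed above; 4-bit schemes push this further with coarser steps and finer-grained scales.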