[P] vanilla-llama: a hackable plain-pytorch implementation of LLaMA that can be run on any system (if you have enough resources)
Submitted by poppear t3_11ozl85 on March 12, 2023 at 12:07 AM in MachineLearning 8 comments 83
SpaceCockatoo t1_jbz2mns wrote on March 12, 2023 at 8:54 PM
Any plans to implement 4/8-bit quantization?

poppear OP t1_jbzhgfh wrote on March 12, 2023 at 10:39 PM
I was thinking about it. It shouldn't be too hard; I will probably give it a try as soon as I have some spare time 😀