wrossmorrow t1_je7vy2p wrote on March 30, 2023 at 1:22 AM Reply to comment by ustainbolt in [D] Training a 65b LLaMA model by Business-Lead2679 +1 for lambda labs Permalink Parent 8
wrossmorrow t1_jdmsbvf wrote on March 25, 2023 at 3:46 PM Reply to comment by shanereid1 in [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700 Probably related https://arxiv.org/abs/2106.09685 Permalink Parent 1
wrossmorrow t1_je7vy2p wrote
Reply to comment by ustainbolt in [D] Training a 65b LLaMA model by Business-Lead2679
+1 for lambda labs