[D] Do we really need 100B+ parameters in a large language model?
Submitted by Vegetable-Skill-9700 t3_121a8p4 on March 25, 2023 at 4:14 AM in MachineLearning 84 comments 101
LeN3rd t1_jdls5jy wrote on March 25, 2023 at 10:06 AM
How big do models need to be before certain capabilities emerge? That is the actual question here, isn't it? Do smaller models perform as well on all tasks, or only on the ones they are trained for?