Viewing a single comment thread. View all comments

LeN3rd t1_jdls5jy wrote

How big do models need to be until certain capabilities emerge? That is the actual question here, isn't it? Do smaller models perform as well in all tasks, or just the one they are trained for?

2