Why are bigger transformer models better learners? Submitted by begooboi t3_119zmpd on February 23, 2023 at 2:56 PM in deeplearning 15 comments 7
AnDaoLe t1_j9ul0cf wrote on February 24, 2023 at 5:48 PM There are a bunch of papers showing that large neural networks are actually just memorizing data as well.