Why bigger transformer models are better learners? Submitted by begooboi t3_119zmpd on February 23, 2023 at 2:56 PM in deeplearning 15 comments 7