
gamerx88 t1_j2vzjfx wrote

"An empirical analysis of compute-optimal large language model training" by Deepmind, suggesting that LLMs are over-parameterized or under-trained (insufficient data used in training).
