
hassan789_ t1_jb7hzjx wrote

Lack of quality information. There's an estimated maximum of ~12 trillion high-quality tokens for LLMs to learn from. After that, returns could diminish (maybe 10% new quality information is added per year). Right now, the largest models are trained on ~1T tokens.
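
A quick back-of-the-envelope sketch of what that implies, assuming the figures above (~12T-token pool, ~10% yearly growth in quality data, ~1T-token training runs today) plus a hypothetical 2x yearly scale-up in training-set size, which is my assumption, not a claim from the comment:

```python
# Sketch: how many years until training demand overtakes the quality-data pool,
# under the hedged assumptions stated above.
pool = 12e12      # estimated high-quality tokens available now (~12T)
growth = 0.10     # assumed yearly growth of the quality pool (~10%)
train = 1e12      # tokens used by today's largest training runs (~1T)
scale_up = 2.0    # hypothetical yearly growth in training-set size

years = 0
while train < pool:
    years += 1
    pool *= 1 + growth    # pool grows slowly
    train *= scale_up     # demand grows fast

print(f"Under these assumptions, demand overtakes the pool in ~{years} years.")
# -> ~4 years with the numbers above
```

Change `scale_up` to test other growth rates; the point is just that exponential demand against ~10% supply growth exhausts the pool quickly.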
