blazejd OP t1_ix7k18e wrote on November 21, 2022 at 10:17 AM

Reply to comment by asafaya in [D] Why do we train language models with next word prediction instead of some kind of reinforcement learning-like setup? by blazejd

What language models are doing is indeed modelling language distribution, but what ML community wants them to be doing and what is the end goal is to create a model that learns to understand and communicate with a language. You can see that by the ways we try to evaluate the language, for example asking it to solve math equations

asafaya t1_ix80sjx wrote on November 21, 2022 at 1:36 PM

>What language models are doing is indeed modelling language distribution, but what ML community wants them to be doing and what is the end goal is to create a model that learns to understand and communicate with a language. You can see that by the ways we try to evaluate the language, for example asking it to solve math equations

I totally agree that this is happening in the ML community. I believe they will hit a wall soon. Probably in ~3-5 years.