blazejd OP t1_ix7k18e wrote
Reply to comment by asafaya in [D] Why do we train language models with next word prediction instead of some kind of reinforcement learning-like setup? by blazejd
What language models are doing is indeed modelling language distribution, but what ML community wants them to be doing and what is the end goal is to create a model that learns to understand and communicate with a language. You can see that by the ways we try to evaluate the language, for example asking it to solve math equations
asafaya t1_ix80sjx wrote
>What language models are doing is indeed modelling language distribution, but what ML community wants them to be doing and what is the end goal is to create a model that learns to understand and communicate with a language. You can see that by the ways we try to evaluate the language, for example asking it to solve math equations
I totally agree that this is happening in the ML community. I believe they will hit a wall soon. Probably in ~3-5 years.
Viewing a single comment thread. View all comments