Submitted by RadioFreeAmerika t3_122ilav in singularity
FoniksMunkee t1_jdqt5ci wrote
Reply to comment by RadioFreeAmerika in Why is maths so hard for LLMs? by RadioFreeAmerika
Regarding 2. MS says - "We believe that the ... issue constitutes a more profound limitation."
They say: "...it seems that the autoregressive nature of the model
which forces it to solve problems in a sequential fashion sometimes poses a more profound difficulty that cannot be remedied simply by instructing the model to find a step by step solution" and "In short, the problem ... can be summarized as the model’s “lack of ability to plan ahead”."
Notably, MS did not provide a solution for this - and pointed at another paper by LeCun that suggests a non LLM model to solve the issue. Which is not super encouraging.
RadioFreeAmerika OP t1_jdr25uz wrote
So plugins I guess? Or completely integrating another model?
FoniksMunkee t1_jdr75f1 wrote
It’s not clear. The paper was very vague about it.
Viewing a single comment thread. View all comments