ok531441 t1_jb0229i wrote on March 5, 2023 at 11:40 AM

Why would RL be doomed? Didn’t sticking RL on top of a big GPT model just give us ChatGPT?

ggdupont t1_jb152am wrote on March 5, 2023 at 5:19 PM

That's the cherry on the top (see https://twitter.com/hlntnr/status/1632030583462285312 ), not the core of the app.

(edit in reaction to downvotes: in all transparency, I love the RL paradigm and really think these decision-making approaches are key to AI; that being said, my experience with industrial applications of RL has always been disappointing, in that other approaches did better ;-) )