Submitted by vidul7498 t3_11itl7g in MachineLearning
ok531441 t1_jb0229i wrote
Why would RL be doomed? Didn’t sticking RL on top of a big GPT model just give us ChatGPT?
ggdupont t1_jb152am wrote
That's the cherry on top (see https://twitter.com/hlntnr/status/1632030583462285312), not the core of the app.
(Edit in reaction to downvotes: in all transparency, I love the RL paradigm and really think decision-making approaches are a key to AI. That said, my experience with industrial applications of RL has always been disappointing, in that other approaches did better ;-) )