mtocrat

mtocrat t1_j6zk1ka wrote

Let's say your initial model is quite racist and outputs only extremely or moderately racist choices. If you rank those against each other and do supervised training on that dataset you train it to mimic the moderately racist style. You might however plausibly train a model from this that can judge what racism is and extrapolate to judge answers free of it to be even better. Then you optimize with respect to that model to get that style

2

mtocrat t1_j4zecpm wrote

What you're describing is a general approach to RL that is used in different forms in many methods: sample actions, weight or rank them in some way by the estimated return, regress to the weighted actions. So you're not suggesting to do something other than RL but to replace one RL approach with a different RL approach.

2

mtocrat t1_j17ybgf wrote

It's seen as something big because what we observe in the responses implies a level of reasoning that is much greater than what we expected. It seems that operating in the language domain allows you to make use of that in a way that currently can't be done with other methods. The older systems you describe cannot do this and were therefore less interesting, although it's worth noting that plenty of current users prefer to operate their devices with text based inputs using Siri, Alexa or Google Assistant.

3

mtocrat t1_iymi8i7 wrote

You could already tape together a deep learning solution consisting of neural speech recognition, an LLM and Wavenet. Counts as a deep learning solution in my book. I'm not sure if anyone has built an end-to-end solution and I expect it would be worse, but I'm sure if someone put their mind and money to it you'd get decent results

2