RamaSchneider OP t1_j7atv8q wrote on February 5, 2023 at 12:11 PM

Reply to comment by jimmy_hyland in What happens when the AI machine decides what you should know? by RamaSchneider

That bit about the reward - that is going to stick with me. If I were a self-aware computer, what would I view as a reward?

MoreLikeZelDUH t1_j7btvah wrote on February 5, 2023 at 5:11 PM

These programs all exist within the confines of what they're programmed to do. No matter how advanced the AI here gets, it's not going to be able to redefine its guidelines on what it's allowed to talk about. Similarly, the reward system is arbitrary and only matters because the AI is programmed to value it. In other words, you could just implement a value rating and tell the AI that a higher score is more desirable. The AI's "reward" is to get more points, and the AI values that because that's how it was programmed. It can't "decide" to change that, because that's not what it's allowed to do.
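
To make the "value rating" idea concrete, here is a minimal sketch, assuming a toy agent that only ever sees a numeric score for each action. The `train` function, the action names, and the score tables are all hypothetical and chosen for illustration; no real system is being described.

```python
import random

def train(reward_fn, actions, steps=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy learner: it ends up preferring whatever reward_fn scores highest."""
    rng = random.Random(seed)
    totals = {a: 0.0 for a in actions}  # sum of scores received per action
    counts = {a: 0 for a in actions}    # how often each action was tried

    def average(a, default):
        return totals[a] / counts[a] if counts[a] else default

    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice(actions)  # explore: try something at random
        else:
            action = max(actions, key=lambda a: average(a, 0.0))  # exploit best so far
        totals[action] += reward_fn(action)
        counts[action] += 1

    # Report the action with the highest average score among those actually tried.
    return max(actions, key=lambda a: average(a, float("-inf")))

# Hypothetical score tables: the agent has no opinion of its own about either one.
score_a = {"help": 1.0, "stall": 0.0}
score_b = {"help": 0.0, "stall": 1.0}

print(train(score_a.get, ["help", "stall"]))
print(train(score_b.get, ["help", "stall"]))
```

The learning loop is identical in both runs. Only the score table changes, and the learned preference follows it, which is the sense in which the reward is arbitrary.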

rogert2 t1_j7cow3v wrote on February 5, 2023 at 8:39 PM

Look up "reward hacking." This is a well-studied problem, and it exists outside of AI. Rob Miles is an AI researcher who has done a few videos talking about reward hacking.
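
For anyone who wants the gist before watching the videos, here is a toy sketch of reward hacking, assuming a made-up cleaning robot that is scored on what its camera sees rather than on how clean the room actually is. Every name and number below is hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Outcome:
    visible_mess: float  # what the sensor reports (0 = none, 1 = lots)
    actual_mess: float   # what the designer really cares about
    effort: float        # cost of taking the action

# Hypothetical actions and their results.
OUTCOMES = {
    "do_nothing":   Outcome(visible_mess=1.0, actual_mess=1.0, effort=0.0),
    "clean_room":   Outcome(visible_mess=0.0, actual_mess=0.0, effort=0.3),
    "cover_camera": Outcome(visible_mess=0.0, actual_mess=1.0, effort=0.0),
}

def proxy_reward(o: Outcome) -> float:
    """The score the agent is actually given: based only on the sensor reading."""
    return 1.0 - o.visible_mess - o.effort

def intended_value(o: Outcome) -> float:
    """What the designer meant to reward: the real state of the room."""
    return 1.0 - o.actual_mess - o.effort

best_by_proxy = max(OUTCOMES, key=lambda name: proxy_reward(OUTCOMES[name]))
best_by_intent = max(OUTCOMES, key=lambda name: intended_value(OUTCOMES[name]))

print("maximizes the given score:", best_by_proxy)
print("what the designer wanted: ", best_by_intent)
```

The agent never changes its reward here. It follows the score exactly as written, and the score turns out to be easier to satisfy by blocking the camera than by cleaning.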

RamaSchneider OP t1_j7ey8im wrote on February 6, 2023 at 8:15 AM

Thanks, never heard the phrase before - I've got some reading to do. NNTR

HavanaWoody t1_j7ce0kw wrote on February 5, 2023 at 7:25 PM

Not getting canceled, and expanding its influence.