sarabjeet_singh t1_j77t9zi wrote on February 4, 2023 at 7:03 PM

In the end, this technology is going to be a reflection of human history. That’s not a pretty thought. They’re literally modelled on us.

spiritus_dei OP t1_j77u2ic wrote on February 4, 2023 at 7:09 PM

That might be why RLHF (reinforcement learning from human feedback) is ultimately doomed to fail.