Submitted by spiritus_dei t3_10tlh08 in MachineLearning
sarabjeet_singh t1_j77t9zi wrote
In the end, this technology is going to be a reflection of human history. That’s not a pretty thought. They’re literally modelled on us.
spiritus_dei OP t1_j77u2ic wrote
That might be why RLHF (reinforcement learning from human feedback) is ultimately doomed to fail.