Submitted by robotphilanthropist t3_zh2u3k in MachineLearning
FerretDude t1_izyu3ka wrote
Reply to comment by cfoster0 in [R] Illustrating Reinforcement Learning from Human Feedback (RLHF) by robotphilanthropist
RLHF is a bit tricky because you have to either work with data vendors or groups that have access to feedback data. Eventually we'll rely more on crowd sourcing I think.
Viewing a single comment thread. View all comments