Submitted by robotphilanthropist t3_zh2u3k in MachineLearning
zaptrem t1_izn4krn wrote
Reply to comment by FerretDude in [R] Illustrating Reinforcement Learning from Human Feedback (RLHF) by robotphilanthropist
Are there any plans to reproduce WebGPT as part of the InstructGPT reproduction seeing as ChatGPT appears to already have or will be receiving such functionality soon?
Viewing a single comment thread. View all comments