Viewing a single comment thread. View all comments

cfoster0 t1_izrdeii wrote

Who? Who's even using RLHF in production yet, besides OpenAI (and maybe Cohere)?

5

FerretDude t1_izs8wj1 wrote

Not allowed to share, many groups are looking into using RLHF in production though

−1

cfoster0 t1_izuxn52 wrote

Did y'all stop doing work out in the open? That's a shame. End of an era, I guess.

2

FerretDude t1_izyu3ka wrote

RLHF is a bit tricky because you have to either work with data vendors or groups that have access to feedback data. Eventually we'll rely more on crowd sourcing I think.

2