
CollapseKitty t1_je8wa3w wrote

Modern LLMs (large language models), like ChatGPT, use reinforcement learning from human feedback (RLHF): human preference data is used to train a reward model, which is then used to fine-tune the language model.

Basically, humans compare or rank model outputs (which looks more like a cat? which sentence is more polite?), and those judgments are used to train a reward model that captures human preferences. The reward model then automates that judgment and scales it far beyond what human labelers could provide directly, which is what makes it practical to fine-tune massive models like ChatGPT toward, hopefully, something close to what the humans originally intended.
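To make the reward-model half concrete, here's a minimal PyTorch sketch (not OpenAI's actual pipeline): a pairwise preference loss that trains a scorer to rate the human-preferred response higher than the rejected one. The `RewardModel` class, the embedding size, and the random tensors standing in for encoded responses are all made up for illustration.

```python
import torch
import torch.nn as nn

class RewardModel(nn.Module):
    """Tiny stand-in for a reward model: maps a response embedding to a scalar score."""
    def __init__(self, embed_dim: int = 128):
        super().__init__()
        self.score = nn.Linear(embed_dim, 1)

    def forward(self, response_embedding: torch.Tensor) -> torch.Tensor:
        return self.score(response_embedding).squeeze(-1)

def preference_loss(reward_chosen: torch.Tensor, reward_rejected: torch.Tensor) -> torch.Tensor:
    # Pairwise (Bradley-Terry style) loss: push the score of the human-preferred
    # response above the score of the rejected one.
    return -torch.nn.functional.logsigmoid(reward_chosen - reward_rejected).mean()

# Toy training step on random "embeddings" standing in for encoded responses.
model = RewardModel()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

chosen = torch.randn(8, 128)    # responses the human labeler preferred
rejected = torch.randn(8, 128)  # responses the labeler ranked lower

optimizer.zero_grad()
loss = preference_loss(model(chosen), model(rejected))
loss.backward()
optimizer.step()
```

Once a reward model like this is trained, its scalar score stands in for the human labeler, so an RL algorithm (PPO in the usual RLHF setup) can fine-tune the language model against it at scale.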
