Reinforcement Learning from Human Feedback

Concept

AI training technique where human evaluators rate outputs to teach models preferred behavior

1 story