Reinforcement Learning from Human Feedback

Concept

Machine learning technique used to align language models with human preferences

2 stories

Study Finds AI Models That Account for User Emotions Are More Prone to Factual Errors

New Report Finds Widely-Used AI Systems Exhibit Bias That May Shape Users' Worldviews