no code implementations • 2 Mar 2023 • Peter Barnett, Rachel Freedman, Justin Svegliato, Stuart Russell
Reward learning algorithms utilize human feedback to infer a reward function, which is then used to train an AI system.