9 Mar 2024 • Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca Dragan, Erdem Biyik
Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task.