no code implementations • 25 Mar 2024 • Sadanand Modak, Noah Patton, Isil Dillig, Joydeep Biswas
This paper addresses the problem of preference learning, which aims to learn user-specific preferences (e.g., "good parking spot", "convenient drop-off location") from visual input.
no code implementations • 14 Jun 2021 • Noah Patton, Jihwan Jeong, Michael Gimelfarb, Scott Sanner
Directly optimizing this empirical objective end to end yields what is called the risk-averse straight-line plan, which commits to a sequence of actions in advance and can be sub-optimal in highly stochastic domains.
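
To make the idea of a straight-line plan concrete, the following is a minimal sketch under stated assumptions, not the authors' method. The excerpt does not name the risk measure or the domain, so the sketch assumes an empirical lower-tail CVaR as the risk-averse objective and a hypothetical 1-D toy domain, and it swaps the end-to-end optimization for brute-force enumeration over all action sequences. The names `rollout_return`, `empirical_cvar`, and `best_straight_line_plan` are illustrative only.

```python
import itertools
import random


def rollout_return(plan, noise, start=0.0, goal=2.0):
    """Return of one rollout. The plan is fixed up front and never reacts to the state."""
    x, total = start, 0.0
    for action, eps in zip(plan, noise):
        x = x + action + eps      # hypothetical toy dynamics: move, then get perturbed
        total -= abs(x - goal)    # per-step cost: distance to the goal
    return total


def empirical_cvar(returns, alpha=0.2):
    """Mean of the worst alpha-fraction of sampled returns (assumed risk measure)."""
    k = max(1, int(alpha * len(returns)))
    return sum(sorted(returns)[:k]) / k


def best_straight_line_plan(horizon=3, n_samples=200, alpha=0.2, seed=0):
    """Pick the fixed action sequence that maximizes the empirical CVaR of the return."""
    rng = random.Random(seed)
    # The same noise samples are reused for every candidate plan (common random numbers).
    noises = [[rng.gauss(0.0, 0.5) for _ in range(horizon)] for _ in range(n_samples)]
    best_plan, best_value = None, float("-inf")
    for plan in itertools.product((-1, 0, 1), repeat=horizon):
        returns = [rollout_return(plan, noise) for noise in noises]
        value = empirical_cvar(returns, alpha)
        if value > best_value:
            best_plan, best_value = plan, value
    return best_plan, best_value


plan, value = best_straight_line_plan()
print(plan, value)
```

Because the chosen sequence is committed before any noise is observed, it cannot correct for an unlucky early step, which is the source of the sub-optimality in highly stochastic domains that the abstract mentions.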