no code implementations • 29 Oct 2019 • Somil Bansal, Andrea Bajcsy, Ellis Ratner, Anca D. Dragan, Claire J. Tomlin
We construct a new continuous-time dynamical system, where the inputs are the observations of human behavior, and the dynamics include how the belief over the model parameters change.
1 code implementation • 10 Sep 2019 • David Fridovich-Keil, Ellis Ratner, Anca D. Dragan, Claire J. Tomlin
We benchmark our method in a three-player general-sum simulated example, in which it takes < 0. 75 s to identify a solution and < 50 ms to solve warm-started subproblems in a receding horizon.
Systems and Control Robotics Systems and Control
no code implementations • ICLR 2019 • Kelvin Xu, Ellis Ratner, Anca Dragan, Sergey Levine, Chelsea Finn
A significant challenge for the practical application of reinforcement learning toreal world problems is the need to specify an oracle reward function that correctly defines a task.
no code implementations • 7 Jun 2018 • Ellis Ratner, Dylan Hadfield-Menell, Anca D. Dragan
Designing a good reward function is essential to robot planning and reinforcement learning, but it can also be challenging and frustrating.
no code implementations • 31 May 2018 • Kelvin Xu, Ellis Ratner, Anca Dragan, Sergey Levine, Chelsea Finn
A significant challenge for the practical application of reinforcement learning in the real world is the need to specify an oracle reward function that correctly defines a task.