no code implementations • 26 Oct 2020 • Vicenc Rubies-Royo, Eric Mazumdar, Roy Dong, Claire Tomlin, S. Shankar Sastry
In this work we present a multi-armed bandit framework for online expert selection in Markov decision processes and demonstrate its use in high-dimensional settings.
1 code implementation • 1 Oct 2019 • David Fridovich-Keil, Vicenc Rubies-Royo, Claire J. Tomlin
Iterative linear-quadratic (ILQ) methods are widely used in the nonlinear optimal control community.
Systems and Control Computer Science and Game Theory Multiagent Systems Robotics Systems and Control
no code implementations • 19 Feb 2019 • Vicenc Rubies-Royo, Roberto Calandra, Dusan M. Stipanovic, Claire Tomlin
To use neural networks in safety-critical settings it is paramount to provide assurances on their runtime operation.