1 code implementation • NeurIPS 2019 • Jack Umenberger, Mina Ferizbegovic, Thomas B. Schön, Håkan Hjalmarsson
This paper concerns the problem of learning control policies for an unknown linear dynamical system to minimize a quadratic cost function.
reinforcement-learning Reinforcement Learning (RL)