no code implementations • ICLR 2020 • Tsung-Yen Yang, Justinian Rosca, Karthik Narasimhan, Peter J. Ramadge
We consider the problem of learning control policies that optimize a reward function while satisfying constraints due to considerations of safety, fairness, or other costs.
no code implementations • 20 Jun 2020 • Tsung-Yen Yang, Justinian Rosca, Karthik Narasimhan, Peter J. Ramadge
We consider the problem of reinforcement learning when provided with (1) a baseline control policy and (2) a set of constraints that the learner must satisfy.