no code implementations • 23 Feb 2020 • Gabriel I. Fernandez, Colin Togashi, Dennis W. Hong, Lin F. Yang
In this paper we propose a novel method that guarantees a stable region of attraction for the output of a policy trained in simulation, even for highly nonlinear systems.