1 code implementation • 24 Feb 2020 • Subin Huh, Insoon Yang
Our approach constructs a Lyapunov function with respect to a safe policy to restrain each policy improvement stage.
Autonomous Driving Continuous Control +4