3 Jun 2016 • Hoang M. Le, Andrew Kang, Yisong Yue, Peter Carr
We study the problem of smooth imitation learning for online sequence prediction, where the goal is to train a policy that can smoothly imitate demonstrated behavior in a dynamic and continuous environment in response to online, sequential context input.