no code implementations • 8 Jul 2019 • Ashwin Balakrishna, Brijen Thananjeyan, Jonathan Lee, Felix Li, Arsh Zahed, Joseph E. Gonzalez, Ken Goldberg
Existing on-policy imitation learning algorithms, such as DAgger, assume access to a fixed supervisor.