no code implementations • 6 Feb 2023 • Branton DeMoss, Paul Duckworth, Nick Hawes, Ingmar Posner
We propose DITTO, an offline imitation learning algorithm which uses world models and on-policy reinforcement learning to addresses the problem of covariate shift, without access to an oracle or any additional online interactions.