no code implementations • 18 Jan 2024 • Hee-Jun Ahn, Seong-Woong Shim, Byung-Jun Lee
In offline imitation learning (IL), we generally assume only a handful of expert trajectories and a supplementary offline dataset from suboptimal behaviors to learn the expert policy.