RILe: Reinforced Imitation Learning

No code implementations · 12 Jun 2024 · Mert Albaba, Sammy Christen, Thomas Langarek, Christoph Gebhardt, Otmar Hilliges, Michael J. Black

The trainer optimizes for long-term cumulative rewards from the discriminator, enabling it to provide nuanced feedback that accounts for the complexity of the task and the student's current capabilities.

Computational Efficiency, Imitation Learning

