no code implementations • 9 Sep 2019 • Alberto Maria Metelli, Guglielmo Manneschi, Marcello Restelli
We study the problem of identifying the policy space of a learning agent, having access to a set of demonstrations generated by its optimal policy.