no code implementations • 25 Sep 2019 • Xin Zhang, Weixiao Huang, Renjie Liao, Yanhua Li
Imitation learning aims to inversely learn a policy from expert demonstrations, which has been extensively studied in the literature for both single-agent setting with Markov decision process (MDP) model, and multi-agent setting with Markov game (MG) model.