no code implementations • 15 Dec 2017 • Siddharthan Rajasekaran, Jinwei Zhang, Jie Fu
In this paper, we introduce the Non-parametric Behavior Clustering IRL algorithm to simultaneously cluster demonstrations and learn multiple reward functions from demonstrations that may be generated from more than one behaviors.
no code implementations • 7 Feb 2017 • Sri Ramana Sekharan, Ramkumar Natarajan, Siddharthan Rajasekaran
In this paper, we tackle the problem of transferring policy from multiple partially observable source environments to a partially observable target environment modeled as predictive state representation.