no code implementations • 30 Nov 2017 • Himanshu Sahni, Saurabh Kumar, Farhan Tejani, Charles Isbell
We present a differentiable framework capable of learning a wide variety of compositions of simple policies, which we call skills.
no code implementations • 24 May 2017 • Himanshu Sahni, Saurabh Kumar, Farhan Tejani, Yannick Schroecker, Charles Isbell
To address this issue, we develop a framework through which a deep RL agent learns to generalize policies from smaller, simpler domains to more complex ones using a recurrent attention mechanism.