no code implementations • 26 Feb 2019 • Tom Zahavy, Avinatan Hasidim, Haim Kaplan, Yishay Mansour
We consider a settings of hierarchical reinforcement learning, in which the reward is a sum of components.
Hierarchical Reinforcement Learning reinforcement-learning +2
no code implementations • 13 Mar 2018 • Tom Zahavy, Avinatan Hasidim, Haim Kaplan, Yishay Mansour
In this work, we provide theoretical guarantees for reward decomposition in deterministic MDPs.
Hierarchical Reinforcement Learning reinforcement-learning +2