1 code implementation • 9 Feb 2022 • Valentin Villecroze, Harry J. Braviner, Panteha Naderian, Chris J. Maddison, Gabriel Loaiza-Ganem
Skills or low-level policies in reinforcement learning are temporally extended actions that can speed up learning and enable complex behaviours.
1 code implementation • ICLR 2021 • Panteha Naderian, Gabriel Loaiza-Ganem, Harry J. Braviner, Anthony L. Caterini, Jesse C. Cresswell, Tong Li, Animesh Garg
In order to address these limitations, we introduce the concept of cumulative accessibility functions, which measure the reachability of a goal from a given state within a specified horizon.