1 code implementation • 9 Feb 2022 • Valentin Villecroze, Harry J. Braviner, Panteha Naderian, Chris J. Maddison, Gabriel Loaiza-Ganem
Skills or low-level policies in reinforcement learning are temporally extended actions that can speed up learning and enable complex behaviours.
1 code implementation • ICLR 2021 • Panteha Naderian, Gabriel Loaiza-Ganem, Harry J. Braviner, Anthony L. Caterini, Jesse C. Cresswell, Tong Li, Animesh Garg
In order to address these limitations, we introduce the concept of cumulative accessibility functions, which measure the reachability of a goal from a given state within a specified horizon.
1 code implementation • 8 Apr 2020 • Shakti Kumar, Jerrod Parker, Panteha Naderian
In this work we first partially replicate the results shown in Stabilizing Transformers in RL on both reactive and memory based environments.
Partially Observable Reinforcement Learning reinforcement-learning +1