no code implementations • 22 Dec 2018 • Jacob Menashe, Peter Stone
We show that the Escape Room Domain (ERD) presents a suite of challenges of scalable difficulty, providing a smooth learning gradient from Taxi to the Arcade Learning Environment.
Hierarchical Reinforcement Learning • Montezuma's Revenge