1 code implementation • 25 Jun 2017 • Jarryd Martin, Suraj Narayanan Sasikumar, Tom Everitt, Marcus Hutter
We present a new method for computing a generalised state visit-count, which allows the agent to estimate the uncertainty associated with any state.
Ranked #13 on Atari Games (Atari 2600 Montezuma's Revenge)
no code implementations • 2 Jun 2016 • Jarryd Martin, Tom Everitt, Marcus Hutter
Reinforcement learning (RL) is a general paradigm for studying intelligent behaviour, with applications ranging from artificial intelligence to psychology and economics.