no code implementations • 5 Oct 2017 • Suraj Narayanan Sasikumar
Function approximation techniques enable RL agents to generalize in order to estimate the value of unvisited states, but at present few methods have achieved generalization about the agent's uncertainty regarding unvisited states.
1 code implementation • 25 Jun 2017 • Jarryd Martin, Suraj Narayanan Sasikumar, Tom Everitt, Marcus Hutter
We present a new method for computing a generalised state visit-count, which allows the agent to estimate the uncertainty associated with any state.
Ranked #13 on Atari Games on Atari 2600 Montezuma's Revenge