Count-Based Exploration with the Successor Representation

In this paper we introduce a simple approach for exploration in reinforcement learning (RL) that allows us to develop theoretically justified algorithms in the tabular case and that also extends to settings where function approximation is required. Our approach is based on the successor representation (SR), which was originally introduced as a representation defining state generalization by the similarity of successor states. Here we show that the norm of the SR, while it is being learned, can be used as a reward bonus to incentivize exploration. In order to better understand this transient behavior of the norm of the SR, we introduce the substochastic successor representation (SSR) and show that it implicitly counts the number of times each state (or feature) has been observed. We use this result to introduce an algorithm that performs as well as some theoretically sample-efficient approaches. Finally, we extend these ideas to a deep RL algorithm and show that it achieves state-of-the-art performance on Atari 2600 games in a low sample-complexity regime.
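As a rough illustration of the tabular idea described above, the sketch below learns the successor representation with a TD(0) update and uses the inverse norm of the SR row for a state as an exploration bonus. The specific bonus form (beta / ||psi(s)||_1), the toy state-space size, and all hyperparameter values are assumptions chosen for illustration; this is not the authors' exact algorithm.

```python
import numpy as np

# Minimal tabular sketch (assumptions, not the paper's exact method):
# the SR matrix psi is learned online by TD(0); while a state is rarely
# visited its SR row stays small, so the inverse of its norm acts as a
# count-like exploration bonus.

n_states = 10        # hypothetical small MDP
gamma_sr = 0.99      # discount used for the SR
alpha = 0.1          # SR learning rate
beta = 0.05          # bonus scale (illustrative value)

psi = np.zeros((n_states, n_states))   # SR initialized at zero

def sr_td_update(s, s_next):
    """One TD(0) update of the successor representation row for state s."""
    one_hot = np.eye(n_states)[s]
    target = one_hot + gamma_sr * psi[s_next]
    psi[s] += alpha * (target - psi[s])

def exploration_bonus(s, eps=1e-8):
    """Large while ||psi(s)||_1 is still small, i.e. s has rarely been visited."""
    return beta / (np.linalg.norm(psi[s], ord=1) + eps)

# Usage inside an agent's step: after observing (s, a, r, s_next), augment the
# extrinsic reward with the bonus before the value-function update, e.g.
#   r_total = r + exploration_bonus(s_next)
#   sr_td_update(s, s_next)
```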

ICLR 2019

Results from the Paper


| Task        | Dataset                        | Model      | Metric Name | Metric Value | Global Rank |
|-------------|--------------------------------|------------|-------------|--------------|-------------|
| Atari Games | Atari 2600 Freeway             | DQNMMCe    | Score       | 29.5         | #35         |
| Atari Games | Atari 2600 Gravitar            | DQNMMCe    | Score       | 1078.3       | #22         |
| Atari Games | Atari 2600 Montezuma's Revenge | DQNMMCe+SR | Score       | 1778.6       | #20         |
| Atari Games | Atari 2600 Montezuma's Revenge | DQN+SR     | Score       | 1778.8       | #19         |
| Atari Games | Atari 2600 Private Eye         | DQNMMCe+SR | Score       | 99.1         | #47         |
| Atari Games | Atari 2600 Solaris             | DQNMMCe    | Score       | 2244.6       | #20         |
| Atari Games | Atari 2600 Venture             | DQNMMCe+SR | Score       | 1241.8       | #16         |
