Behaviour Policies

Go-Explore

Introduced by Ecoffet et al. in Go-Explore: a New Approach for Hard-Exploration Problems

Go-Explore is a family of algorithms that aims to tackle two obstacles to effective exploration in reinforcement learning: algorithms forgetting how to reach previously visited states ("detachment") and failing to first return to a state before exploring from it ("derailment").

To avoid detachment, Go-Explore builds an archive of the different states it has visited in the environment, ensuring that states cannot be forgotten. The archive starts with only the initial state and is built iteratively; in each iteration, Go-Explore (sketched in code below):

(a) Probabilistically select a state from the archive, preferring states associated with promising cells.

(b) Return to the selected state, such as by restoring simulator state or by running a goal-conditioned policy.

(c) Explore from that state by taking random actions or sampling from a trained policy.

(d) Map every state encountered while returning and exploring to a low-dimensional cell representation (a sketch of one such mapping follows this list).

(e) Add states that map to new cells to the archive, and update an existing entry when its cell has been reached via a better (e.g. higher-scoring or shorter) trajectory.
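Step (d) requires a cell function that collapses many similar states into the same archive key. For image observations, the paper uses a downscaled, quantised copy of the frame; the snippet below is a minimal sketch of that idea, assuming 8-bit pixel intensities in [0, 255]. The output resolution and number of grey levels here are illustrative placeholders, not tuned values.

```python
import numpy as np

def to_cell(observation, shape=(11, 8), levels=8):
    """Map an image observation to a coarse, hashable cell key.

    Downscales the frame and quantises intensities so that visually
    similar states fall into the same cell.
    """
    frame = np.asarray(observation, dtype=np.float32)
    if frame.ndim == 3:
        frame = frame.mean(axis=-1)  # greyscale by averaging channels

    # Naive downscaling: average over equally sized blocks.
    rows, cols = shape
    h, w = frame.shape
    frame = frame[: h - h % rows, : w - w % cols]
    blocks = frame.reshape(rows, frame.shape[0] // rows, cols, frame.shape[1] // cols)
    small = blocks.mean(axis=(1, 3))

    # Quantise to a few intensity levels and return a hashable key.
    quantised = np.floor(small / 256.0 * levels).astype(np.uint8)
    return quantised.tobytes()
```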

Source: Go-Explore: a New Approach for Hard-Exploration Problems
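Putting steps (a)-(e) together, the following is a hedged sketch of one possible archive-building loop, assuming a Gym-style environment interface (env.reset(), env.step(), env.action_space.sample()), the to_cell function above, and return-by-replaying-actions as the "go" mechanism. The selection weighting and update rule are simplified illustrations, not the paper's exact heuristics.

```python
import random
from collections import namedtuple

# One archive entry per cell: how to get back there and how promising it is.
Entry = namedtuple("Entry", ["trajectory", "score", "visits"])

def go_explore(env, to_cell, iterations=1000, explore_steps=100):
    """Sketch of the Go-Explore archive loop (return-by-replay variant)."""
    obs = env.reset()
    archive = {to_cell(obs): Entry(trajectory=[], score=0.0, visits=0)}

    for _ in range(iterations):
        # (a) Probabilistically select a cell, preferring less-visited ones.
        cells = list(archive)
        weights = [1.0 / (1.0 + archive[c].visits) for c in cells]
        cell = random.choices(cells, weights=weights, k=1)[0]
        archive[cell] = archive[cell]._replace(visits=archive[cell].visits + 1)

        trajectory, score, done = [], 0.0, False

        def update_archive(c):
            # (e) Add new cells; overwrite an entry when this visit reached
            #     the cell with a higher score (ties: shorter trajectory).
            old = archive.get(c)
            if old is None or score > old.score or (
                score == old.score and len(trajectory) < len(old.trajectory)
            ):
                archive[c] = Entry(list(trajectory), score, old.visits if old else 0)

        # (b) Return to the selected state by replaying its stored trajectory
        #     (restoring simulator state or a goal-conditioned policy also works).
        obs = env.reset()
        for action in archive[cell].trajectory:
            if done:
                break
            obs, reward, done, _ = env.step(action)
            trajectory.append(action)
            score += reward
            update_archive(to_cell(obs))  # (d) map states seen while returning

        # (c) Explore from that state with random actions.
        for _ in range(explore_steps):
            if done:
                break
            action = env.action_space.sample()
            obs, reward, done, _ = env.step(action)
            trajectory.append(action)
            score += reward
            update_archive(to_cell(obs))  # (d) map states seen while exploring

    return archive
```

Replaying the stored trajectory only works in deterministic (or resettable) simulators; when that assumption fails, the "return" step is instead performed by a goal-conditioned policy trained to reach the selected cell.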
