2 papers with code • 0 benchmarks • 0 datasets

Mean in-game score over 1000 episodes with random seeds not seen during training. See (Section 2.4 Evaluation Protocol) for details.


Greatest papers with code

BeBold: Exploration Beyond the Boundary of Explored Regions

maximecb/gym-minigrid 15 Dec 2020

In this paper, we analyze the pros and cons of each method and propose the regulated difference of inverse visitation counts as a simple but effective criterion for IR.

Curriculum Learning Efficient Exploration +1

The NetHack Learning Environment

facebookresearch/nle NeurIPS 2020

Here, we present the NetHack Learning Environment (NLE), a scalable, procedurally generated, stochastic, rich, and challenging environment for RL research based on the popular single-player terminal-based roguelike game, NetHack.

NetHack Score Systematic Generalization