Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning

The rapid pace of recent research in AI has been driven in part by the presence of fast and challenging simulation environments. These environments often take the form of games; with tasks ranging from simple board games, to competitive video games. We propose a new benchmark - Obstacle Tower: a high fidelity, 3D, 3rd person, procedurally generated environment. An agent playing Obstacle Tower must learn to solve both low-level control and high-level planning problems in tandem while learning from pixels and a sparse reward signal. Unlike other benchmarks such as the Arcade Learning Environment, evaluation of agent performance in Obstacle Tower is based on an agent's ability to perform well on unseen instances of the environment. In this paper we outline the environment and provide a set of baseline results produced by current state-of-the-art Deep RL methods as well as human players. These algorithms fail to produce agents capable of performing near human level.

PDF Abstract

Datasets


Introduced in the Paper:

Obstacle Tower

Used in the Paper:

Arcade Learning Environment
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
General Reinforcement Learning Obstacle Tower (No Gen) fixed RNB Score 7 # 1
General Reinforcement Learning Obstacle Tower (No Gen) fixed PPO Score 5 # 2
General Reinforcement Learning Obstacle Tower (No Gen) varied PPO Score 1 # 2
General Reinforcement Learning Obstacle Tower (No Gen) varied RNB Score 4.8 # 1
General Reinforcement Learning Obstacle Tower (Strong Gen) fixed RNB Score 0.6 # 1
General Reinforcement Learning Obstacle Tower (Strong Gen) fixed PPO Score 0.6 # 1
General Reinforcement Learning Obstacle Tower (Strong Gen) varied RNB Score 0.8 # 1
General Reinforcement Learning Obstacle Tower (Strong Gen) varied PPO Score 0.6 # 2
General Reinforcement Learning Obstacle Tower (Weak Gen) fixed PPO Score 1.2 # 1
General Reinforcement Learning Obstacle Tower (Weak Gen) fixed RNB Score 1 # 2
General Reinforcement Learning Obstacle Tower (Weak Gen) varied RNB Score 3.4 # 1
General Reinforcement Learning Obstacle Tower (Weak Gen) varied PPO Score 0.8 # 2

Methods


No methods listed for this paper. Add relevant methods here