Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models

3 Jul 2015Bradly C. StadieSergey LevinePieter Abbeel

Achieving efficient and scalable exploration in complex domains poses a major challenge in reinforcement learning. While Bayesian and PAC-MDP approaches to the exploration problem offer strong formal guarantees, they are often impractical in higher dimensions due to their reliance on enumerating the state-action space... (read more)

PDF Abstract

Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Atari Games Atari 2600 Freeway MP-EB Score 27.0 # 24
Atari Games Atari 2600 Frostbite MP-EB Score 507.0 # 30
Atari Games Atari 2600 Montezuma's Revenge MP-EB Score 142 # 18
Atari Games Atari 2600 Q*Bert MP-EB Score 15805 # 15
Atari Games Atari 2600 Venture MP-EB Score 0.0 # 37

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet