Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU

18 Nov 2016Mohammad BabaeizadehIuri FrosioStephen TyreeJason ClemonsJan Kautz

We introduce a hybrid CPU/GPU version of the Asynchronous Advantage Actor-Critic (A3C) algorithm, currently the state-of-the-art method in reinforcement learning for various gaming tasks. We analyze its computational traits and concentrate on aspects critical to leveraging the GPU's computational power... (read more)

PDF Abstract

Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper