Playing hard exploration games by watching YouTube

NeurIPS 2018 Yusuf AytarTobias PfaffDavid BuddenTom Le PaineZiyu WangNando de Freitas

Deep reinforcement learning methods traditionally struggle with tasks where environment rewards are particularly sparse. One successful method of guiding exploration in these domains is to imitate trajectories provided by a human demonstrator... (read more)

PDF Abstract

Evaluation results from the paper

  Submit results from this paper to get state-of-the-art GitHub badges and help community compare results to other papers.