The task is to train an agent to play SNES games such as Super Mario.
( Image credit: Large-Scale Study of Curiosity-Driven Learning )
|TREND||DATASET||BEST METHOD||PAPER TITLE||PAPER||CODE||COMPARE|
However, annotating each environment with hand-designed, dense rewards is not scalable, motivating the need for developing reward functions that are intrinsic to the agent.
#8 best model for Atari Games on Atari 2600 Freeway
This paper trains a GAN to generate levels for Super Mario Bros using a level from the Video Game Level Corpus.