no code implementations • 22 Mar 2019 • Stephen Zhen Gou, Yuyang Liu
However, one insight is that these transitions can be used to learn the dynamics of the environment as a supervised learning problem.
Atari Games OpenAI Gym +1