# MIME: Mutual Information Minimisation Exploration

16 Jan 2020Haitao XuBrendan McCaneLech SzymanskiCraig Atkinson

We show that reinforcement learning agents that learn by surprise (surprisal) get stuck at abrupt environmental transition boundaries because these transitions are difficult to learn. We propose a counter-intuitive solution that we call Mutual Information Minimising Exploration (MIME) where an agent learns a latent representation of the environment without trying to predict the future states... (read more)

PDF Abstract