19 Jul 2021 • Mingqi Yuan, Mon-on Pun, Dong Wang, Yi Chen, Haojun Li
Furthermore, we leverage a variational auto-encoder (VAE) model to capture the life-long novelty of states, which is combined with the global JFI score to form multimodal intrinsic rewards.
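As a rough illustration of how a global fairness score and a per-state novelty score might be combined, the sketch below assumes that JFI denotes Jain's fairness index computed over state visit counts, and that the VAE novelty is available as a scalar such as a reconstruction error. The function names, the weight `lam`, and the weighted-sum form are hypothetical choices for this example and are not taken from the paper.

```python
import numpy as np


def jain_fairness_index(visit_counts):
    """Jain's fairness index (sum x)^2 / (n * sum x^2), which lies in [1/n, 1].

    Assumption: the index is applied to state visit counts, so a value of 1
    means all states were visited equally often.
    """
    x = np.asarray(visit_counts, dtype=float)
    denom = x.size * np.sum(x ** 2)
    if denom == 0.0:
        # No visits recorded yet; return 0.0 by convention in this sketch.
        return 0.0
    return float(np.sum(x) ** 2 / denom)


def multimodal_intrinsic_reward(novelty, visit_counts, lam=0.5):
    """Hypothetical combination: weighted sum of novelty and global JFI.

    `novelty` stands in for a VAE-based score (e.g. reconstruction error);
    no VAE is trained here.
    """
    return lam * float(novelty) + (1.0 - lam) * jain_fairness_index(visit_counts)


if __name__ == "__main__":
    counts = [10, 10, 10, 10]  # perfectly even visitation
    print(jain_fairness_index(counts))
    print(multimodal_intrinsic_reward(novelty=0.8, visit_counts=counts))
```

A weighted sum is only one possible way to merge the two signals; a product or a learned weighting would fit the same description equally well.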