1 code implementation • 5 Dec 2023 • Pankayaraj Pathmanathan, Natalia Díaz-Rodríguez, Javier Del Ser
In this work, we investigate how curiosity can be used on replay buffers to improve offline multi-task continual reinforcement learning when tasks, which are defined by non-stationarity in the environment, are unlabeled and unevenly exposed to the learner over time.