1 code implementation • 13 Jun 2019 • Francesco Foglino, Christiano Coletto Christakou, Ricardo Luna Gutierrez, Matteo Leonetti
We propose a task sequencing algorithm maximizing the cumulative return, that is, the return obtained by the agent across all the learning episodes.
no code implementations • 31 Jan 2019 • Francesco Foglino, Christiano Coletto Christakou, Matteo Leonetti
In reinforcement learning, all previous task sequencing methods have shaped exploration with the objective of reducing the time to reach a given performance level.