no code implementations • 29 Sep 2021 • Louis Bagot, Kevin Mets, Tom De Schepper, Peter Hellinckx, Steven Latre
As an alternative to the widespread method of a weighted sum of rewards, Explore Options let the agent call an intrinsically motivated agent in order to observe and learn from interesting behaviors in the environment.
no code implementations • ICML Workshop LifelongML 2020 • Louis Bagot, Kevin Mets, Steven Latré
A Reinforcement Learning (RL) agent needs to find an optimal sequence of actions in order to maximize rewards.