Search Results for author: Hélène Plisnier

Found 6 papers, 1 paper with code

Transfer Learning Across Simulated Robots With Different Sensors

no code implementations • 18 Jul 2019 • Hélène Plisnier, Denis Steckelmacher, Diederik Roijers, Ann Nowé

After training in the lab, the robot should be able to get by without the expensive equipment that used to be available to it, and yet still be guaranteed to perform well on the field.

Transfer Learning

The Actor-Advisor: Policy Gradient With Off-Policy Advice

no code implementations • 7 Feb 2019 • Hélène Plisnier, Denis Steckelmacher, Diederik M. Roijers, Ann Nowé

In this paper, we propose an elegant solution, the Actor-Advisor architecture, in which a Policy Gradient actor learns from unbiased Monte-Carlo returns, while being shaped (or advised) by the Softmax policy arising from an off-policy critic.
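
As a rough illustration of the idea in this abstract, the sketch below turns a critic's Q-values into a Softmax advice distribution and uses it to shape an actor's action probabilities. The combination rule (element-wise product followed by renormalization) and the names `softmax`, `shape_policy`, and `temperature` are assumptions made for this example; the abstract does not specify how the advice is applied.

```python
import math


def softmax(q_values, temperature=1.0):
    """Turn Q-values into a probability distribution (numerically stable)."""
    highest = max(q_values)
    exps = [math.exp((q - highest) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def shape_policy(actor_probs, advice_probs):
    """Assumed mixing rule: multiply element-wise, then renormalize."""
    mixed = [a * b for a, b in zip(actor_probs, advice_probs)]
    total = sum(mixed)
    if total == 0.0:
        # Actor and advisor share no supported action: fall back to the actor.
        return list(actor_probs)
    return [m / total for m in mixed]


# Hypothetical numbers for a three-action problem.
actor_probs = [0.5, 0.3, 0.2]   # output of the policy-gradient actor
q_values = [1.0, 2.0, 0.5]      # estimates from the off-policy critic
advice = softmax(q_values)
shaped = shape_policy(actor_probs, advice)
```

With these numbers, the shaped policy puts more probability on the second action than the actor alone does, because the critic rates that action highest.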

Transfer Learning

Directed Policy Gradient for Safe Reinforcement Learning with Human Advice

no code implementations • 13 Aug 2018 • Hélène Plisnier, Denis Steckelmacher, Tim Brys, Diederik M. Roijers, Ann Nowé

Our technique, Directed Policy Gradient (DPG), allows a teacher or backup policy to override the agent before it acts undesirably, while allowing the agent to leverage human advice or directives to learn faster.
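
The override behaviour described here can be pictured with a minimal sketch: when a teacher or backup policy supplies an action, that action is executed; otherwise the agent samples from its own policy. The function `act` and its parameters are hypothetical names for this example, and the sketch covers action selection only, not how DPG uses the advice during learning.

```python
import random


def act(actor_probs, teacher_action=None, rng=random):
    """Pick an action, letting a teacher override the agent when present.

    actor_probs: the agent's action probabilities.
    teacher_action: index chosen by the teacher or backup policy, or None.
    """
    if teacher_action is not None:
        # The teacher steps in before the agent can act undesirably.
        return teacher_action
    # No override: sample from the agent's own policy.
    actions = range(len(actor_probs))
    return rng.choices(actions, weights=actor_probs, k=1)[0]


# Hypothetical usage: the teacher forces action 1 despite the agent's preference.
forced = act([0.9, 0.1], teacher_action=1)
free = act([1.0, 0.0])
```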

Reinforcement Learning (RL) +1
