Search Results for author: Denis Steckelmacher

Found 11 papers, 3 papers with code

Synthesising Reinforcement Learning Policies through Set-Valued Inductive Rule Learning

1 code implementation10 Jun 2021 Youri Coppens, Denis Steckelmacher, Catholijn M. Jonker, Ann Nowé

Then, to ensure that the rules explain a valid, non-degenerate policy, we introduce a refinement algorithm that fine-tunes the rules to obtain good performance when executed in the environment.

reinforcement-learning Reinforcement Learning (RL) +1

Explainable Reinforcement Learning Through Goal-Based Explanations

no code implementations1 Jan 2021 Gregory Bonaert, Youri Coppens, Denis Steckelmacher, Ann Nowe

Our key contribution to improve explainability is introducing goal-based explanations, a new explanation mechanism where the agent produces goals and attempts to reach those goals one-by-one while maximizing the collected reward.

reinforcement-learning Reinforcement Learning (RL)

Transfer Learning Across Simulated Robots With Different Sensors

no code implementations18 Jul 2019 Hélène Plisnier, Denis Steckelmacher, Diederik Roijers, Ann Nowé

After training in the lab, the robot should be able to get by without the expensive equipment that used to be available to it, and yet still be guaranteed to perform well on the field.

Transfer Learning

The Actor-Advisor: Policy Gradient With Off-Policy Advice

no code implementations7 Feb 2019 Hélène Plisnier, Denis Steckelmacher, Diederik M. Roijers, Ann Nowé

In this paper, we propose an elegant solution, the Actor-Advisor architecture, in which a Policy Gradient actor learns from unbiased Monte-Carlo returns, while being shaped (or advised) by the Softmax policy arising from an off-policy critic.

Transfer Learning

Dynamic Weights in Multi-Objective Deep Reinforcement Learning

3 code implementations20 Sep 2018 Axel Abels, Diederik M. Roijers, Tom Lenaerts, Ann Nowé, Denis Steckelmacher

In the dynamic weights setting the relative importance changes over time and specialized algorithms that deal with such change, such as a tabular Reinforcement Learning (RL) algorithm by Natarajan and Tadepalli (2005), are required.

Multi-Objective Reinforcement Learning reinforcement-learning

Directed Policy Gradient for Safe Reinforcement Learning with Human Advice

no code implementations13 Aug 2018 Hélène Plisnier, Denis Steckelmacher, Tim Brys, Diederik M. Roijers, Ann Nowé

Our technique, Directed Policy Gradient (DPG), allows a teacher or backup policy to override the agent before it acts undesirably, while allowing the agent to leverage human advice or directives to learn faster.

reinforcement-learning Reinforcement Learning (RL) +1

An Empirical Comparison of Neural Architectures for Reinforcement Learning in Partially Observable Environments

no code implementations17 Dec 2015 Denis Steckelmacher, Peter Vrancx

This paper explores the performance of fitted neural Q iteration for reinforcement learning in several partially observable environments, using three recurrent neural network architectures: Long Short-Term Memory, Gated Recurrent Unit and MUT1, a recurrent neural architecture evolved from a pool of several thousands candidate architectures.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.