no code implementations • 16 Jun 2023 • Qingshuang Sun, Denis Steckelmacher, Yuan YAO, Ann Nowé, Raphaël Avalos
Communication plays a vital role in multi-agent systems, fostering collaboration and coordination.
no code implementations • 30 Jan 2023 • Hélène Plisnier, Denis Steckelmacher, Jeroen Willems, Bruno Depraetere, Ann Nowé
Instances of similar or almost-identical industrial machines or tools are often deployed at once, or in quick succession.
1 code implementation • 10 Jun 2021 • Youri Coppens, Denis Steckelmacher, Catholijn M. Jonker, Ann Nowé
To ensure that the rules explain a valid, non-degenerate policy, we introduce a refinement algorithm that fine-tunes the rules to obtain good performance when executed in the environment.
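As a rough illustration of such a refinement step, the sketch below (the toy environment, reward, and all names are hypothetical, not the paper's algorithm) treats rule thresholds as parameters and keeps only the local perturbations that improve the average return:

```python
import numpy as np

rng = np.random.default_rng(0)

def rule_policy(obs, thresholds):
    """Toy decision list: fire the first rule whose threshold is exceeded."""
    for action, t in enumerate(thresholds, start=1):
        if obs > t:
            return action
    return 0

def evaluate(thresholds, episodes=50, horizon=30):
    """Average return of the rule policy on a toy 1-D environment."""
    total = 0.0
    for _ in range(episodes):
        for _ in range(horizon):
            obs = rng.uniform(-1, 1)
            a = rule_policy(obs, thresholds)
            total += 1.0 if (a == 1) == (obs > 0.5) else 0.0  # toy reward
    return total / episodes

def refine(thresholds, iters=100, sigma=0.05):
    """Local search: keep a perturbation only when the return improves."""
    best, best_score = np.array(thresholds, dtype=float), evaluate(thresholds)
    for _ in range(iters):
        cand = best + sigma * rng.standard_normal(best.shape)
        score = evaluate(cand)
        if score > best_score:
            best, best_score = cand, score
    return best

print(refine([0.0]))  # drifts toward the informative threshold 0.5
```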
no code implementations • 1 Jan 2021 • Gregory Bonaert, Youri Coppens, Denis Steckelmacher, Ann Nowé
Our key contribution to improving explainability is the introduction of goal-based explanations, a new explanation mechanism in which the agent produces goals and attempts to reach them one by one while maximizing the collected reward.
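The control flow this describes might look like the following sketch, where every API (`env.step`, `env.reached`, the two policies) is a hypothetical stand-in rather than the paper's interface:

```python
def run_episode(env, high_level, low_level, max_goal_steps=50):
    """Sketch of the goal-based control flow: the agent produces a goal,
    pursues it until reached or timed out, then produces the next one;
    the goal sequence is the explanation of the episode."""
    obs, done, total_reward = env.reset(), False, 0.0
    explanation = []                       # the goals explain the behaviour
    while not done:
        goal = high_level(obs)             # agent produces a goal...
        explanation.append(goal)
        for _ in range(max_goal_steps):    # ...and tries to reach it
            obs, reward, done = env.step(low_level(obs, goal))
            total_reward += reward         # reward is still maximized
            if done or env.reached(obs, goal):
                break
    return explanation, total_reward
```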
no code implementations • 18 Jul 2019 • Hélène Plisnier, Denis Steckelmacher, Diederik Roijers, Ann Nowé
After training in the lab, the robot should be able to get by without the expensive equipment that used to be available to it, and yet still be guaranteed to perform well in the field.
1 code implementation • 11 Mar 2019 • Denis Steckelmacher, Hélène Plisnier, Diederik M. Roijers, Ann Nowé
We argue that actor-critic algorithms are limited by their need for an on-policy critic.
no code implementations • 7 Feb 2019 • Hélène Plisnier, Denis Steckelmacher, Diederik M. Roijers, Ann Nowé
In this paper, we propose an elegant solution, the Actor-Advisor architecture, in which a Policy Gradient actor learns from unbiased Monte-Carlo returns, while being shaped (or advised) by the Softmax policy arising from an off-policy critic.
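In distribution terms, this shaping can be pictured as an element-wise product of the actor's policy with the critic's Softmax policy, renormalized. The function below is an illustrative reading of that mixing, not the paper's actual code:

```python
import numpy as np

def mixed_policy(actor_probs, q_values, temperature=1.0):
    """Shape the actor's policy with the Softmax policy arising from an
    off-policy critic: multiply the two distributions element-wise and
    renormalize (a sketch of the advising described above)."""
    q = np.asarray(q_values) / temperature
    advisor = np.exp(q - q.max())
    advisor /= advisor.sum()              # Softmax over the critic's Q-values
    mixed = np.asarray(actor_probs) * advisor
    return mixed / mixed.sum()

# Example: the critic's advice sharpens a near-uniform actor
print(mixed_policy([0.4, 0.3, 0.3], q_values=[1.0, 0.0, -1.0]))
```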
3 code implementations • 20 Sep 2018 • Axel Abels, Diederik M. Roijers, Tom Lenaerts, Ann Nowé, Denis Steckelmacher
In the dynamic weights setting, the relative importance changes over time, and specialized algorithms that deal with such change, such as the tabular Reinforcement Learning (RL) algorithm of Natarajan and Tadepalli (2005), are required.
Tasks: Multi-Objective Reinforcement Learning, Reinforcement Learning
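A minimal illustration of the dynamic-weights setting, assuming linear scalarization of vector-valued Q-estimates (an assumed, common choice): once the Q-vectors are learned, a change in the weight vector can change the greedy action without any relearning.

```python
import numpy as np

def greedy_action(q_vectors, weights):
    """q_vectors: (n_actions, n_objectives) multi-objective Q-values.
    The weight vector encodes the current relative importance of the
    objectives; when it changes over time, the greedy action changes
    without relearning the Q-vectors."""
    scalarized = np.asarray(q_vectors) @ np.asarray(weights)
    return int(np.argmax(scalarized))

q = [[1.0, 0.0],   # action 0: good for objective 0
     [0.0, 1.0]]   # action 1: good for objective 1
print(greedy_action(q, [0.9, 0.1]))  # -> 0
print(greedy_action(q, [0.1, 0.9]))  # -> 1 after the weights change
```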
no code implementations • 13 Aug 2018 • Hélène Plisnier, Denis Steckelmacher, Tim Brys, Diederik M. Roijers, Ann Nowé
Our technique, Directed Policy Gradient (DPG), allows a teacher or backup policy to override the agent before it acts undesirably, while allowing the agent to leverage human advice or directives to learn faster.
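One simplified way to picture the override described above is a pre-execution veto; the `Teacher` API below is a hypothetical stand-in, not the paper's mechanism, which works through the policy itself:

```python
class Teacher:
    """Hypothetical backup policy that vetoes undesirable actions."""
    def is_undesirable(self, obs, action):
        return action == 2                 # toy rule: action 2 is unsafe
    def backup_action(self, obs):
        return 0                           # safe default

def act_with_override(obs, agent_action, teacher):
    """The teacher inspects the agent's chosen action before it is
    executed and substitutes a backup action when needed."""
    if teacher.is_undesirable(obs, agent_action):
        return teacher.backup_action(obs)
    return agent_action

print(act_with_override(obs=None, agent_action=2, teacher=Teacher()))  # -> 0
```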
no code implementations • 22 Aug 2017 • Denis Steckelmacher, Diederik M. Roijers, Anna Harutyunyan, Peter Vrancx, Hélène Plisnier, Ann Nowé
Many real-world reinforcement learning problems have a hierarchical nature, and often exhibit some degree of partial observability.
no code implementations • 17 Dec 2015 • Denis Steckelmacher, Peter Vrancx
This paper explores the performance of fitted neural Q iteration for reinforcement learning in several partially observable environments, using three recurrent neural network architectures: Long Short-Term Memory, Gated Recurrent Unit, and MUT1, a recurrent architecture evolved from a pool of several thousand candidate architectures.
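For concreteness, a recurrent Q-network of the kind this setup requires might look as follows (a PyTorch sketch with assumed sizes, not the paper's exact configuration; swapping `nn.GRU` for `nn.LSTM` gives the LSTM variant, and MUT1 has no off-the-shelf PyTorch module):

```python
import torch
import torch.nn as nn

class RecurrentQNetwork(nn.Module):
    """Recurrent Q-network for partially observable tasks: the hidden
    state summarizes the observation history, standing in for the
    unobserved environment state."""
    def __init__(self, obs_dim, n_actions, hidden=64):
        super().__init__()
        self.gru = nn.GRU(obs_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_actions)

    def forward(self, obs_seq, h0=None):
        # obs_seq: (batch, time, obs_dim)
        out, hn = self.gru(obs_seq, h0)
        return self.head(out), hn          # Q-values at every time step

# Fitted Q iteration target on a batch of sequences (sketch):
# q_target = reward + gamma * (1 - done) * max_a Q_frozen(next_seq)[..., a]
net = RecurrentQNetwork(obs_dim=4, n_actions=2)
q, _ = net(torch.zeros(8, 10, 4))          # 8 sequences of 10 steps
print(q.shape)                             # torch.Size([8, 10, 2])
```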