Search Results for author: Wendelin Boehmer

Found 7 papers, 4 papers with code

My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control

1 code implementation • ICLR 2021 • Vitaly Kurin, Maximilian Igl, Tim Rocktäschel, Wendelin Boehmer, Shimon Whiteson

They also allow practitioners to inject biases encoded in the structure of the input graph.

Paper
Code

Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning

no code implementations • ICLR 2021 • Maximilian Igl, Gregory Farquhar, Jelena Luketina, Wendelin Boehmer, Shimon Whiteson

Non-stationarity can arise in Reinforcement Learning (RL) even in stationary environments.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Privileged Information Dropout in Reinforcement Learning

no code implementations • 19 May 2020 • Pierre-Alexandre Kamienny, Kai Arulkumaran, Feryal Behbahani, Wendelin Boehmer, Shimon Whiteson

Using privileged information during training can improve the sample efficiency and performance of machine learning systems.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination

no code implementations • 21 Oct 2019 • Dongge Han, Wendelin Boehmer, Michael Wooldridge, Alex Rogers

We evaluate our model empirically on a set of multi-agent pursuit and taxi tasks, and show that our agents learn to adapt flexibly across scenarios that require different termination behaviours.

Hierarchical Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Deep Residual Reinforcement Learning

1 code implementation • 3 May 2019 • Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

We revisit residual algorithms in both model-free and model-based reinforcement learning settings.

Model-based Reinforcement Learning reinforcement-learning +1

3,096

Paper
Code

Generalized Off-Policy Actor-Critic

1 code implementation • NeurIPS 2019 • Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

We propose a new objective, the counterfactual objective, unifying existing objectives for off-policy policy gradient algorithms in the continuing reinforcement learning (RL) setting.

counterfactual reinforcement-learning +1

3,096

Paper
Code

Multi-Agent Common Knowledge Reinforcement Learning

1 code implementation • NeurIPS 2019 • Christian A. Schroeder de Witt, Jakob N. Foerster, Gregory Farquhar, Philip H. S. Torr, Wendelin Boehmer, Shimon Whiteson

In this paper, we show that common knowledge between agents allows for complex decentralised coordination.

Multi-agent Reinforcement Learning reinforcement-learning +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.