Search Results for author: Wendelin Boehmer

Found 7 papers, 4 papers with code

Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination

no code implementations21 Oct 2019 Dongge Han, Wendelin Boehmer, Michael Wooldridge, Alex Rogers

We evaluate our model empirically on a set of multi-agent pursuit and taxi tasks, and show that our agents learn to adapt flexibly across scenarios that require different termination behaviours.

Hierarchical Reinforcement Learning reinforcement-learning +1

Deep Residual Reinforcement Learning

1 code implementation3 May 2019 Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

We revisit residual algorithms in both model-free and model-based reinforcement learning settings.

Model-based Reinforcement Learning reinforcement-learning +1

Generalized Off-Policy Actor-Critic

1 code implementation NeurIPS 2019 Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

We propose a new objective, the counterfactual objective, unifying existing objectives for off-policy policy gradient algorithms in the continuing reinforcement learning (RL) setting.

counterfactual reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.