Search Results for author: Paul Barde

Found 6 papers, 2 papers with code

A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem

no code implementations • 26 May 2023 • Paul Barde, Jakob Foerster, Derek Nowrouzezahrai, Amy Zhang

Training multiple agents to coordinate is an essential problem with applications in robotics, game theory, economics, and social sciences.

Multi-agent Reinforcement Learning

Paper
Add Code

Learning to Guide and to Be Guided in the Architect-Builder Problem

1 code implementation • ICLR 2022 • Paul Barde, Tristan Karch, Derek Nowrouzezahrai, Clément Moulin-Frier, Christopher Pal, Pierre-Yves Oudeyer

ABIG results in a low-level, high-frequency, guiding communication protocol that not only enables an architect-builder pair to solve the task at hand, but that can also generalize to unseen tasks.

Imitation Learning

Paper
Code

Regularized Inverse Reinforcement Learning

no code implementations • ICLR 2021 • Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

Inverse Reinforcement Learning (IRL) aims to facilitate a learner's ability to imitate expert behavior by acquiring reward functions that explain the expert's decisions.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

3 code implementations • NeurIPS 2020 • Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai

Adversarial Imitation Learning alternates between learning a discriminator -- which tells apart expert's demonstrations from generated ones -- and a generator's policy to produce trajectories that can fool this discriminator.

Imitation Learning reinforcement-learning +1

Paper
Code

Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

no code implementations • 24 Feb 2020 • Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai, Joelle Pineau

Multi-agent adversarial inverse reinforcement learning (MA-AIRL) is a recent approach that applies single-agent AIRL to multi-agent problems where we seek to recover both policies for our agents and reward functions that promote expert-like behavior.

Open-Ended Question Answering reinforcement-learning +1

Paper
Add Code

Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

no code implementations • NeurIPS 2020 • Julien Roy, Paul Barde, Félix G. Harvey, Derek Nowrouzezahrai, Christopher Pal

Finally, we analyze the effects of our proposed methods on the policies that our agents learn and show that our methods successfully enforce the qualities that we propose as proxies for coordinated behaviors.

Continuous Control Inductive Bias +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.