Search Results for author: Julien Roy

Found 6 papers, 3 papers with code

Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design

no code implementations • 7 Jun 2023 • Julien Roy, Pierre-Luc Bacon, Christopher Pal, Emmanuel Bengio

In recent years, in-silico molecular design has received much attention from the machine learning community.

Direct Behavior Specification via Constrained Reinforcement Learning

1 code implementation • 22 Dec 2021 • Julien Roy, Roger Girgis, Joshua Romoff, Pierre-Luc Bacon, Christopher Pal

The standard formulation of Reinforcement Learning lacks a practical way of specifying what are admissible and forbidden behaviors.

Continuous Control reinforcement-learning +1

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

3 code implementations • NeurIPS 2020 • Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai

Adversarial Imitation Learning alternates between learning a discriminator -- which tells apart expert's demonstrations from generated ones -- and a generator's policy to produce trajectories that can fool this discriminator.

Imitation Learning reinforcement-learning +1

Option-Critic in Cooperative Multi-agent Systems

1 code implementation • 28 Nov 2019 • Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu, Doina Precup

In this paper, we investigate learning temporal abstractions in cooperative multi-agent systems, using the options framework (Sutton et al., 1999).

Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

no code implementations • NeurIPS 2020 • Julien Roy, Paul Barde, Félix G. Harvey, Derek Nowrouzezahrai, Christopher Pal

Finally, we analyze the effects of our proposed methods on the policies that our agents learn and show that our methods successfully enforce the qualities that we propose as proxies for coordinated behaviors.

Continuous Control Inductive Bias +3

Recurrent Semi-supervised Classification and Constrained Adversarial Generation with Motion Capture Data

no code implementations • 20 Nov 2015 • Félix G. Harvey, Julien Roy, David Kanaa, Christopher Pal

We find that using such constraints helps stabilize the training of recurrent adversarial architectures for animation generation.

Clustering General Classification
