no code implementations • 10 Dec 2024 • Julien Roy
In the last decade, Deep Reinforcement Learning has evolved into a powerful tool for complex sequential decision-making problems.
no code implementations • 25 Oct 2024 • Ihor Neporozhnii, Julien Roy, Emmanuel Bengio, Jason Hartford
In drug discovery, highly automated high-throughput laboratories are used to screen a large number of compounds in search of effective drugs.
1 code implementation • 2 May 2024 • Miruna Cretu, Charles Harris, Ilia Igashov, Arne Schneuing, Marwin Segler, Bruno Correia, Julien Roy, Emmanuel Bengio, Pietro Liò
To address this, we introduce various strategies for learning the GFlowNet backward policy and thus demonstrate how additional constraints can be integrated into the GFlowNet MDP framework.
no code implementations • 7 Jun 2023 • Julien Roy, Pierre-Luc Bacon, Christopher Pal, Emmanuel Bengio
In recent years, in-silico molecular design has received much attention from the machine learning community.
1 code implementation • 22 Dec 2021 • Julien Roy, Roger Girgis, Joshua Romoff, Pierre-Luc Bacon, Christopher Pal
The standard formulation of Reinforcement Learning lacks a practical way of specifying what are admissible and forbidden behaviors.
3 code implementations • NeurIPS 2020 • Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai
Adversarial Imitation Learning alternates between learning a discriminator -- which tells apart expert's demonstrations from generated ones -- and a generator's policy to produce trajectories that can fool this discriminator.
1 code implementation • 28 Nov 2019 • Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu, Doina Precup
In this paper, we investigate learning temporal abstractions in cooperative multi-agent systems, using the options framework (Sutton et al, 1999).
no code implementations • NeurIPS 2020 • Julien Roy, Paul Barde, Félix G. Harvey, Derek Nowrouzezahrai, Christopher Pal
Finally, we analyze the effects of our proposed methods on the policies that our agents learn and show that our methods successfully enforce the qualities that we propose as proxies for coordinated behaviors.
no code implementations • 20 Nov 2015 • Félix G. Harvey, Julien Roy, David Kanaa, Christopher Pal
We find that using such constraints allow to stabilize the training of recurrent adversarial architectures for animation generation.