Effective Reward Specification in Deep Reinforcement Learning

no code implementations 10 Dec 2024 Julien Roy

In the last decade, Deep Reinforcement Learning has evolved into a powerful tool for complex sequential decision-making problems.

Deep Reinforcement Learning reinforcement-learning +1

Efficient Biological Data Acquisition through Inference Set Design

no code implementations 25 Oct 2024 Ihor Neporozhnii, Julien Roy, Emmanuel Bengio, Jason Hartford

In drug discovery, highly automated high-throughput laboratories are used to screen a large number of compounds in search of effective drugs.

Active Learning Drug Discovery

SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints

1 code implementation 2 May 2024 Miruna Cretu, Charles Harris, Ilia Igashov, Arne Schneuing, Marwin Segler, Bruno Correia, Julien Roy, Emmanuel Bengio, Pietro Liò

To address this, we introduce various strategies for learning the GFlowNet backward policy and thus demonstrate how additional constraints can be integrated into the GFlowNet MDP framework.

Diversity Drug Design +2

Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design

no code implementations 7 Jun 2023 Julien Roy, Pierre-Luc Bacon, Christopher Pal, Emmanuel Bengio

In recent years, in-silico molecular design has received much attention from the machine learning community.

Direct Behavior Specification via Constrained Reinforcement Learning

1 code implementation 22 Dec 2021 Julien Roy, Roger Girgis, Joshua Romoff, Pierre-Luc Bacon, Christopher Pal

The standard formulation of Reinforcement Learning lacks a practical way of specifying which behaviors are admissible and which are forbidden.

continuous-control Continuous Control +3

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

3 code implementations NeurIPS 2020 Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai

Adversarial Imitation Learning alternates between learning a discriminator -- which tells apart expert's demonstrations from generated ones -- and a generator's policy to produce trajectories that can fool this discriminator.

Imitation Learning reinforcement-learning +2

Option-Critic in Cooperative Multi-agent Systems

1 code implementation 28 Nov 2019 Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu, Doina Precup

In this paper, we investigate learning temporal abstractions in cooperative multi-agent systems, using the options framework (Sutton et al., 1999).

Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

no code implementations NeurIPS 2020 Julien Roy, Paul Barde, Félix G. Harvey, Derek Nowrouzezahrai, Christopher Pal

Finally, we analyze the effects of our proposed methods on the policies that our agents learn and show that our methods successfully enforce the qualities that we propose as proxies for coordinated behaviors.

continuous-control Continuous Control +5

Recurrent Semi-supervised Classification and Constrained Adversarial Generation with Motion Capture Data

no code implementations 20 Nov 2015 Félix G. Harvey, Julien Roy, David Kanaa, Christopher Pal

We find that using such constraints helps stabilize the training of recurrent adversarial architectures for animation generation.

Clustering Decoder +1

