Search Results for author: Alexandre Piché

Found 8 papers, 5 papers with code

Causal Discovery with Language Models as Imperfect Experts

1 code implementation5 Jul 2023 Stephanie Long, Alexandre Piché, Valentina Zantedeschi, Tibor Schuster, Alexandre Drouin

Understanding the causal relationships that underlie a system is a fundamental prerequisite to accurate decision-making.

Causal Discovery Decision Making +2

Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels

1 code implementation24 Sep 2022 Sai Rajeswar, Pietro Mazzaglia, Tim Verbelen, Alexandre Piché, Bart Dhoedt, Aaron Courville, Alexandre Lacoste

In this work, we study the URLB and propose a new method to solve it, using unsupervised model-based RL, for pre-training the agent, and a task-aware fine-tuning strategy combined with a new proposed hybrid planner, Dyna-MPC, to adapt the agent for downstream tasks.

reinforcement-learning Reinforcement Learning (RL) +1

Bridging the Gap Between Target Networks and Functional Regularization

1 code implementation4 Jun 2021 Alexandre Piché, Valentin Thomas, Rafael Pardinas, Joseph Marino, Gian Maria Marconi, Christopher Pal, Mohammad Emtiyaz Khan

Our findings emphasize that Functional Regularization can be used as a drop-in replacement for Target Networks and result in performance improvement.

Q-Learning

Iterative Amortized Policy Optimization

1 code implementation NeurIPS 2021 Joseph Marino, Alexandre Piché, Alessandro Davide Ialongo, Yisong Yue

Policy networks are a central feature of deep reinforcement learning (RL) algorithms for continuous control, enabling the estimation and sampling of high-value actions.

Continuous Control reinforcement-learning +2

Adaptive Stochastic Dual Coordinate Ascent for Conditional Random Fields

no code implementations22 Dec 2017 Rémi Le Priol, Alexandre Piché, Simon Lacoste-Julien

In this paper, we adapt SDCA to train CRFs, and we enhance it with an adaptive non-uniform sampling strategy based on block duality gaps.

Binary Classification General Classification

Bayesian Nonparametric Modeling of Heterogeneous Groups of Censored Data

no code implementations24 Oct 2016 Alexandre Piché, Russell Steele, Ian Shrier, Stephanie Long

Datasets containing large samples of time-to-event data arising from several small heterogeneous groups are commonly encountered in statistics.

Cannot find the paper you are looking for? You can Submit a new open access paper.