Search Results for author: Audrey Durand

Found 21 papers, 8 papers with code

Randomized Confidence Bounds for Stochastic Partial Monitoring

no code implementations • 7 Feb 2024 • Maxime Heuillet, Ola Ahmad, Audrey Durand

In this paper, we consider the contextual and non-contextual partial monitoring (PM) settings with stochastic outcomes.

Association Rules Mining with Auto-Encoders

no code implementations • 26 Apr 2023 • Théophile Berteloot, Richard Khoury, Audrey Durand

Classical association rule mining algorithms have several limitations, especially with regard to their high execution times and the number of rules they produce.

Interpret Your Care: Predicting the Evolution of Symptoms for Cancer Patients

no code implementations • 19 Feb 2023 • Rupali Bhati, Jennifer Jones, Audrey Durand

The focus of this study is on predicting a patient's pain and tiredness levels after their diagnosis.

Neural Bandits for Data Mining: Searching for Dangerous Polypharmacy

1 code implementation • 10 Dec 2022 • Alexandre Larouche, Audrey Durand, Richard Khoury, Caroline Sirois

Polypharmacy, most often defined as the simultaneous use of five or more drugs, is a prevalent phenomenon in the older population.

Thompson Sampling

Cambrian Explosion Algorithm for Multi-Objective Association Rules Mining

no code implementations • 23 Nov 2022 • Théophile Berteloot, Richard Khoury, Audrey Durand

Association rule mining is one of the most studied research fields of data mining, with applications ranging from grocery basket problems to highly explainable classification systems.

Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization

no code implementations • 15 Dec 2021 • Tong Li, Jacob Nogas, Haochen Song, Harsh Kumar, Audrey Durand, Anna Rafferty, Nina Deliu, Sofia S. Villar, Joseph J. Williams

TS-PostDiff takes a Bayesian approach to mixing TS and Uniform Random (UR): the probability a participant is assigned using UR allocation is the posterior probability that the difference between two arms is 'small' (below a certain threshold), allowing for more UR exploration when there is little or no reward to be gained.

Thompson Sampling
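
To make the TS-PostDiff mixing rule above concrete, here is a minimal sketch for two Bernoulli arms with Beta posteriors; the threshold `epsilon`, the uniform Beta(1, 1) priors, and the Monte Carlo estimate of the posterior probability are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np

rng = np.random.default_rng(0)

def ts_postdiff_choose(successes, failures, epsilon=0.1, n_mc=10_000):
    """TS-PostDiff arm choice for two Bernoulli arms (sketch).

    Uses Uniform Random allocation with probability equal to the
    posterior probability that the arms differ by less than epsilon;
    otherwise falls back to a Thompson Sampling draw.
    """
    # Beta posteriors under uniform Beta(1, 1) priors (assumption).
    p0 = rng.beta(1 + successes[0], 1 + failures[0], n_mc)
    p1 = rng.beta(1 + successes[1], 1 + failures[1], n_mc)
    # Monte Carlo estimate of P(|p0 - p1| < epsilon | data).
    prob_small_diff = np.mean(np.abs(p0 - p1) < epsilon)
    if rng.random() < prob_small_diff:
        return int(rng.integers(2))  # Uniform Random allocation
    # Thompson Sampling: one posterior draw per arm, pick the max.
    draws = [rng.beta(1 + s, 1 + f) for s, f in zip(successes, failures)]
    return int(np.argmax(draws))

print(ts_postdiff_choose(successes=[12, 9], failures=[8, 11]))
```

The mixing weight shrinks as evidence accumulates that one arm is better, so UR exploration fades exactly when reward is at stake.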

GrowSpace: Learning How to Shape Plants

no code implementations • 15 Oct 2021 • Yasmeen Hitti, Ionelia Buzatu, Manuel Del Verme, Mark Lefsrud, Florian Golemo, Audrey Durand

We argue that plant responses to an environmental stimulus are a good example of a real-world problem that can be approached within a reinforcement learning (RL) framework.

Fairness
Reinforcement Learning (RL)

Challenges in Statistical Analysis of Data Collected by a Bandit Algorithm: An Empirical Exploration in Applications to Adaptively Randomized Experiments

no code implementations • 22 Mar 2021 • Joseph Jay Williams, Jacob Nogas, Nina Deliu, Hammad Shaikh, Sofia S. Villar, Audrey Durand, Anna Rafferty

We therefore use our case study of the ubiquitous two-arm binary reward setting to empirically investigate the impact of using Thompson Sampling instead of uniform random assignment.

Thompson Sampling
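
As a rough illustration of the setting the study investigates, the sketch below simulates the two-arm binary-reward case under both allocation schemes; the arm means, horizon, and Beta-Bernoulli model are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
TRUE_MEANS = (0.5, 0.6)  # illustrative two-arm binary-reward setting
HORIZON = 1_000

def run(policy):
    """Run one experiment; return how often each arm was assigned."""
    s, f = np.ones(2), np.ones(2)  # Beta(1, 1) posteriors (assumption)
    counts = np.zeros(2, dtype=int)
    for _ in range(HORIZON):
        if policy == "uniform":
            arm = int(rng.integers(2))            # uniform random assignment
        else:
            arm = int(np.argmax(rng.beta(s, f)))  # Thompson Sampling
        reward = rng.random() < TRUE_MEANS[arm]
        s[arm] += reward
        f[arm] += 1 - reward
        counts[arm] += 1
    return counts

print("Thompson Sampling allocation:", run("ts"))
print("Uniform random allocation:   ", run("uniform"))
```

The skewed, data-dependent allocation under TS is precisely what complicates downstream hypothesis tests relative to the balanced uniform design.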

Comparison of pharmacist evaluation of medication orders with predictions of a machine learning model

1 code implementation • 3 Nov 2020 • Sophie-Camille Hogue, Flora Chen, Geneviève Brassard, Denis Lebel, Jean-François Bussières, Audrey Durand, Maxime Thibault

The objective of this work was to assess the clinical performance of an unsupervised machine learning model aimed at identifying unusual medication orders and pharmacological profiles.

BIG-bench Machine Learning

Deep interpretability for GWAS

no code implementations • 3 Jul 2020 • Deepak Sharma, Audrey Durand, Marc-André Legault, Louis-Philippe Lemieux Perreault, Audrey Lemaçon, Marie-Pierre Dubé, Joelle Pineau

Genome-Wide Association Studies are typically conducted using linear models to find genetic variants associated with common diseases.
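
For context on that standard approach (not the paper's deep-interpretability method), here is a minimal per-variant linear-model association test; the 0/1/2 minor-allele coding and the two-sided t-test on the slope are the conventional setup, and all data here are synthetic.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
n, n_snps = 500, 100
genotypes = rng.integers(0, 3, size=(n, n_snps)).astype(float)  # 0/1/2 allele counts
phenotype = 0.3 * genotypes[:, 7] + rng.normal(size=n)          # SNP 7 causal (synthetic)

def slope_pvalue(x, y):
    """OLS of y on x (with intercept); two-sided t-test p-value for the slope."""
    x = x - x.mean()
    y = y - y.mean()
    beta = (x @ y) / (x @ x)
    resid = y - beta * x
    se = np.sqrt((resid @ resid) / (len(y) - 2) / (x @ x))
    return 2 * stats.t.sf(abs(beta / se), df=len(y) - 2)

pvals = np.array([slope_pvalue(genotypes[:, j], phenotype) for j in range(n_snps)])
print("most associated variant:", pvals.argmin(), "p =", pvals.min())
```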

Old Dog Learns New Tricks: Randomized UCB for Bandit Problems

1 code implementation • 11 Oct 2019 • Sharan Vaswani, Abbas Mehrabian, Audrey Durand, Branislav Kveton

We propose $\tt RandUCB$, a bandit strategy that builds on theoretically derived confidence intervals similar to upper confidence bound (UCB) algorithms, but akin to Thompson sampling (TS), it uses randomization to trade off exploration and exploitation.

Thompson Sampling
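
A minimal sketch of that idea: a UCB-style index whose confidence width is scaled by a multiplier resampled every round, so exploration is randomized as in TS. The uniform distribution for the multiplier and the Hoeffding-style width are illustrative choices here, not necessarily the distribution the paper derives.

```python
import numpy as np

rng = np.random.default_rng(3)

def randucb_choose(means, counts, t, b_max=2.0):
    """RandUCB-style arm choice (sketch).

    Computes the index mean_i + Z * width_i, where the single
    multiplier Z is resampled each round rather than fixed.
    """
    z = rng.uniform(0.0, b_max)  # illustrative distribution for Z (assumption)
    widths = np.sqrt(2.0 * np.log(max(t, 2)) / np.maximum(counts, 1))
    return int(np.argmax(np.asarray(means) + z * widths))

print(randucb_choose(means=[0.52, 0.61], counts=[40, 60], t=100))
```

With Z fixed at a constant this reduces to vanilla UCB; letting Z be random recovers TS-like exploration while keeping UCB-style confidence widths.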

Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning

no code implementations • 17 Sep 2019 • Thang Doan, Bogdan Mazoure, Moloud Abdar, Audrey Durand, Joelle Pineau, R. Devon Hjelm

Continuous control tasks in reinforcement learning are important because they provide a framework for learning in high-dimensional state spaces with deceptive rewards, where the agent can easily become trapped in suboptimal solutions.

Continuous Control
reinforcement-learning +1

Leveraging exploration in off-policy algorithms via normalizing flows

1 code implementation • 16 May 2019 • Bogdan Mazoure, Thang Doan, Audrey Durand, R. Devon Hjelm, Joelle Pineau

The ability to discover approximately optimal policies in domains with sparse rewards is crucial to applying reinforcement learning (RL) in many real-world scenarios.

Continuous Control
Reinforcement Learning (RL)

On-line Adaptative Curriculum Learning for GANs

3 code implementations • 31 Jul 2018 • Thang Doan, Joao Monteiro, Isabela Albuquerque, Bogdan Mazoure, Audrey Durand, Joelle Pineau, R. Devon Hjelm

We argue that less expressive discriminators are smoother and have a more coarse-grained view of the mode map, which forces the generator to cover a wide portion of the data distribution's support.

Multi-Armed Bandits
Stochastic Optimization

Learning to Become an Expert: Deep Networks Applied To Super-Resolution Microscopy

no code implementations • 28 Mar 2018 • Louis-Émile Robitaille, Audrey Durand, Marc-André Gardner, Christian Gagné, Paul De Koninck, Flavie Lavoie-Cardinal

More specifically, we propose a system based on a deep neural network that provides a quantitative quality measure for a STED image of neuronal structures given as input.

Super-Resolution

Estimating Quality in Multi-Objective Bandits Optimization

no code implementations • 4 Jan 2017 • Audrey Durand, Christian Gagné

The question is: how good do estimations of these objectives have to be in order for the solution maximizing the preference function to remain unchanged?

Thompson Sampling
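
One way to make that question concrete, assuming a linear preference function and a uniform per-objective estimation-error bound eps: the estimated maximizer is guaranteed unchanged when its preference gap over the runner-up exceeds the worst-case score perturbation. This gap condition is a standard sufficient condition used for illustration, not the paper's exact result.

```python
import numpy as np

def argmax_is_stable(objectives, weights, eps):
    """For a linear preference f(x) = w @ x with nonnegative weights and
    per-objective estimation error at most eps, each score can shift by
    at most eps * sum(w), so the estimated argmax provably matches the
    true one when the top-two preference gap exceeds 2 * eps * sum(w)."""
    scores = objectives @ weights
    top, runner_up = np.sort(scores)[-2:][::-1]
    return (top - runner_up) > 2 * eps * weights.sum()

# Three candidate solutions, two objectives, preference favoring objective 1.
objectives = np.array([[0.9, 0.2], [0.7, 0.6], [0.4, 0.9]])
weights = np.array([0.7, 0.3])
print(argmax_is_stable(objectives, weights, eps=0.005))  # True: gap large enough
print(argmax_is_stable(objectives, weights, eps=0.05))   # False: estimates too coarse
```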
