Search Results for author: Vincent Mai

Found 5 papers, 2 papers with code

Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response of Residential Loads

no code implementations • 6 Jan 2023 • Vincent Mai, Philippe Maisonneuve, Tianyu Zhang, Hadi Nekoei, Liam Paull, Antoine Lesage-Landry

To integrate high amounts of renewable energy resources, electrical power grids must be able to cope with high-amplitude, fast-timescale variations in power generation.

Multi-agent Reinforcement Learning, reinforcement-learning, +1

Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation

1 code implementation • ICLR 2022 • Vincent Mai, Kaustubh Mani, Liam Paull

In model-free deep reinforcement learning (RL) algorithms, using noisy value estimates to supervise policy evaluation and optimization is detrimental to sample efficiency.

Continuous Control, reinforcement-learning, +1

Batch Inverse-Variance Weighting: Deep Heteroscedastic Regression

no code implementations • 9 Jul 2021 • Vincent Mai, Waleed Khamies, Liam Paull

In many situations, however, the labelling process is able to estimate the variance of such a distribution for each label, which can be used as additional information to mitigate this impact.

regression

Batch Inverse-Variance Weighting: Deep Heteroscedastic Regression using Privileged Information

no code implementations • 1 Jan 2021 • Vincent Mai, Waleed Khamies, Liam Paull

In this work, we consider this setting and additionally assume that the label generating process is able to provide us with a quantity for the role of each label in the misalignment between the datasets, which we consider to be privileged information.

regression

Deep Active Localization

1 code implementation • 5 Mar 2019 • Sai Krishna, Keehong Seo, Dhaivat Bhatt, Vincent Mai, Krishna Murthy, Liam Paull

Traditional approaches to this use an information-theoretic criterion for action selection and hand-crafted perceptual models.

OpenAI Gym
