Search Results for author: Vincent Mai

Found 5 papers, 2 papers with code

Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response of Residential Loads

no code implementations • 6 Jan 2023 • Vincent Mai, Philippe Maisonneuve, Tianyu Zhang, Hadi Nekoei, Liam Paull, Antoine Lesage-Landry

To integrate high amounts of renewable energy resources, electrical power grids must be able to cope with high amplitude, fast timescale variations in power generation.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation

1 code implementation • ICLR 2022 • Vincent Mai, Kaustubh Mani, Liam Paull

In model-free deep reinforcement learning (RL) algorithms, using noisy value estimates to supervise policy evaluation and optimization is detrimental to the sample efficiency.

Continuous Control reinforcement-learning +1

Paper
Code

Batch Inverse-Variance Weighting: Deep Heteroscedastic Regression

no code implementations • 9 Jul 2021 • Vincent Mai, Waleed Khamies, Liam Paull

In many situations however, the labelling process is able to estimate the variance of such distribution for each label, which can be used as an additional information to mitigate this impact.

regression

Paper
Add Code

Batch Inverse-Variance Weighting: Deep Heteroscedastic Regression using Privileged Information

no code implementations • 1 Jan 2021 • Vincent Mai, Waleed Khamies, Liam Paull

In this work, we consider this setting and additionally assume that the label generating process is able to provide us with a quantity for the role of each label in the misalignment between the datasets, which we consider to be privileged information.

regression

Paper
Add Code

Deep Active Localization

1 code implementation • 5 Mar 2019 • Sai Krishna, Keehong Seo, Dhaivat Bhatt, Vincent Mai, Krishna Murthy, Liam Paull

Traditional approaches to this use an information-theoretic criterion for action selection and hand-crafted perceptual models.

OpenAI Gym

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.