no code implementations • 6 Jan 2023 • Vincent Mai, Philippe Maisonneuve, Tianyu Zhang, Hadi Nekoei, Liam Paull, Antoine Lesage-Landry
To integrate high amounts of renewable energy resources, electrical power grids must be able to cope with high amplitude, fast timescale variations in power generation.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • ICLR 2022 • Vincent Mai, Kaustubh Mani, Liam Paull
In model-free deep reinforcement learning (RL) algorithms, using noisy value estimates to supervise policy evaluation and optimization is detrimental to the sample efficiency.
no code implementations • 9 Jul 2021 • Vincent Mai, Waleed Khamies, Liam Paull
In many situations however, the labelling process is able to estimate the variance of such distribution for each label, which can be used as an additional information to mitigate this impact.
no code implementations • 1 Jan 2021 • Vincent Mai, Waleed Khamies, Liam Paull
In this work, we consider this setting and additionally assume that the label generating process is able to provide us with a quantity for the role of each label in the misalignment between the datasets, which we consider to be privileged information.
1 code implementation • 5 Mar 2019 • Sai Krishna, Keehong Seo, Dhaivat Bhatt, Vincent Mai, Krishna Murthy, Liam Paull
Traditional approaches to this use an information-theoretic criterion for action selection and hand-crafted perceptual models.