no code implementations • 3 Aug 2023 • Asen Nachkov, Luchen Li, Giulia Luise, Filippo Valdettaro, Aldo Faisal
To test whether optimistic ensemble method can improve on distributional RL as did on scalar RL, by e. g. Bootstrapped DQN, we implement the BoP approach with a population of distributional actor-critics using Bayesian Distributional Policy Gradients (BDPG).
no code implementations • 20 Mar 2021 • Luchen Li, A. Aldo Faisal
Distributional Reinforcement Learning (RL) maintains the entire probability distribution of the reward-to-go, i. e. the return, providing more learning signals that account for the uncertainty associated with policy performance, which may be beneficial for trading off exploration and exploitation and policy learning in general.
no code implementations • 13 Mar 2020 • Luchen Li, Ignacio Albert-Smet, Aldo A. Faisal
Our aim is to establish a framework where reinforcement learning (RL) of optimizing interventions retrospectively allows us a regulatory compliant pathway to prospective clinical testing of the learned policies in a clinical deployment.
no code implementations • 17 May 2019 • Luchen Li, Matthieu Komorowski, Aldo A. Faisal
Health-related data is noisy and stochastic in implying the true physiological states of patients, limiting information contained in single-moment observations for sequential clinical decision making.
no code implementations • 29 May 2018 • Luchen Li, Matthieu Komorowski, Aldo A. Faisal
We capture this situation with partially observable Markov decision process, in which an agent optimises its actions in a belief represented as a distribution of patient states inferred from individual history trajectories.