no code implementations • 16 Aug 2023 • Ziteng Cheng, Anthony Coache, Sebastian Jaimungal
Specifically, we prove that the agent's risk aversion can be identified as the number of questions tends to infinity, and the questions are randomly designed.
1 code implementation • 29 Jun 2022 • Anthony Coache, Sebastian Jaimungal, Álvaro Cartea
We propose a novel framework to solve risk-sensitive reinforcement learning (RL) problems where the agent optimises time-consistent dynamic spectral risk measures.
1 code implementation • 26 Dec 2021 • Anthony Coache, Sebastian Jaimungal
We develop an approach for solving time-consistent risk-sensitive stochastic optimization problems using model-free reinforcement learning (RL).