no code implementations • 28 Feb 2024 • Tonghe Zhang, Yu Chen, Longbo Huang
This work pioneers regret analysis of risk-sensitive reinforcement learning in partially observable environments with hindsight observation, addressing a gap in theoretical exploration.