no code implementations • 6 Jul 2023 • Yu Chen, Yihan Du, Pihe Hu, Siwei Wang, Desheng Wu, Longbo Huang
Risk-sensitive reinforcement learning (RL) aims to optimize policies that balance the expected reward and risk.
1 code implementation • 30 Aug 2022 • Pihe Hu, Ling Pan, Yu Chen, Zhixuan Fang, Longbo Huang
Multi-user delay constrained scheduling is important in many real-world applications including wireless communication, live streaming, and cloud computing.
no code implementations • 23 Jun 2022 • Pihe Hu, Yu Chen, Longbo Huang
We study reinforcement learning with linear function approximation where the transition probability and reward functions are linear with respect to a feature mapping $\boldsymbol{\phi}(s, a)$.
1 code implementation • 30 May 2022 • Yiqin Tan, Pihe Hu, Ling Pan, Jiatai Huang, Longbo Huang
Training deep reinforcement learning (DRL) models usually requires high computation costs.