Search Results for author: Pihe Hu

Found 4 papers, 2 papers with code

Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback

no code implementations • 6 Jul 2023 • Yu Chen, Yihan Du, Pihe Hu, Siwei Wang, Desheng Wu, Longbo Huang

Risk-sensitive reinforcement learning (RL) aims to optimize policies that balance the expected reward and risk.

Paper
Add Code

Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning

1 code implementation • 30 Aug 2022 • Pihe Hu, Ling Pan, Yu Chen, Zhixuan Fang, Longbo Huang

Multi-user delay constrained scheduling is important in many real-world applications including wireless communication, live streaming, and cloud computing.

Cloud Computing reinforcement-learning +2

Paper
Code

Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation

no code implementations • 23 Jun 2022 • Pihe Hu, Yu Chen, Longbo Huang

We study reinforcement learning with linear function approximation where the transition probability and reward functions are linear with respect to a feature mapping $\boldsymbol{\phi}(s, a)$.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch

1 code implementation • 30 May 2022 • Yiqin Tan, Pihe Hu, Ling Pan, Jiatai Huang, Longbo Huang

Training deep reinforcement learning (DRL) models usually requires high computation costs.

Continuous Control Knowledge Distillation +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.