Search Results for author: Guang Huang

Found 1 paper, 1 paper with code

Successive Convex Approximation Based Off-Policy Optimization for Constrained Reinforcement Learning

1 code implementation • 26 May 2021 • Chang Tian, An Liu, Guang Huang, Wu Luo

We propose a successive convex approximation based off-policy optimization (SCAOPO) algorithm to solve the general constrained reinforcement learning problem, which is formulated as a constrained Markov decision process (CMDP) in the context of average cost.

Reinforcement Learning (RL)

