Search Results for author: Guang Huang

Found 1 papers, 1 papers with code

Successive Convex Approximation Based Off-Policy Optimization for Constrained Reinforcement Learning

1 code implementation26 May 2021 Chang Tian, An Liu, Guang Huang, Wu Luo

We propose a successive convex approximation based off-policy optimization (SCAOPO) algorithm to solve the general constrained reinforcement learning problem, which is formulated as a constrained Markov decision process (CMDP) in the context of average cost.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.