Search Results for author: Mao Hong

Found 2 papers, 0 papers with code

MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning

no code implementations21 Jan 2024 Mao Hong, Zhiyue Zhang, Yue Wu, Yanxun Xu

Model-based offline reinforcement learning methods (RL) have achieved state-of-the-art performance in many decision-making problems thanks to their sample efficiency and generalizability.

Decision Making Offline RL +1

A Policy Gradient Method for Confounded POMDPs

no code implementations26 May 2023 Mao Hong, Zhengling Qi, Yanxun Xu

To the best of our knowledge, this is the first work studying the policy gradient method for POMDPs under the offline setting.

Cannot find the paper you are looking for? You can Submit a new open access paper.