Search Results for author: Anran Hu

Found 4 papers, 0 papers with code

Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods

no code implementations • 13 Sep 2021 • Xin Guo, Anran Hu, Junzi Zhang

To the best of our knowledge, this is the first theoretical guarantee on fictitious discount algorithms for the episodic reinforcement learning of finite-time-horizon MDPs, which also leads to the (first) global convergence of policy gradient methods for finite-time-horizon episodic reinforcement learning.

Policy Gradient Methods • reinforcement-learning • +1


Reinforcement learning for linear-convex models with jumps via stability analysis of feedback controls

no code implementations • 19 Apr 2021 • Xin Guo, Anran Hu, Yufei Zhang

We study finite-time horizon continuous-time linear-convex reinforcement learning problems in an episodic setting.

Reinforcement Learning (RL)


Logarithmic regret for episodic continuous-time linear-quadratic reinforcement learning over a finite-time horizon

no code implementations • 27 Jun 2020 • Matteo Basei, Xin Guo, Anran Hu, Yufei Zhang

We study finite-time horizon continuous-time linear-quadratic reinforcement learning problems in an episodic setting, where both the state and control coefficients are unknown to the controller.

Reinforcement Learning (RL)


A General Framework for Learning Mean-Field Games

no code implementations • 13 Mar 2020 • Xin Guo, Anran Hu, Renyuan Xu, Junzi Zhang

This paper presents a general mean-field game (GMFG) framework for simultaneous learning and decision-making in stochastic games with a large population.

Decision Making • Multi-agent Reinforcement Learning • +3

