Search Results for author: Bowen Weng

Found 6 papers, 0 papers with code

Momentum Q-learning with Finite-Sample Convergence Guarantee

no code implementations30 Jul 2020 Bowen Weng, Huaqing Xiong, Lin Zhao, Yingbin Liang, Wei zhang

For the infinite state-action space case, we establish the convergence guarantee for MomentumQ with linear function approximations and Markovian sampling.

Q-Learning

Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent

no code implementations15 Jul 2020 Bowen Weng, Huaqing Xiong, Yingbin Liang, Wei zhang

In this paper, we first characterize the convergence rate for Q-AMSGrad, which is the Q-learning algorithm with AMSGrad update (a commonly adopted alternative of Adam for theoretical analysis).

Atari Games Q-Learning

History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

no code implementations ICML 2020 Kaiyi Ji, Zhe Wang, Bowen Weng, Yi Zhou, Wei zhang, Yingbin Liang

In this paper, we propose a novel scheme, which eliminates backtracking line search but still exploits the information along optimization path by adapting the batch size via history stochastic gradients.

Hybrid Zero Dynamics Inspired Feedback Control Policy Design for 3D Bipedal Locomotion using Reinforcement Learning

no code implementations3 Oct 2019 Guillermo A. Castillo, Bowen Weng, Wei zhang, Ayonga Hereid

This paper presents a novel model-free reinforcement learning (RL) framework to design feedback control policies for 3D bipedal walking.

Reinforcement Learning (RL)

CAN ALTQ LEARN FASTER: EXPERIMENTS AND THEORY

no code implementations25 Sep 2019 Bowen Weng, Huaqing Xiong, Yingbin Liang, Wei zhang

Differently from the popular Deep Q-Network (DQN) learning, Alternating Q-learning (AltQ) does not fully fit a target Q-function at each iteration, and is generally known to be unstable and inefficient.

Atari Games Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.