Search Results for author: Keqin Liu

Found 4 papers, 1 paper with code

Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training

1 code implementation • NeurIPS 2023 • Yefan Zhou, Tianyu Pang, Keqin Liu, Charles H. Martin, Michael W. Mahoney, Yaoqing Yang

In particular, the learning rate, which can be interpreted as a temperature-like parameter within the statistical mechanics of learning, plays a crucial role in neural network training.


PCL-Indexability and Whittle Index for Restless Bandits with General Observation Models

no code implementations • 6 Jul 2023 • Keqin Liu, Chengzhong Zhang

In this paper, we consider a general observation model for restless multi-armed bandit problems.

The Extended UCB Policies for Frequentist Multi-armed Bandit Problems

no code implementations • 8 Dec 2011 • Keqin Liu, Haoran Chen, Weibing Deng, Ting Wu

The multi-armed bandit (MAB) problem is a widely studied model in the field of reinforcement learning.
