Search Results for author: Nikki Lijing Kuang

Found 7 papers, 0 papers with code

Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning

no code implementations • 15 Jun 2023 • Amin Karbasi, Nikki Lijing Kuang, Yi-An Ma, Siddharth Mitra

Thompson sampling (TS) is widely used in sequential decision making due to its ease of use and appealing empirical performance.

Decision Making Multi-Armed Bandits +3

Paper
Add Code

Statistical and Computational Trade-offs in Variational Inference: A Case Study in Inferential Model Selection

no code implementations • 22 Jul 2022 • Kush Bhatia, Nikki Lijing Kuang, Yi-An Ma, Yixin Wang

Focusing on Gaussian inferential models (or variational approximating families) with diagonal plus low-rank precision matrices, we initiate a theoretical study of the trade-offs in two aspects, Bayesian posterior inference error and frequentist uncertainty quantification error.

Bayesian Inference Computational Efficiency +4

Paper
Add Code

Analysis of Evolutionary Behavior in Self-Learning Media Search Engines

no code implementations • 22 Nov 2019 • Nikki Lijing Kuang, Clement H. C. Leung

In a SelfLearning Search Engine architecture, the subtle nuances of human perceptions and deep knowledge are taught and captured through unsupervised reinforcement learning, where the degree of reinforcement may be suitably calibrated.

reinforcement-learning Reinforcement Learning (RL) +3

Paper
Add Code

Performance Effectiveness of Multimedia Information Search Using the Epsilon-Greedy Algorithm

no code implementations • 22 Nov 2019 • Nikki Lijing Kuang, Clement H. C. Leung

However, by systematically capturing and analyzing the feedback patterns of human users, vital information concerning the multimedia contents can be harvested for effective indexing and subsequent search.

Retrieval

Paper
Add Code

Leveraging Reinforcement Learning Techniques for Effective Policy Adoption and Validation

no code implementations • 21 Jun 2019 • Nikki Lijing Kuang, Clement H. C. Leung

Rewards and punishments in different forms are pervasive and present in a wide variety of decision-making scenarios.

Decision Making reinforcement-learning +1

Paper
Add Code

Performance Dynamics and Termination Errors in Reinforcement Learning: A Unifying Perspective

no code implementations • 11 Feb 2019 • Nikki Lijing Kuang, Clement H. C. Leung

A situation that often calls for learning termination is when the number of negative rewards exceeds the number of positive rewards.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Stochastic Reinforcement Learning

no code implementations • 11 Feb 2019 • Nikki Lijing Kuang, Clement H. C. Leung, Vienne W. K. Sung

In reinforcement learning episodes, the rewards and punishments are often non-deterministic, and there are invariably stochastic elements governing the underlying situation.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.