no code implementations • 15 Jun 2023 • Amin Karbasi, Nikki Lijing Kuang, Yi-An Ma, Siddharth Mitra
Thompson sampling (TS) is widely used in sequential decision making due to its ease of use and appealing empirical performance.
no code implementations • 22 Jul 2022 • Kush Bhatia, Nikki Lijing Kuang, Yi-An Ma, Yixin Wang
Focusing on Gaussian inferential models (or variational approximating families) with diagonal plus low-rank precision matrices, we initiate a theoretical study of the trade-offs in two aspects, Bayesian posterior inference error and frequentist uncertainty quantification error.
no code implementations • 22 Nov 2019 • Nikki Lijing Kuang, Clement H. C. Leung
In a SelfLearning Search Engine architecture, the subtle nuances of human perceptions and deep knowledge are taught and captured through unsupervised reinforcement learning, where the degree of reinforcement may be suitably calibrated.
no code implementations • 22 Nov 2019 • Nikki Lijing Kuang, Clement H. C. Leung
However, by systematically capturing and analyzing the feedback patterns of human users, vital information concerning the multimedia contents can be harvested for effective indexing and subsequent search.
no code implementations • 21 Jun 2019 • Nikki Lijing Kuang, Clement H. C. Leung
Rewards and punishments in different forms are pervasive and present in a wide variety of decision-making scenarios.
no code implementations • 11 Feb 2019 • Nikki Lijing Kuang, Clement H. C. Leung
A situation that often calls for learning termination is when the number of negative rewards exceeds the number of positive rewards.
no code implementations • 11 Feb 2019 • Nikki Lijing Kuang, Clement H. C. Leung, Vienne W. K. Sung
In reinforcement learning episodes, the rewards and punishments are often non-deterministic, and there are invariably stochastic elements governing the underlying situation.