no code implementations • 28 Oct 2023 • Shima Alizadeh, Aniruddha Bhargava, Karthick Gopalswamy, Lalit Jain, Branislav Kveton, Ge Liu
The pessimistic estimator can be optimized by policy gradients and performs well in all of our experiments.
no code implementations • 9 Mar 2019 • Urvashi Oswal, Aniruddha Bhargava, Robert Nowak
In comparison, the regret of traditional linear bandits is $d\sqrt{T}$, where $d$ is the total number of (relevant and irrelevant) features, so the improvement can be dramatic if $k\ll d$.
no code implementations • NeurIPS 2017 • Kwang-Sung Jun, Aniruddha Bhargava, Robert Nowak, Rebecca Willett
Second, for the case where the number $N$ of arms is very large, we propose new algorithms in which each next arm is selected via an inner product search.
no code implementations • 14 Mar 2016 • Aniruddha Bhargava, Ravi Ganti, Robert Nowak
In this paper we model the problem of learning preferences of a population as an active learning problem.
no code implementations • NeurIPS 2011 • Alyson K. Fletcher, Sundeep Rangan, Lav R. Varshney, Aniruddha Bhargava
Many functional descriptions of spiking neurons assume a cascade structure where inputs are passed through an initial linear filtering stage that produces a low-dimensional signal that drives subsequent nonlinear stages.