Search Results for author: Aniruddha Bhargava

Found 5 papers, 0 papers with code

Pessimistic Off-Policy Multi-Objective Optimization

no code implementations28 Oct 2023 Shima Alizadeh, Aniruddha Bhargava, Karthick Gopalswamy, Lalit Jain, Branislav Kveton, Ge Liu

The pessimistic estimator can be optimized by policy gradients and performs well in all of our experiments.

Decision Making

Linear Bandits with Feature Feedback

no code implementations9 Mar 2019 Urvashi Oswal, Aniruddha Bhargava, Robert Nowak

In comparison, the regret of traditional linear bandits is $d\sqrt{T}$, where $d$ is the total number of (relevant and irrelevant) features, so the improvement can be dramatic if $k\ll d$.

Scalable Generalized Linear Bandits: Online Computation and Hashing

no code implementations NeurIPS 2017 Kwang-Sung Jun, Aniruddha Bhargava, Robert Nowak, Rebecca Willett

Second, for the case where the number $N$ of arms is very large, we propose new algorithms in which each next arm is selected via an inner product search.

Thompson Sampling

Active Algorithms For Preference Learning Problems with Multiple Populations

no code implementations14 Mar 2016 Aniruddha Bhargava, Ravi Ganti, Robert Nowak

In this paper we model the problem of learning preferences of a population as an active learning problem.

Active Learning

Neural Reconstruction with Approximate Message Passing (NeuRAMP)

no code implementations NeurIPS 2011 Alyson K. Fletcher, Sundeep Rangan, Lav R. Varshney, Aniruddha Bhargava

Many functional descriptions of spiking neurons assume a cascade structure where inputs are passed through an initial linear filtering stage that produces a low-dimensional signal that drives subsequent nonlinear stages.

Cannot find the paper you are looking for? You can Submit a new open access paper.