Search Results for author: Nima Hamidi

Found 8 papers, 3 papers with code

The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling

no code implementations16 Feb 2021 Nima Hamidi, Mohsen Bayati

The elliptical potential lemma is a key tool for quantifying uncertainty in estimating parameters of the reward function, but it requires the noise and the prior distributions to be Gaussian.

Decision Making LEMMA +1

Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms

1 code implementation NeurIPS 2020 Mohsen Bayati, Nima Hamidi, Ramesh Johari, Khashayar Khosravi

We study the structure of regret-minimizing policies in the {\em many-armed} Bayesian multi-armed bandit problem: in particular, with $k$ the number of arms and $T$ the time horizon, we consider the case where $k \geq \sqrt{T}$.

Multi-Armed Bandits

On Frequentist Regret of Linear Thompson Sampling

no code implementations11 Jun 2020 Nima Hamidi, Mohsen Bayati

This paper studies the stochastic linear bandit problem, where a decision-maker chooses actions from possibly time-dependent sets of vectors in $\mathbb{R}^d$ and receives noisy rewards.

Thompson Sampling

The Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms

2 code implementations24 Feb 2020 Mohsen Bayati, Nima Hamidi, Ramesh Johari, Khashayar Khosravi

This finding diverges from the notion of free exploration, which relates to covariate variation, as recently discussed in contextual bandit literature.

Multi-Armed Bandits

A General Theory of the Stochastic Linear Bandit and Its Applications

no code implementations12 Feb 2020 Nima Hamidi, Mohsen Bayati

First, our new notion of optimism in expectation gives rise to a new algorithm, called sieved greedy (SG) that reduces the overexploration problem in OFUL.

Multi-Armed Bandits Thompson Sampling

Personalizing Many Decisions with High-Dimensional Covariates

no code implementations NeurIPS 2019 Nima Hamidi, Mohsen Bayati, Kapil Gupta

We consider the k-armed stochastic contextual bandit problem with d dimensional features, when both k and d can be large.

Vocal Bursts Intensity Prediction

Multi-scale Embedded CNN for Music Tagging (MsE-CNN)

no code implementations16 Jun 2019 Nima Hamidi, Mohsen Vahidzadeh, Stephen Baek

Convolutional neural networks (CNN) recently gained notable attraction in a variety of machine learning tasks: including music classification and style tagging.

BIG-bench Machine Learning General Classification +2

On Low-rank Trace Regression under General Sampling Distribution

1 code implementation18 Apr 2019 Nima Hamidi, Mohsen Bayati

In this paper, we study the trace regression when a matrix of parameters B* is estimated via the convex relaxation of a rank-regularized regression or via regularized non-convex optimization.

Matrix Completion Multi-Task Learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.