Search Results for author: MohammadJavad Azizi

Found 3 papers, 2 papers with code

Meta-Learning for Simple Regret Minimization

1 code implementation • 25 Feb 2022 • MohammadJavad Azizi, Branislav Kveton, Mohammad Ghavamzadeh, Sumeet Katariya

The Bayesian algorithm has access to a prior distribution over the meta-parameters and its meta simple regret over $m$ bandit tasks with horizon $n$ is mere $\tilde{O}(m / \sqrt{n})$.

Meta-Learning

Paper
Code

Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

1 code implementation • 25 Feb 2022 • MohammadJavad Azizi, Thang Duong, Yasin Abbasi-Yadkori, András György, Claire Vernade, Mohammad Ghavamzadeh

We study a sequential decision problem where the learner faces a sequence of $K$-armed bandit tasks.

Meta-Learning

Paper
Code

Guaranteed Fixed-Confidence Best Arm Identification in Multi-Armed Bandits: Simple Sequential Elimination Algorithms

no code implementations • 12 Jun 2021 • MohammadJavad Azizi, Sheldon M Ross, Zhengyu Zhang

We propose to use the classical "vector at a time" (VT) rule, which samples each remaining arm once in each round.

Multi-Armed Bandits

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.