Search Results for author: Priyank Agrawal

Found 5 papers, 0 papers with code

A Tractable Online Learning Algorithm for the Multinomial Logit Contextual Bandit

no code implementations • 28 Nov 2020 • Priyank Agrawal, Theja Tulabandhula, Vashist Avadhanula

In this paper, we propose an optimistic algorithm and show that the regret is bounded by $O(\sqrt{dT} + \kappa)$, significantly improving the performance over existing methods.

Decision Making, Multi-Armed Bandits

Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration

no code implementations • 23 Oct 2020 • Priyank Agrawal, Jinglin Chen, Nan Jiang

This paper studies regret minimization with randomized value functions in reinforcement learning.

Reinforcement Learning (RL)

Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect

no code implementations • 18 Jun 2020 • Priyank Agrawal, Theja Tulabandhula

We study the effect of persistence of engagement on learning in a stochastic multi-armed bandit setting.

Decision Making, Multi-Armed Bandits

Incentivising Exploration and Recommendations for Contextual Bandits with Payments

no code implementations • 22 Jan 2020 • Priyank Agrawal, Theja Tulabandhula

We propose a contextual bandit based model to capture the learning and social welfare goals of a web platform in the presence of myopic users.

Multi-Armed Bandits

Bandits with Temporal Stochastic Constraints

no code implementations • 22 Nov 2018 • Priyank Agrawal, Theja Tulabandhula

We study the effect of impairment on stochastic multi-armed bandits and develop new ways to mitigate it.

Multi-Armed Bandits
