Search Results for author: Priyank Agrawal

Found 5 papers, 0 papers with code

A Tractable Online Learning Algorithm for the Multinomial Logit Contextual Bandit

no code implementations28 Nov 2020 Priyank Agrawal, Theja Tulabandhula, Vashist Avadhanula

In this paper, we propose an optimistic algorithm and show that the regret is bounded by $O(\sqrt{dT} + \kappa)$, significantly improving the performance over existing methods.

Decision Making Multi-Armed Bandits

Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect

no code implementations18 Jun 2020 Priyank Agrawal, Theja Tulabandhula

We study the effect of persistence of engagement on learning in a stochastic multi-armed bandit setting.

Decision Making Multi-Armed Bandits +2

Incentivising Exploration and Recommendations for Contextual Bandits with Payments

no code implementations22 Jan 2020 Priyank Agrawal, Theja Tulabandhula

We propose a contextual bandit based model to capture the learning and social welfare goals of a web platform in the presence of myopic users.

Multi-Armed Bandits

Bandits with Temporal Stochastic Constraints

no code implementations22 Nov 2018 Priyank Agrawal, Theja Tulabandhula

We study the effect of impairment on stochastic multi-armed bandits and develop new ways to mitigate it.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.