Search Results for author: Joe Suk

Found 3 papers, 1 papers with code

Optimal and Adaptive Non-Stationary Dueling Bandits Under a Generalized Borda Criterion

no code implementations • 19 Mar 2024 • Joe Suk, Arpit Agarwal

In dueling bandits, the learner receives preference feedback between arms, and the regret of an arm is defined in terms of its suboptimality to a winner arm.

Paper
Add Code

When Can We Track Significant Preference Shifts in Dueling Bandits?

1 code implementation • NeurIPS 2023 • Joe Suk, Arpit Agarwal

Specifically, we study the recent notion of significant shifts (Suk and Kpotufe, 2022), and ask whether one can design an adaptive algorithm for the dueling problem with $O(\sqrt{K\tilde{L}T})$ dynamic regret, where $\tilde{L}$ is the (unknown) number of significant shifts in preferences.

Information Retrieval Recommendation Systems +1

Paper
Code

Tracking Most Significant Arm Switches in Bandits

no code implementations • 27 Dec 2021 • Joe Suk, Samory Kpotufe

In bandit with distribution shifts, one aims to automatically adapt to unknown changes in reward distribution, and restart exploration when necessary.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.