Search Results for author: Joe Suk

Found 3 papers, 1 papers with code

Optimal and Adaptive Non-Stationary Dueling Bandits Under a Generalized Borda Criterion

no code implementations19 Mar 2024 Joe Suk, Arpit Agarwal

In dueling bandits, the learner receives preference feedback between arms, and the regret of an arm is defined in terms of its suboptimality to a winner arm.

When Can We Track Significant Preference Shifts in Dueling Bandits?

1 code implementation NeurIPS 2023 Joe Suk, Arpit Agarwal

Specifically, we study the recent notion of significant shifts (Suk and Kpotufe, 2022), and ask whether one can design an adaptive algorithm for the dueling problem with $O(\sqrt{K\tilde{L}T})$ dynamic regret, where $\tilde{L}$ is the (unknown) number of significant shifts in preferences.

Information Retrieval Recommendation Systems +1

Tracking Most Significant Arm Switches in Bandits

no code implementations27 Dec 2021 Joe Suk, Samory Kpotufe

In bandit with distribution shifts, one aims to automatically adapt to unknown changes in reward distribution, and restart exploration when necessary.

Cannot find the paper you are looking for? You can Submit a new open access paper.