no code implementations • ICML 2017 • Bangrui Chen, Peter I. Frazier
We consider online content recommendation with implicit feedback through pairwise comparisons, formalized as the so-called dueling bandit problem.
no code implementations • 30 May 2016 • Bangrui Chen, Peter I. Frazier
We present a Bayesian sequential decision-making formulation of the information filtering problem, in which an algorithm presents items (news articles, scientific papers, tweets) arriving in a stream, and learns relevance from user feedback on presented items.
no code implementations • 28 May 2016 • Bangrui Chen, Peter I. Frazier
We study dueling bandits with weak utility-based regret when preferences over arms have a total order and carry observable feature vectors.