29 Jul 2023 • Xinyang Yi, Shao-Chuan Wang, Ruining He, Hariharan Chandrasekaran, Charles Wu, Lukasz Heldt, Lichan Hong, Minmin Chen, Ed H. Chi
In this paper, we introduce Online Matching: a scalable closed-loop bandit system that learns in real time from users' direct feedback on items.
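
To make the closed-loop idea concrete, here is a minimal sketch, assuming a plain UCB1 policy over a small fixed item set with simulated click feedback. The choice of UCB1, the names `UCB1` and `simulate`, and the click probabilities are all illustrative assumptions; the abstract does not say which bandit algorithm the system uses, and this is not a description of it.

```python
import math
import random


class UCB1:
    """Toy UCB1 bandit (an assumed stand-in): one arm per item."""

    def __init__(self, n_items):
        self.counts = [0] * n_items      # times each item was shown
        self.rewards = [0.0] * n_items   # summed feedback per item

    def select(self):
        # Show every item once before relying on confidence bounds.
        for item, count in enumerate(self.counts):
            if count == 0:
                return item
        total = sum(self.counts)
        # Score = empirical mean reward + exploration bonus.
        scores = [
            self.rewards[i] / self.counts[i]
            + math.sqrt(2.0 * math.log(total) / self.counts[i])
            for i in range(len(self.counts))
        ]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, item, reward):
        # Closing the loop: feedback changes the next selection immediately.
        self.counts[item] += 1
        self.rewards[item] += reward


def simulate(click_probs, steps, seed=0):
    """Run the select/feedback/update loop against hypothetical click rates."""
    rng = random.Random(seed)
    bandit = UCB1(len(click_probs))
    for _ in range(steps):
        item = bandit.select()
        reward = 1.0 if rng.random() < click_probs[item] else 0.0
        bandit.update(item, reward)
    return bandit.counts


if __name__ == "__main__":
    # Hypothetical click probabilities for three items.
    print(simulate([0.05, 0.5, 0.1], steps=2000))
```

Each iteration selects an item, observes feedback, and updates the statistics before the next selection, which is what "closed-loop" means here. A production system would need far more than this toy shows, such as very large item sets, delayed feedback, and distributed updates.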