Search Results for author: Mingwei Yang

Found 1 paper, 0 papers with code

On the Re-Solving Heuristic for (Binary) Contextual Bandits with Knapsacks

no code implementations • 25 Nov 2022 • Rui Ai, Zhaohua Chen, Xiaotie Deng, Yuqi Pan, Chang Wang, Mingwei Yang

To the best of our knowledge, this is the first $\widetilde O(1)$ regret result in the CBwK problem regardless of information feedback models.

Management, Multi-Armed Bandits
