Search Results for author: Mingwei Yang

Found 1 papers, 0 papers with code

On the Re-Solving Heuristic for (Binary) Contextual Bandits with Knapsacks

no code implementations25 Nov 2022 Rui Ai, Zhaohua Chen, Xiaotie Deng, Yuqi Pan, Chang Wang, Mingwei Yang

To the best of our knowledge, this is the first $\widetilde O(1)$ regret result in the CBwK problem regardless of information feedback models.

Management Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.