no code implementations • 29 Aug 2023 • Xueping Gong, Jiheng Zhang
In this paper, we investigate the stochastic contextual bandit problem with a general function space and graph feedback.
no code implementations • 7 Aug 2023 • Xueping Gong, Jiheng Zhang
We then show how causal bounds can be applied to improve classical bandit algorithms and how they affect the regret with respect to the sizes of the action set and the function space.
no code implementations • 7 Sep 2022 • Xueping Gong, Jiheng Zhang
The contextual bandit problem is a theoretically justified framework with wide applications in various fields.