no code implementations • 8 Feb 2024 • Yige Hong, Qiaomin Xie, Yudong Chen, Weina Wang
We consider the infinite-horizon, average-reward restless bandit problem in discrete time.
1 code implementation • NeurIPS 2023 • Yige Hong, Qiaomin Xie, Yudong Chen, Weina Wang
In both settings, our work is the first asymptotic optimality result that does not require UGAP.