no code implementations • 26 Feb 2024 • Yongxin Xu, Shangshang Wang, Hengquan Guo, Xin Liu, Ziyu Shao
These render the reward and cost hard to model, and thus unknown before decision making.
no code implementations • 27 Nov 2022 • Hengquan Guo, Qi Zhu, Xin Liu
This paper studies the problem of stochastic continuum-armed bandit with constraints (SCBwC), where we optimize a black-box reward function $f(x)$ subject to a black-box constraint function $g(x)\leq 0$ over a continuous space $\mathcal X$.
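To make the SCBwC setting concrete, the following is a minimal sketch of the problem interface only, not the paper's algorithm. It assumes $\mathcal X = [0, 1]$, Gaussian observation noise, and hypothetical toy functions `f_true` and `g_true`; the helper `naive_constrained_search` is an illustrative uniform-discretization baseline that averages noisy samples at each grid point and keeps the best point whose estimated constraint value is non-positive.

```python
import numpy as np


def f_true(x):
    # Hypothetical black-box reward (unknown to the learner); peaks at x = 0.7.
    return -(x - 0.7) ** 2


def g_true(x):
    # Hypothetical black-box constraint; feasible region is x <= 0.5.
    return x - 0.5


def naive_constrained_search(f, g, grid, n_samples, noise_std, rng):
    """Illustrative baseline: sample every grid point, then pick the point
    with the highest estimated reward among those estimated to be feasible."""
    f_hat = np.empty(len(grid))
    g_hat = np.empty(len(grid))
    for i, x in enumerate(grid):
        # The learner only sees noisy evaluations of f and g.
        f_obs = f(x) + noise_std * rng.standard_normal(n_samples)
        g_obs = g(x) + noise_std * rng.standard_normal(n_samples)
        f_hat[i] = f_obs.mean()
        g_hat[i] = g_obs.mean()
    feasible = g_hat <= 0.0
    if not feasible.any():
        return None  # no grid point looks feasible
    masked = np.where(feasible, f_hat, -np.inf)
    return float(grid[int(np.argmax(masked))])


rng = np.random.default_rng(0)
grid = np.linspace(0.0, 1.0, 21)
best_x = naive_constrained_search(f_true, g_true, grid, 400, 0.05, rng)
print("selected x:", best_x)
```

In this toy instance the unconstrained maximizer $x = 0.7$ violates $g(x) \leq 0$, so the baseline should settle near the constraint boundary at $x = 0.5$. A point exactly on the boundary may be rejected because its estimated constraint value is noisy, which illustrates why constraint uncertainty matters in this setting.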