no code implementations • 29 Oct 2021 • Weici Pan, Guanya Shi, Yiheng Lin, Adam Wierman
We study a variant of online optimization in which the learner receives $k$-round $\textit{delayed feedback}$ about the hitting cost and there is a multi-step nonlinear switching cost, i.e., the cost depends on multiple previous actions in a nonlinear manner.
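To make the setting concrete, the sketch below computes the total cost of a sequence of actions and lists which hitting costs the learner has seen at a given round. It is only an illustration under stated assumptions, not the paper's formulation: the names `total_cost`, `revealed_rounds`, and `example_switching_cost`, the scalar actions, the zero padding for rounds before the start, the specific cost functions, and the convention that the hitting cost of round $s$ is known from round $s + k$ onward are all hypothetical choices.

```python
import math
from typing import Callable, List, Sequence

HittingCost = Callable[[float], float]
SwitchingCost = Callable[[float, Sequence[float]], float]


def total_cost(actions: Sequence[float],
               hitting_costs: Sequence[HittingCost],
               switching_cost: SwitchingCost,
               p: int) -> float:
    """Sum of hitting costs plus p-step switching costs over all rounds."""
    total = 0.0
    for t, y in enumerate(actions):
        # Previous p actions, most recent first; rounds before the start
        # are padded with 0.0 (an assumption made for this sketch).
        history = [actions[t - i] if t - i >= 0 else 0.0
                   for i in range(1, p + 1)]
        total += hitting_costs[t](y) + switching_cost(y, history)
    return total


def revealed_rounds(t: int, k: int) -> List[int]:
    """Rounds whose hitting cost is known when acting at round t.

    Assumed convention: the cost of round s is revealed at round s + k.
    """
    return list(range(0, max(0, t - k + 1)))


def example_switching_cost(y: float, history: Sequence[float]) -> float:
    # Hypothetical nonlinear dependence on the last two actions.
    return (y - math.tanh(history[0] + history[1])) ** 2


# Hypothetical quadratic hitting costs centered at targets 1, 2, 3.
targets = [1.0, 2.0, 3.0]
costs = [lambda y, v=v: (y - v) ** 2 for v in targets]

cost_of_staying_at_zero = total_cost([0.0, 0.0, 0.0], costs,
                                     example_switching_cost, p=2)
known_at_round_5 = revealed_rounds(5, k=2)
```

Staying at zero incurs no switching cost here, so the total is the sum of the hitting costs alone; with $k = 2$, the learner acting at round 5 would have seen the hitting costs of rounds 0 through 3 only.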