no code implementations • NeurIPS 2019 • Nirandika Wanigasekara, Christina Yu
Consider a nonparametric contextual multi-arm bandit problem where each arm $a \in [K]$ is associated to a nonparametric reward function $f_a: [0, 1] \to \mathbb{R}$ mapping from contexts to the expected reward.
no code implementations • 3 Aug 2019 • Nirandika Wanigasekara, Christina Lee Yu
Consider a nonparametric contextual multi-arm bandit problem where each arm $a \in [K]$ is associated to a nonparametric reward function $f_a: [0, 1] \to \mathbb{R}$ mapping from contexts to the expected reward.