no code implementations • 12 Oct 2022 • Kensei Morijiri, Kento Takehana, Takatomo Mihana, Kazutaka Kanno, Makoto Naruse, Atsushi Uchida
We solve a 512-armed bandit problem online, which is much larger than previous experiments by two orders of magnitude.