no code implementations • 1 Jan 2021 • Tom Zahavy, Ofir Nabati, Leor Cohen, Shie Mannor
We study neural-linear bandits for solving problems where both exploration and representation learning play an important role.
Efficient Exploration • Multi-Armed Bandits +2