no code implementations • 7 Sep 2019 • Praneet Dutta, Joe Cheuk, Jonathan S Kim, Massimo Mascaro
We see that our model is able to perform much better than random exploration, being more regret efficient and able to converge with a limited number of samples, while remaining very general and easy to use due to the meta-learning approach.