NeurIPS 2016 • Artem Sokolov, Julia Kreutzer, Christopher Lo, Stefan Riezler
Stochastic structured prediction under bandit feedback follows a learning protocol in which, at each iteration, the learner receives an input, predicts an output structure, and receives partial feedback in the form of a task loss evaluation of the predicted structure.
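The protocol can be made concrete with a small sketch. The Python below is an illustration under stated assumptions, not the authors' implementation: it assumes a log-linear model over a finite candidate set, samples one output per iteration, observes only the loss of that sampled output, and applies a score-function gradient estimate of the expected loss. The names `run_bandit_protocol`, `candidates`, `features`, and `task_loss`, as well as the toy labelling task, are hypothetical and chosen only for this example.

```python
import math
import random


def softmax(scores):
    """Turn raw scores into a probability distribution (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def run_bandit_protocol(inputs, candidates, features, task_loss, dim, lr=0.5, seed=0):
    """Run one pass of the bandit protocol with a log-linear model (assumed here)."""
    rng = random.Random(seed)
    w = [0.0] * dim
    losses = []
    for x in inputs:
        # 1. The learner receives an input and scores every candidate output.
        ys = candidates(x)
        feats = [features(x, y) for y in ys]
        probs = softmax([sum(wk * fk for wk, fk in zip(w, f)) for f in feats])
        # 2. It predicts by sampling a single output structure.
        idx = rng.choices(range(len(ys)), weights=probs)[0]
        # 3. It observes the task loss of that one output only (partial feedback).
        loss = task_loss(x, ys[idx])
        losses.append(loss)
        # 4. Score-function update: loss * (phi(x, y) - E[phi(x, .)]).
        expected = [sum(p * f[k] for p, f in zip(probs, feats)) for k in range(dim)]
        w = [wk - lr * loss * (feats[idx][k] - expected[k]) for k, wk in enumerate(w)]
    return w, losses


# Hypothetical toy task: the correct output for input x is the label x itself.
NUM_LABELS = 3


def candidates(x):
    return list(range(NUM_LABELS))


def features(x, y):
    # One-hot indicator for the (input, output) pair.
    phi = [0.0] * (NUM_LABELS * NUM_LABELS)
    phi[x * NUM_LABELS + y] = 1.0
    return phi


def task_loss(x, y):
    # 0/1 loss; the learner never sees which label would have been correct.
    return 0.0 if y == x else 1.0


inputs = [i % NUM_LABELS for i in range(300)]
w, losses = run_bandit_protocol(inputs, candidates, features, task_loss,
                                dim=NUM_LABELS * NUM_LABELS)
```

Note that the update uses only the scalar loss of the sampled output; the correct structure is never revealed, which is what distinguishes this setting from full-information structured learning.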