9 Aug 2014 • Alina Beygelzimer, John Langford, Yuri Lifshits, Gregory Sorkin, Alexander L. Strehl
We consider the problem of estimating the conditional probability of a label in time O(log n), where n is the number of possible labels.
NeurIPS 2008 • Sharad Goel, John Langford, Alexander L. Strehl
We tackle the computational problem of query-conditioned search.
NeurIPS 2007 • Alexander L. Strehl, Michael L. Littman
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting.