no code implementations • 9 Aug 2014 • Stephane Ross, Paul Mineiro, John Langford
We introduce online learning algorithms which are independent of feature scales, proving regret bounds dependent on the ratio of scales existent in the data rather than the absolute scale.
no code implementations • 23 Jun 2014 • Stephane Ross, J. Andrew Bagnell
Recent work has demonstrated that problems-- particularly imitation learning and structured prediction-- where a learner's predictions influence the input-distribution it is tested on can be naturally addressed by an interactive approach and analyzed using no-regret online learning.
no code implementations • NeurIPS 2016 • Kai-Wei Chang, He He, Hal Daumé III, John Langford, Stephane Ross
Many machine learning applications involve jointly predicting multiple mutually dependent output variables.
no code implementations • 16 Aug 2013 • Jiaji Zhou, Stephane Ross, Yisong Yue, Debadeepta Dey, J. Andrew Bagnell
We study the problem of predicting a set or list of options under knapsack constraint.
1 code implementation • 28 May 2013 • Stephane Ross, Paul Mineiro, John Langford
We introduce online learning algorithms which are independent of feature scales, proving regret bounds dependent on the ratio of scales existent in the data rather than the absolute scale.
no code implementations • 11 May 2013 • Stephane Ross, Jiaji Zhou, Yisong Yue, Debadeepta Dey, J. Andrew Bagnell
Many prediction domains, such as ad placement, recommendation, trajectory prediction, and document summarization, require predicting a set or list of options.
3 code implementations • 2 Nov 2010 • Stephane Ross, Geoffrey J. Gordon, J. Andrew Bagnell
Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i. i. d.
no code implementations • NeurIPS 2007 • Stephane Ross, Joelle Pineau, Brahim Chaib-Draa
The algorithm uses search heuristics based on an error analysis of lookahead search, to guide the online search towards reachable beliefs with the most potential to reduce error.