1 code implementation • NeurIPS 2020 • Jakub Tarnawski, Amar Phanishayee, Nikhil R. Devanur, Divya Mahajan, Fanny Nina Paravecino
However, for such settings (large models and multiple heterogeneous devices), we require automated algorithms and toolchains that can partition the ML workload across devices.
no code implementations • 26 May 2017 • Sébastien Bubeck, Nikhil R. Devanur, Zhiyi Huang, Rad Niazadeh
For the online posted pricing problem, we show regret bounds that scale with the best fixed price, rather than the range of the values.
no code implementations • NeurIPS 2016 • Shipra Agrawal, Nikhil R. Devanur
We consider the linear contextual bandit problem with resource consumption, in addition to reward generation.
no code implementations • 10 Jun 2015 • Shipra Agrawal, Nikhil R. Devanur, Lihong Li
This problem was introduced by Badanidiyuru et al. (2014), who gave a computationally inefficient algorithm with near-optimal regret bounds for it.
no code implementations • 28 Oct 2014 • Shipra Agrawal, Nikhil R. Devanur
We introduce the online stochastic Convex Programming (CP) problem, a very general version of stochastic online problems which allows arbitrary concave objectives and convex feasibility constraints.
no code implementations • 24 Feb 2014 • Shipra Agrawal, Nikhil R. Devanur
In this paper, we consider a very general model for exploration-exploitation tradeoff which allows arbitrary concave rewards and convex constraints on the decisions across time, in addition to the customary limitation on the time horizon.