Search Results for author: Nikhil R. Devanur

Found 6 papers, 1 papers with code

Efficient Algorithms for Device Placement of DNN Graph Operators

1 code implementation • NeurIPS 2020 • Jakub Tarnawski, Amar Phanishayee, Nikhil R. Devanur, Divya Mahajan, Fanny Nina Paravecino

However, for such settings (large models and multiple heterogeneous devices), we require automated algorithms and toolchains that can partition the ML workload across devices.

Paper
Code

Multi-scale Online Learning and its Applications to Online Auctions

no code implementations • 26 May 2017 • Sébastien Bubeck, Nikhil R. Devanur, Zhiyi Huang, Rad Niazadeh

For the online posted pricing problem, we show regret bounds that scale with the best fixed price, rather than the range of the values.

Paper
Add Code

Linear Contextual Bandits with Knapsacks

no code implementations • NeurIPS 2016 • Shipra Agrawal, Nikhil R. Devanur

We consider the linear contextual bandit problem with resource consumption, in addition to reward generation.

Multi-Armed Bandits

Paper
Add Code

An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives

no code implementations • 10 Jun 2015 • Shipra Agrawal, Nikhil R. Devanur, Lihong Li

This problem was introduced by Badanidiyuru et al. (2014), who gave a computationally inefficient algorithm with near-optimal regret bounds for it.

Multi-Armed Bandits Open-Ended Question Answering

Paper
Add Code

Fast Algorithms for Online Stochastic Convex Programming

no code implementations • 28 Oct 2014 • Shipra Agrawal, Nikhil R. Devanur

We introduce the online stochastic Convex Programming (CP) problem, a very general version of stochastic online problems which allows arbitrary concave objectives and convex feasibility constraints.

Paper
Add Code

Bandits with concave rewards and convex knapsacks

no code implementations • 24 Feb 2014 • Shipra Agrawal, Nikhil R. Devanur

In this paper, we consider a very general model for exploration-exploitation tradeoff which allows arbitrary concave rewards and convex constraints on the decisions across time, in addition to the customary limitation on the time horizon.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.