no code implementations • 12 Feb 2024 • Rachitesh Kumar, Jon Schneider, Balasubramanian Sivan
Concretely, we show that our algorithms achieve $O(\sqrt{T})$ regret when the highest competing bids are generated adversarially, and show that no online algorithm can do better.
no code implementations • 3 Feb 2023 • Christoph Dann, Yishay Mansour, Mehryar Mohri, Jon Schneider, Balasubramanian Sivan
We then use that to show, modulo mild normalization assumptions, that there exists an $\ell_\infty$-approachability algorithm whose convergence is independent of the dimension of the original vectorial payoff.
no code implementations • 3 Feb 2023 • Santiago Balseiro, Rachitesh Kumar, Vahab Mirrokni, Balasubramanian Sivan, Di Wang
Given the inherent non-stationarity in an advertiser's value and also competing advertisers' values over time, a commonly used approach is to learn a target expenditure plan that specifies a target spend as a function of time, and then run a controller that tracks this plan.
no code implementations • 17 May 2022 • Yishay Mansour, Mehryar Mohri, Jon Schneider, Balasubramanian Sivan
We study repeated two-player games where one of the players, the learner, employs a no-regret learning strategy, while the other, the optimizer, is a rational utility maximizer.
no code implementations • 12 Feb 2022 • Santiago R. Balseiro, Haihao Lu, Vahab Mirrokni, Balasubramanian Sivan
As a byproduct of our proofs, we provide the first regret bound for CMD for non-smooth convex optimization, which might be of independent interest.
no code implementations • 8 Jun 2021 • Renato Paes Leme, Balasubramanian Sivan, Yifeng Teng, Pratik Worah
In the Learning to Price setting, a seller posts prices over time with the goal of maximizing revenue while learning the buyer's valuation.
no code implementations • NeurIPS 2019 • Yuan Deng, Jon Schneider, Balasubramanian Sivan
We show that even in this prior-free setting, it is possible to extract a $(1-\varepsilon)$-approximation of the full economic surplus for any $\varepsilon > 0$.
no code implementations • 11 Jul 2016 • Nick Gravin, Yuval Peres, Balasubramanian Sivan
We study the fundamental problem of prediction with expert advice and develop regret lower bounds for a large family of algorithms for this problem.
no code implementations • 10 Sep 2014 • Nick Gravin, Yuval Peres, Balasubramanian Sivan
Further, we show that the optimal algorithm for $2$ and $3$ experts is a probability matching algorithm (analogous to Thompson sampling) against a particular randomized adversary.