Search Results for author: Sudeep Salgia

Found 11 papers, 1 papers with code

Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness

no code implementations20 Feb 2024 Nikola Pavlovic, Sudeep Salgia, Qing Zhao

Agents can share information through a central server, with the objective of minimizing regret that is accumulating over time $T$ and aggregating over agents.

A Communication-Efficient Adaptive Algorithm for Federated Learning under Cumulative Regret

no code implementations21 Jan 2023 Sudeep Salgia, Qing Zhao, Tamir Gabay, Kobi Cohen

We develop a distributed online learning algorithm that achieves order-optimal cumulative regret with low communication cost measured in the total number of bits transmitted over the entire learning horizon.

Federated Learning Stochastic Optimization

Distributed Linear Bandits under Communication Constraints

no code implementations4 Nov 2022 Sudeep Salgia, Qing Zhao

We consider distributed linear bandits where $M$ agents learn collaboratively to minimize the overall cumulative regret incurred by all agents.

Collaborative Learning in Kernel-based Bandits for Distributed Users

no code implementations16 Jul 2022 Sudeep Salgia, Sattar Vakili, Qing Zhao

We study collaborative learning among distributed clients facilitated by a central server.

Federated Learning

Provably and Practically Efficient Neural Contextual Bandits

no code implementations31 May 2022 Sudeep Salgia, Sattar Vakili, Qing Zhao

The non-asymptotic error bounds may be of broader interest as a tool to establish the relation between the smoothness of the activation functions in neural contextual bandits and the smoothness of the kernels in kernel bandits.

Multi-Armed Bandits

As Easy as ABC: Adaptive Binning Coincidence Test for Uniformity Testing

no code implementations12 Oct 2021 Sudeep Salgia, Qing Zhao, Lang Tong

The alternative hypothesis is a composite set of Lipschitz continuous distributions that are at least $\varepsilon$ away in $\ell_1$ distance from the uniform distribution.

Disagreement-based Active Learning in Online Settings

no code implementations19 Apr 2019 Boshuang Huang, Sudeep Salgia, Qing Zhao

We show that the proposed algorithm has a label complexity of $O(dT^{\frac{2-2\alpha}{2-\alpha}}\log^2 T)$ under a constraint of bounded regret in terms of classification errors, where $d$ is the VC dimension of the hypothesis space and $\alpha$ is the Tsybakov noise parameter.

Active Learning General Classification +1

Stochastic Gradient Descent on a Tree: an Adaptive and Robust Approach to Stochastic Convex Optimization

no code implementations17 Jan 2019 Sattar Vakili, Sudeep Salgia, Qing Zhao

Online minimization of an unknown convex function over the interval $[0, 1]$ is considered under first-order stochastic bandit feedback, which returns a random realization of the gradient of the function at each query point.

Cannot find the paper you are looking for? You can Submit a new open access paper.