Search Results for author: Samarth Gupta

Found 11 papers, 2 papers with code

Uncertainty Informed Optimal Resource Allocation with Gaussian Process based Bayesian Inference

no code implementations • 30 Jun 2023 • Samarth Gupta, Saurabh Amin

(2) How can we computationally handle both nonlinear ODE constraints and parameter uncertainties for a generic stochastic optimization problem for resource allocation?

Bayesian Inference Gaussian Processes +1

Paper
Add Code

Bayesian regularization of empirical MDPs

no code implementations • 3 Aug 2022 • Samarth Gupta, Daniel N. Hill, Lexing Ying, Inderjit Dhillon

Due to noise, the policy learnedfrom the estimated model is often far from the optimal policy of the underlying model.

Paper
Add Code

Approximate Newton policy gradient algorithms

no code implementations • 5 Oct 2021 • Haoya Li, Samarth Gupta, HsiangFu Yu, Lexing Ying, Inderjit Dhillon

This paper proposes an approximate Newton method for the policy gradient algorithm with entropy regularization.

Paper
Add Code

Best-Arm Identification in Correlated Multi-Armed Bandits

no code implementations • 10 Sep 2021 • Samarth Gupta, Gauri Joshi, Osman Yağan

In this paper we consider the problem of best-arm identification in multi-armed bandits in the fixed confidence setting, where the goal is to identify, with probability $1-\delta$ for some $\delta>0$, the arm with the highest mean reward in minimum possible samples from the set of arms $\mathcal{K}$.

Multi-Armed Bandits

Paper
Add Code

Bandit-based Communication-Efficient Client Selection Strategies for Federated Learning

no code implementations • 14 Dec 2020 • Yae Jee Cho, Samarth Gupta, Gauri Joshi, Osman Yağan

Due to communication constraints and intermittent client availability in federated learning, only a subset of clients can participate in each training round.

Fairness Federated Learning

Paper
Add Code

Integer Programming-based Error-Correcting Output Code Design for Robust Classification

no code implementations • 30 Oct 2020 • Samarth Gupta, Saurabh Amin

We also estimate the adversarial accuracy of our ECOC-based classifiers in a white-box setting.

General Classification Robust classification

Paper
Add Code

Multi-Armed Bandits with Correlated Arms

2 code implementations • 6 Nov 2019 • Samarth Gupta, Shreyas Chaudhari, Gauri Joshi, Osman Yağan

We consider a multi-armed bandit framework where the rewards obtained by pulling different arms are correlated.

Multi-Armed Bandits

Paper
Code

A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting

no code implementations • 18 Oct 2018 • Samarth Gupta, Shreyas Chaudhari, Subhojyoti Mukherjee, Gauri Joshi, Osman Yağan

We consider a finite-armed structured bandit problem in which mean rewards of different arms are known functions of a common hidden parameter $\theta^*$.

Thompson Sampling

Paper
Add Code

Correlated Multi-armed Bandits with a Latent Random Source

2 code implementations • 17 Aug 2018 • Samarth Gupta, Gauri Joshi, Osman Yağan

As a result, there are regimes where our algorithm achieves a $\mathcal{O}(1)$ regret as opposed to the typical logarithmic regret scaling of multi-armed bandit algorithms.

Multi-Armed Bandits

Paper
Code

Active Distribution Learning from Indirect Samples

no code implementations • 16 Aug 2018 • Samarth Gupta, Gauri Joshi, Osman Yağan

At each time step, we choose one of the possible $K$ functions, $g_1, \ldots, g_K$ and observe the corresponding sample $g_i(X)$.

Privacy Preserving

Paper
Add Code

Request Patterns and Caching for VoD Services with Recommendation Systems

no code implementations • 8 Sep 2016 • Samarth Gupta, Sharayu Moharir

We propose a Markovian request model to capture the time-correlation in user requests and show that our model is consistent with the observations of existing empirical studies.

Networking and Internet Architecture

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.