Search Results for author: Vaneet Aggarwal

Found 113 papers, 20 papers with code

A Proximal Jacobian ADMM Approach for Fast Massive MIMO Signal Detection in Low-Latency Communications

1 code implementation2 Mar 2019 Anis Elgabli, Ali Elghariani, Vaneet Aggarwal, *Mehdi Bennis, Mark R. Bell

We introduce an objective function that is a sum of strictly convex and separable functions based on decomposing the received vector into multiple vectors.

Information Theory Information Theory

Multi-Stage Hybrid Federated Learning over Large-Scale D2D-Enabled Fog Networks

1 code implementation18 Jul 2020 Seyyedali Hosseinalipour, Sheikh Shams Azam, Christopher G. Brinton, Nicolo Michelusi, Vaneet Aggarwal, David J. Love, Huaiyu Dai

We derive the upper bound of convergence for MH-FL with respect to parameters of the network topology (e. g., the spectral radius) and the learning algorithm (e. g., the number of D2D rounds in different clusters).

Federated Learning

Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control

1 code implementation5 Oct 2022 Jiayu Chen, Tian Lan, Vaneet Aggarwal

In this work, we develop a novel HIL algorithm based on Adversarial Inverse Reinforcement Learning and adapt it with the Expectation-Maximization algorithm in order to directly recover a hierarchical policy from the unannotated demonstrations.

Imitation Learning Multi-Task Learning +2

Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions

1 code implementation21 Feb 2024 Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal

This work offers a hands-on reference for the research progress in deep generative models for offline policy learning, and aims to inspire improved DGM-based offline RL or IL algorithms.

Imitation Learning Offline RL

Domain Adaptive Few-Shot Open-Set Learning

1 code implementation ICCV 2023 Debabrata Pal, Deeptej More, Sai Bhargav, Dipesh Tamboli, Vaneet Aggarwal, Biplab Banerjee

Few-shot learning has made impressive strides in addressing the crucial challenges of recognizing unknown samples from novel classes in target query sets and managing visual shifts between domains.

cross-domain few-shot learning Few-Shot Learning +1

DROP: Deep relocating option policy for optimal ride-hailing vehicle repositioning

1 code implementation9 Sep 2021 Xinwu Qian, Shuocheng Guo, Vaneet Aggarwal

This study proposes the deep relocating option policy (DROP) that supervises vehicle agents to escape from oversupply areas and effectively relocate to potentially underserved areas.

PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning

1 code implementation22 Jun 2022 Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Multi-agent reinforcement learning (MARL) has witnessed significant progress with the development of value function factorization methods.

counterfactual Multi-agent Reinforcement Learning +5

Multi-task Hierarchical Adversarial Inverse Reinforcement Learning

1 code implementation22 May 2023 Jiayu Chen, Dipesh Tamboli, Tian Lan, Vaneet Aggarwal

Multi-task Imitation Learning (MIL) aims to train a policy capable of performing a distribution of tasks based on multi-task expert demonstrations, which is essential for general-purpose robots.

Imitation Learning Multi-Task Learning +1

FastScan: Robust Low-Complexity Rate Adaptation Algorithm for Video Streaming over HTTP

1 code implementation7 Jun 2018 Anis Elgabli, Vaneet Aggarwal

For example, on an experiment conducted over 100 real cellular bandwidth traces of a public dataset that spans different bandwidth regimes, our proposed algorithm (FastScan) achieves the minimum re-buffering (stall) time and the maximum average playback rate in every single trace as compared to the original dash. js rate adaptation scheme, Festive, BBA, RB, and FastMPC algorithms.

Networking and Internet Architecture Multimedia

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

1 code implementation4 Jan 2022 Hanhan Zhou, Tian Lan, Vaneet Aggarwal

To this end, we present LSF-SAC, a novel framework that features a variational inference-based information-sharing mechanism as extra state information to assist individual agents in the value function factorization.

Starcraft Starcraft II +1

Hierarchical Deep Counterfactual Regret Minimization

1 code implementation27 May 2023 Jiayu Chen, Tian Lan, Vaneet Aggarwal

Imperfect Information Games (IIGs) offer robust models for scenarios where decision-makers face uncertainty or lack complete information.

counterfactual Decision Making

Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer

1 code implementation12 Mar 2024 Dipesh Tamboli, Jiayu Chen, Kiran Pranesh Jotheeswaran, Denny Yu, Vaneet Aggarwal

Sepsis, a life-threatening condition triggered by the body's exaggerated response to infection, demands urgent intervention to prevent severe complications.

Decision Making

An FEA surrogate model with Boundary Oriented Graph Embedding approach

1 code implementation30 Aug 2021 Xingyu Fu, Fengfeng Zhou, Dheeraj Peddireddy, Zhengyang Kang, Martin Byung-Guk Jun, Vaneet Aggarwal

In this work, we present a Boundary Oriented Graph Embedding (BOGE) approach for the Graph Neural Network (GNN) to serve as a general surrogate model for regressing physical fields and solving boundary value problems.

Cantilever Beam Decision Making +2

Deep Learning based Coverage and Rate Manifold Estimation in Cellular Networks

2 code implementations13 Feb 2022 Washim Uddin Mondal, Praful D. Mankar, Goutam Das, Vaneet Aggarwal, Satish V. Ukkusuri

This article proposes Convolutional Neural Network-based Auto Encoder (CNN-AE) to predict location-dependent rate and coverage probability of a network from its topology.

Principal Component Analysis with Tensor Train Subspace

no code implementations13 Mar 2018 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Tensor train is a hierarchical tensor network structure that helps alleviate the curse of dimensionality by parameterizing large-scale multidimensional data via a set of network of low-rank tensors.

Tensor Train Neighborhood Preserving Embedding

no code implementations3 Dec 2017 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

In this paper, we propose a Tensor Train Neighborhood Preserving Embedding (TTNPE) to embed multi-dimensional tensor data into low dimensional tensor subspace.

Classification Dimensionality Reduction +1

Deterministic and Probabilistic Conditions for Finite Completability of Low-Tucker-Rank Tensor

no code implementations6 Dec 2016 Morteza Ashraphijuo, Vaneet Aggarwal, Xiaodong Wang

We investigate the fundamental conditions on the sampling pattern, i. e., locations of the sampled entries, for finite completability of a low-rank tensor given some components of its Tucker rank.

Wide Compression: Tensor Ring Nets

no code implementations CVPR 2018 Wenqi Wang, Yifan Sun, Brian Eriksson, Wenlin Wang, Vaneet Aggarwal

Deep neural networks have demonstrated state-of-the-art performance in a variety of real-world applications.

Image Classification

On Deterministic Sampling Patterns for Robust Low-Rank Matrix Completion

no code implementations5 Dec 2017 Morteza Ashraphijuo, Vaneet Aggarwal, Xiaodong Wang

In this letter, we study the deterministic sampling patterns for the completion of low rank matrix, when corrupted with a sparse noise, also known as robust matrix completion.

Low-Rank Matrix Completion valid

Efficient Low Rank Tensor Ring Completion

no code implementations ICCV 2017 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Using the matrix product state (MPS) representation of the recently proposed tensor ring decompositions, in this paper we propose a tensor completion algorithm, which is an alternating minimization algorithm that alternates over the factors in the MPS representation.

Matrix Completion

Rank Determination for Low-Rank Data Completion

no code implementations3 Jul 2017 Morteza Ashraphijuo, Xiaodong Wang, Vaneet Aggarwal

Moreover, for both single-view matrix and CP tensor, we are able to show that the obtained upper bound is exactly equal to the unknown rank if the lowest-rank completion is given.

Deterministic and Probabilistic Conditions for Finite Completability of Low-rank Multi-View Data

no code implementations3 Jan 2017 Morteza Ashraphijuo, Xiaodong Wang, Vaneet Aggarwal

We provide a deterministic necessary and sufficient condition on the sampling pattern for finite completability.

Matrix Completion

Unsupervised clustering under the Union of Polyhedral Cones (UOPC) model

no code implementations15 Oct 2016 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Similar to the Union of Subspaces (UOS) model where each data from each subspace is generated from a (unknown) basis, in the UOPC model each data from each cone is assumed to be generated from a finite number of (unknown) \emph{extreme rays}. To cluster data under this model, we consider several algorithms - (a) Sparse Subspace Clustering by Non-negative constraints Lasso (NCL), (b) Least squares approximation (LSA), and (c) K-nearest neighbor (KNN) algorithm to arrive at affinity between data points.

Clustering

Low-tubal-rank Tensor Completion using Alternating Minimization

no code implementations5 Oct 2016 Xiao-Yang Liu, Shuchin Aeron, Vaneet Aggarwal, Xiaodong Wang

The low-tubal-rank tensor model has been recently proposed for real-world multidimensional data.

Low-Rank Matrix Completion

Tensor Completion by Alternating Minimization under the Tensor Train (TT) Model

no code implementations19 Sep 2016 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Using the matrix product state (MPS) representation of tensor train decompositions, in this paper we propose a tensor completion algorithm which alternates over the matrices (tensors) in the MPS representation.

Matrix Completion

On Deterministic Conditions for Subspace Clustering under Missing Data

no code implementations11 Jul 2016 Wenqi Wang, Shuchin Aeron, Vaneet Aggarwal

In this paper we present deterministic conditions for success of sparse subspace clustering (SSC) under missing data, when data is assumed to come from a Union of Subspaces (UoS) model.

Clustering

On deterministic conditions for subspace clustering under missing data

no code implementations15 Apr 2016 Wenqi Wang, Shuchin Aeron, Vaneet Aggarwal

We provide extensive set of simulation results for clustering as well as completion of data under missing entries, under the UoS model.

Clustering

Adaptive Sampling of RF Fingerprints for Fine-grained Indoor Localization

no code implementations10 Aug 2015 Xiao-Yang Liu, Shuchin Aeron, Vaneet Aggarwal, Xiaodong Wang, Min-You Wu

In contrast to several existing work that rely on random sampling, this paper shows that adaptivity in sampling can lead to significant improvements in localization accuracy.

Indoor Localization

Information-theoretic Bounds on Matrix Completion under Union of Subspaces Model

no code implementations14 Aug 2015 Vaneet Aggarwal, Shuchin Aeron

In this short note we extend some of the recent results on matrix completion under the assumption that the columns of the matrix can be grouped (clustered) into subspaces (not necessarily disjoint or independent).

Clustering Matrix Completion

Covfefe: A Computer Vision Approach For Estimating Force Exertion

no code implementations25 Sep 2018 Vaneet Aggarwal, Hamed Asadi, Mayank Gupta, Jae Joong Lee, Denny Yu

We note that the PPG signals can be obtained from the face videos, thus giving an efficient classification algorithm for the force exertion levels using face videos.

General Classification Photoplethysmography (PPG)

Stochastic Top-$K$ Subset Bandits with Linear Space and Non-Linear Feedback

no code implementations29 Nov 2018 Mridul Agarwal, Vaneet Aggarwal, Christopher J. Quinn, Abhishek K. Umrawal

Many real-world problems like Social Influence Maximization face the dilemma of choosing the best $K$ out of $N$ options at a given time instant.

Multi-Armed Bandits

DeepPool: Distributed Model-free Algorithm for Ride-sharing using Deep Reinforcement Learning

no code implementations9 Mar 2019 Abubakr Alabbasi, Arnob Ghosh, Vaneet Aggarwal

The success of modern ride-sharing platforms crucially depends on the profit of the ride-sharing fleet operating companies, and how efficiently the resources are managed.

reinforcement-learning Reinforcement Learning (RL)

Reinforcement Learning for Mean Field Game

no code implementations30 May 2019 Mridul Agarwal, Vaneet Aggarwal, Arnob Ghosh, Nilay Tiwari

This paper focuses on finding a mean-field equilibrium (MFE) in an action coupled stochastic game setting in an episodic framework.

reinforcement-learning Reinforcement Learning (RL)

GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning

no code implementations30 Aug 2019 Anis Elgabli, Jihong Park, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal

When the data is distributed across multiple servers, lowering the communication cost between the servers (or workers) while solving the distributed learning problem is an important problem and is the focus of this paper.

BIG-bench Machine Learning

Encoders and Decoders for Quantum Expander Codes Using Machine Learning

no code implementations6 Sep 2019 Sathwik Chadaga, Mridul Agarwal, Vaneet Aggarwal

However, large-scale design of quantum encoders and decoders have to depend on the channel characteristics and require look-up tables which require memory that is exponential in the number of qubits.

BIG-bench Machine Learning Q-Learning

Reinforcement Learning for Joint Optimization of Multiple Rewards

no code implementations6 Sep 2019 Mridul Agarwal, Vaneet Aggarwal

Finding optimal policies which maximize long term rewards of Markov Decision Processes requires the use of dynamic programming and backward induction to solve the Bellman optimality equation.

Decision Making Fairness +3

A Distributed Model-Free Algorithm for Multi-hop Ride-sharing using Deep Reinforcement Learning

no code implementations30 Oct 2019 Ashutosh Singh, Abubakr Alabbasi, Vaneet Aggarwal

The growth of autonomous vehicles, ridesharing systems, and self driving technology will bring a shift in the way ride hailing platforms plan out their services.

Autonomous Vehicles Reinforcement Learning (RL)

Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints

no code implementations11 Mar 2020 Qinbo Bai, Vaneet Aggarwal, Ather Gattami

The proposed algorithm is proved to achieve an $(\epsilon, p)$-PAC policy when the episode $K\geq\Omega(\frac{I^2H^6SA\ell}{\epsilon^2})$, where $S$ and $A$ are the number of states and actions, respectively.

Q-Learning Scheduling

Regret and Belief Complexity Trade-off in Gaussian Process Bandits via Information Thresholding

no code implementations23 Mar 2020 Amrit Singh Bedi, Dheeraj Peddireddy, Vaneet Aggarwal, Brian M. Sadler, Alec Koppel

Doing so permits us to precisely characterize the trade-off between regret bounds of GP bandit algorithms and complexity of the posterior distributions depending on the compression parameter $\epsilon$ for both discrete and continuous action sets.

Bayesian Optimization Decision Making +1

From Federated to Fog Learning: Distributed Machine Learning over Heterogeneous Wireless Networks

no code implementations7 Jun 2020 Seyyedali Hosseinalipour, Christopher G. Brinton, Vaneet Aggarwal, Huaiyu Dai, Mung Chiang

There are several challenges with employing conventional federated learning in contemporary networks, due to the significant heterogeneity in compute and communication capabilities that exist across devices.

BIG-bench Machine Learning Federated Learning +1

Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints

no code implementations10 Jun 2020 Qinbo Bai, Vaneet Aggarwal, Ather Gattami

This paper uses concepts from constrained optimization and Q-learning to propose an algorithm for CMDP with long-term constraints.

Q-Learning

FlexPool: A Distributed Model-Free Deep Reinforcement Learning Algorithm for Joint Passengers & Goods Transportation

no code implementations27 Jul 2020 Kaushik Manchella, Abhishek K. Umrawal, Vaneet Aggarwal

Through simulations on a realistic multi-agent urban mobility platform, we demonstrate that FlexPool outperforms other model-free settings in serving the demands from passengers & goods.

Reinforcement Learning (RL)

LBP: Robust Rate Adaptation Algorithm for SVC Video Streaming

no code implementations30 Apr 2018 Anis Elgabli, Vaneet Aggarwal, Shuai Hao, Feng Qian, Subhabrata Sen

The objective is to optimize a novel QoE metric that models a combination of the three objectives of minimizing the stall/skip duration of the video, maximizing the playback quality of every chunk, and minimizing the number of quality switches.

Networking and Internet Architecture Multimedia

A Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching using Deep Reinforcement Learning

no code implementations5 Oct 2020 Marina Haliem, Ganapathy Mani, Vaneet Aggarwal, Bharat Bhargava

In this paper, we present a dynamic, demand aware, and pricing-based vehicle-passenger matching and route planning framework that (1) dynamically generates optimal routes for each vehicle based on online demand, pricing associated with each ride, vehicle capacities and locations.

Decision Making Reinforcement Learning (RL)

Blind Decision Making: Reinforcement Learning with Delayed Observations

no code implementations16 Nov 2020 Mridul Agarwal, Vaneet Aggarwal

This paper proposes an approach, where the delay in the knowledge of the state can be used, and the decisions are made based on the available information which may not include the current state information.

Decision Making reinforcement-learning +1

PassGoodPool: Joint Passengers and Goods Fleet Management with Reinforcement Learning aided Pricing, Matching, and Route Planning

no code implementations17 Nov 2020 Kaushik Manchella, Marina Haliem, Vaneet Aggarwal, Bharat Bhargava

The ubiquitous growth of mobility-on-demand services for passenger and goods delivery has brought various challenges and opportunities within the realm of transportation systems.

Decision Making Management +1

Scheduling and Power Control for Wireless Multicast Systems via Deep Reinforcement Learning

no code implementations27 Sep 2020 Ramkumar Raghu, Mahadesh Panju, Vaneet Aggarwal, Vinod Sharma

In this paper, we use deep reinforcement learning where we use function approximation of the Q-function via a deep neural network to obtain a power control policy that matches the optimal policy for a small network.

reinforcement-learning Reinforcement Learning (RL) +2

A multi-agent evolutionary robotics framework to train spiking neural networks

no code implementations7 Dec 2020 Souvik Das, Anirudh Shankar, Vaneet Aggarwal

Rules of the framework select certain bots and their SNNs for reproduction and others for elimination based on their efficacy in capturing food in a competitive environment.

Model Free Reinforcement Learning Algorithm for Stationary Mean field Equilibrium for Multiple Types of Agents

no code implementations31 Dec 2020 Arnob Ghosh, Vaneet Aggarwal

We consider a multi-agent Markov strategic interaction over an infinite horizon where agents can be of multiple types.

Reinforcement Learning (RL)

A Supervised Learning Approach for Robust Health Monitoring using Face Videos

no code implementations30 Jan 2021 Mayank Gupta, Lingjun Chen, Denny Yu, Vaneet Aggarwal

Non-contact methods can have additional advantages since they are scalable with any environment where video can be captured, can be used for continuous measurements, and can be used on patients with varying levels of dexterity and independence, from people with physical impairments to infants (e. g., baby camera).

Multi-Agent Multi-Armed Bandits with Limited Communication

no code implementations10 Feb 2021 Mridul Agarwal, Vaneet Aggarwal, Kamyar Azizzadenesheli

With our algorithm, LCC-UCB, each agent enjoys a regret of $\tilde{O}\left(\sqrt{({K/N}+ N)T}\right)$, communicates for $O(\log T)$ steps and broadcasts $O(\log K)$ bits in each communication step.

Multi-Armed Bandits

Communication Efficient Parallel Reinforcement Learning

no code implementations22 Feb 2021 Mridul Agarwal, Bhargav Ganguly, Vaneet Aggarwal

We provide \NAM\ which runs at each agent and prove that the total cumulative regret of $M$ agents is upper bounded as $\Tilde{O}(DS\sqrt{MAT})$ for a Markov Decision Process with diameter $D$, number of states $S$, and number of actions $A$.

reinforcement-learning Reinforcement Learning (RL)

Quantum Entropic Causal Inference

no code implementations23 Feb 2021 Mohammad Ali Javidian, Vaneet Aggarwal, Fanglin Bao, Zubin Jacob

This successful inference on a synthetic quantum dataset can have practical applications in identifying originators of malicious activity on future multi-node quantum networks as well as quantum error correction.

Causal Inference

AdaPool: A Diurnal-Adaptive Fleet Management Framework using Model-Free Deep Reinforcement Learning and Change Point Detection

no code implementations1 Apr 2021 Marina Haliem, Vaneet Aggarwal, Bharat Bhargava

To mitigate this problem in highly dynamic environments, we (1) adopt an online Dirichlet change point detection (ODCP) algorithm to detect the changes in the distribution of experiences, (2) develop a Deep Q Network (DQN) agent that is capable of recognizing diurnal patterns and making informed dispatching decisions according to the changes in the underlying environment.

Change Point Detection Management +1

Quantum causal inference in the presence of hidden common causes: An entropic approach

no code implementations24 Apr 2021 Mohammad Ali Javidian, Vaneet Aggarwal, Zubin Jacob

We also demonstrate that the proposed approach outperforms the results of classical causal inference for the Tubingen database when the variables are classical by exploiting quantum dependence between variables through density matrices rather than joint probability distributions.

Causal Inference

Markov Decision Processes with Long-Term Average Constraints

no code implementations12 Jun 2021 Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal

We consider the problem of constrained Markov Decision Process (CMDP) where an agent interacts with a unichain Markov Decision Process.

On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)

no code implementations9 Sep 2021 Washim Uddin Mondal, Mridul Agarwal, Vaneet Aggarwal, Satish V. Ukkusuri

We show that, in these cases, the $K$-class MARL problem can be approximated by MFC with errors given as $e_1=\mathcal{O}(\frac{\sqrt{|\mathcal{X}|}+\sqrt{|\mathcal{U}|}}{N_{\mathrm{pop}}}\sum_{k}\sqrt{N_k})$, $e_2=\mathcal{O}(\left[\sqrt{|\mathcal{X}|}+\sqrt{|\mathcal{U}|}\right]\sum_{k}\frac{1}{\sqrt{N_k}})$ and $e_3=\mathcal{O}\left(\left[\sqrt{|\mathcal{X}|}+\sqrt{|\mathcal{U}|}\right]\left[\frac{A}{N_{\mathrm{pop}}}\sum_{k\in[K]}\sqrt{N_k}+\frac{B}{\sqrt{N_{\mathrm{pop}}}}\right]\right)$, respectively, where $A, B$ are some constants and $|\mathcal{X}|,|\mathcal{U}|$ are the sizes of state and action spaces of each agent.

Multi-agent Reinforcement Learning

Concave Utility Reinforcement Learning with Zero-Constraint Violations

no code implementations12 Sep 2021 Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal

We consider the problem of tabular infinite horizon concave utility reinforcement learning (CURL) with convex constraints.

reinforcement-learning Reinforcement Learning (RL)

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

no code implementations13 Sep 2021 Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal

To achieve that, we advocate the use of randomized primal-dual approach to solve the CMDP problems and propose a conservative stochastic primal-dual algorithm (CSPDA) which is shown to exhibit $\tilde{\mathcal{O}}\left(1/\epsilon^2\right)$ sample complexity to achieve $\epsilon$-optimal cumulative reward with zero constraint violations.

Decision Making reinforcement-learning +1

Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming

no code implementations22 Oct 2021 Alec Koppel, Amrit Singh Bedi, Bhargav Ganguly, Vaneet Aggarwal

We establish that the sample complexity to obtain near-globally optimal solutions matches tight dependencies on the cardinality of the state and action spaces, and exhibits classical scalings with respect to the network in accordance with multi-agent optimization.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Learning Circular Hidden Quantum Markov Models: A Tensor Network Approach

no code implementations29 Oct 2021 Mohammad Ali Javidian, Vaneet Aggarwal, Zubin Jacob

In this paper, we propose circular Hidden Quantum Markov Models (c-HQMMs), which can be applied for modeling temporal data in quantum datasets (with classical datasets as a special case).

Efficient Large-Scale Gaussian Process Bandits by Believing only Informative Actions

no code implementations L4DC 2020 Amrit Singh Bedi, Dheeraj Peddireddy, Vaneet Aggarwal, Alec Koppel

Experimentally, we observe state of the art accuracy and complexity tradeoffs for GP bandit algorithms on various hyper-parameter tuning tasks, suggesting the merits of managing the complexity of GPs in bandit settings

Bayesian Optimization

Learning Multi-agent Skills for Tabular Reinforcement Learning using Factor Graphs

no code implementations20 Jan 2022 Jiayu Chen, Jingdi Chen, Tian Lan, Vaneet Aggarwal

Covering skill (a. k. a., option) discovery has been developed to improve the exploration of reinforcement learning in single-agent scenarios with sparse reward signals, through connecting the most distant states in the embedding space provided by the Fiedler vector of the state transition graph.

reinforcement-learning Reinforcement Learning (RL)

Classical Simulation of Variational Quantum Classifiers using Tensor Rings

no code implementations21 Jan 2022 Dheeraj Peddireddy, Vipul Bansal, Vaneet Aggarwal

This manuscript proposes an algorithm that compresses the quantum state within a circuit using a tensor ring representation which allows for the implementation of VQC based algorithms on a classical simulator at a fraction of the usual storage and computational complexity.

BIG-bench Machine Learning Combinatorial Optimization +1

Parallel Successive Learning for Dynamic Distributed Model Training over Heterogeneous Wireless Networks

no code implementations7 Feb 2022 Seyyedali Hosseinalipour, Su Wang, Nicolo Michelusi, Vaneet Aggarwal, Christopher G. Brinton, David J. Love, Mung Chiang

PSL considers the realistic scenario where global aggregations are conducted with idle times in-between them for resource efficiency improvements, and incorporates data dispersion and model dispersion with local model condensation into FedL.

Federated Learning

Can Mean Field Control (MFC) Approximate Cooperative Multi Agent Reinforcement Learning (MARL) with Non-Uniform Interaction?

1 code implementation28 Feb 2022 Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri

We prove that, if the reward of each agent is an affine function of the mean-field seen by that agent, then one can approximate such a non-uniform MARL problem via its associated MFC problem within an error of $e=\mathcal{O}(\frac{1}{\sqrt{N}}[\sqrt{|\mathcal{X}|} + \sqrt{|\mathcal{U}|}])$ where $N$ is the population size and $|\mathcal{X}|$, $|\mathcal{U}|$ are the sizes of state and action spaces respectively.

Multi-agent Reinforcement Learning

Multi-Edge Server-Assisted Dynamic Federated Learning with an Optimized Floating Aggregation Point

no code implementations26 Mar 2022 Bhargav Ganguly, Seyyedali Hosseinalipour, Kwang Taik Kim, Christopher G. Brinton, Vaneet Aggarwal, David J. Love, Mung Chiang

CE-FL also introduces floating aggregation point, where the local models generated at the devices and the servers are aggregated at an edge server, which varies from one model training round to another to cope with the network evolution in terms of data distribution and users' mobility.

Distributed Optimization Federated Learning

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm

no code implementations12 Jun 2022 Qinbo Bai, Amrit Singh Bedi, Vaneet Aggarwal

We propose a novel Conservative Natural Policy Gradient Primal-Dual Algorithm (C-NPG-PD) to achieve zero constraint violation while achieving state of the art convergence results for the objective value function.

A Community-Aware Framework for Social Influence Maximization

1 code implementation18 Jul 2022 Abhishek K. Umrawal, Christopher J. Quinn, Vaneet Aggarwal

We propose a community-aware divide-and-conquer framework that involves (i) learning the inherent community structure of the social network, (ii) generating candidate solutions by solving the influence maximization problem for each community, and (iii) selecting the final set of seed nodes using a novel progressive budgeting scheme.

On the Near-Optimality of Local Policies in Large Cooperative Multi-Agent Reinforcement Learning

no code implementations7 Sep 2022 Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri

We show that in a cooperative $N$-agent network, one can design locally executable policies for the agents such that the resulting discounted sum of average rewards (value) well approximates the optimal value computed over all (including non-local) policies.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)

no code implementations15 Sep 2022 Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri

In a special case where the reward, cost, and state transition functions are independent of the action distribution of the population, we prove that the error can be improved to $e=\mathcal{O}(\sqrt{|\mathcal{X}|}/\sqrt{N})$.

Multi-agent Reinforcement Learning reinforcement-learning +1

Multi-agent Deep Covering Skill Discovery

no code implementations7 Oct 2022 Jiayu Chen, Marina Haliem, Tian Lan, Vaneet Aggarwal

In this case, we propose Multi-agent Deep Covering Option Discovery, which constructs the multi-agent options through minimizing the expected cover time of the multiple agents' joint state space.

Multi-agent Reinforcement Learning reinforcement-learning +1

On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization

no code implementations14 Nov 2022 Mudit Gaur, Vaneet Aggarwal, Mridul Agarwal

Deep Q-learning based algorithms have been applied successfully in many decision making problems, while their theoretical foundations are not as well understood.

Decision Making Q-Learning

Online Federated Learning via Non-Stationary Detection and Adaptation amidst Concept Drift

no code implementations22 Nov 2022 Bhargav Ganguly, Vaneet Aggarwal

Federated Learning (FL) is an emerging domain in the broader context of artificial intelligence research.

Federated Learning

Quantum Heavy-tailed Bandits

no code implementations23 Jan 2023 Yulian Wu, Chaowen Guan, Vaneet Aggarwal, Di Wang

In this paper, we study multi-armed bandits (MAB) and stochastic linear bandits (SLB) with heavy-tailed rewards and quantum reward oracle.

Multi-Armed Bandits

Randomized Greedy Learning for Non-monotone Stochastic Submodular Maximization Under Full-bandit Feedback

no code implementations2 Feb 2023 Fares Fourati, Vaneet Aggarwal, Christopher John Quinn, Mohamed-Slim Alouini

We investigate the problem of unconstrained combinatorial multi-armed bandits with full-bandit feedback and stochastic rewards for submodular maximization.

Multi-Armed Bandits

FilFL: Client Filtering for Optimized Client Participation in Federated Learning

no code implementations13 Feb 2023 Fares Fourati, Salma Kharrat, Vaneet Aggarwal, Mohamed-Slim Alouini, Marco Canini

Federated learning is an emerging machine learning paradigm that enables clients to train collaboratively without exchanging local data.

Federated Learning

Towards Cooperative Federated Learning over Heterogeneous Edge/Fog Networks

no code implementations15 Mar 2023 Su Wang, Seyyedali Hosseinalipour, Vaneet Aggarwal, Christopher G. Brinton, David J. Love, Weifeng Su, Mung Chiang

Federated learning (FL) has been promoted as a popular technique for training machine learning (ML) models over edge/fog networks.

Federated Learning

Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit Feedback

no code implementations23 Mar 2023 Mohammad Pedramfar, Vaneet Aggarwal

This paper investigates the problem of combinatorial multiarmed bandits with stochastic submodular (in expectation) rewards and full-bandit delayed feedback, where the delayed feedback is assumed to be composite and anonymous.

Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward

no code implementations4 May 2023 Washim Uddin Mondal, Vaneet Aggarwal

We investigate an infinite-horizon average reward Markov Decision Process (MDP) with delayed, composite, and partially anonymous reward feedback.

Attribute reinforcement-learning

A Unified Approach for Maximizing Continuous DR-submodular Functions

no code implementations NeurIPS 2023 Mohammad Pedramfar, Christopher John Quinn, Vaneet Aggarwal

This paper presents a unified approach for maximizing continuous DR-submodular functions that encompasses a range of settings and oracle access types.

FERN: Leveraging Graph Attention Networks for Failure Evaluation and Robust Network Design

no code implementations30 May 2023 Chenyi Liu, Vaneet Aggarwal, Tian Lan, Nan Geng, Yuan Yang, Mingwei Xu, Qing Li

By providing a neural network function approximation of this common kernel using graph attention networks, we develop a unified learning-based framework, FERN, for scalable Failure Evaluation and Robust Network design.

Graph Attention

On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

no code implementations18 Jun 2023 Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal

To achieve that, we propose a Natural Actor-Critic algorithm with 2-Layer critic parametrization (NAC2L).

Decision Making

Noisy Tensor Ring approximation for computing gradients of Variational Quantum Eigensolver for Combinatorial Optimization

no code implementations8 Jul 2023 Dheeraj Peddireddy, Utkarsh Priyam, Vaneet Aggarwal

While the single qubit gates do not alter the ring structure, the state transformations from the two qubit rotations are evaluated by truncating the singular values thereby preserving the structure of the tensor ring and reducing the computational complexity.

Combinatorial Optimization

Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs

no code implementations21 Jul 2023 Jiayu Chen, Jingdi Chen, Tian Lan, Vaneet Aggarwal

Our key idea is to approximate the joint state space as a Kronecker graph, based on which we can directly estimate its Fiedler vector using the Laplacian spectrum of individual agents' transition graphs.

Representation Learning

Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning

no code implementations28 Aug 2023 Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Offline reinforcement learning aims to utilize datasets of previously gathered environment-action interaction records to learn a policy without access to the real environment.

D4RL Off-policy evaluation +2

Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes

no code implementations5 Sep 2023 Qinbo Bai, Washim Uddin Mondal, Vaneet Aggarwal

Remarkably, this paper marks a pioneering effort by presenting the first exploration into regret-bound computation for the general parameterized policy gradient algorithm in the context of average reward scenarios.

Tensor Ring Optimized Quantum-Enhanced Tensor Neural Networks

1 code implementation2 Oct 2023 Debanjan Konar, Dheeraj Peddireddy, Vaneet Aggarwal, Bijaya K. Panigrahi

Quantum machine learning researchers often rely on incorporating Tensor Networks (TN) into Deep Neural Networks (DNN) and variational optimization.

Binary Classification Quantum Machine Learning +1

Improved Analysis of Sparse Linear Regression in Local Differential Privacy Model

no code implementations11 Oct 2023 Liyang Zhu, Meng Ding, Vaneet Aggarwal, Jinhui Xu, Di Wang

To address these issues, we first consider the problem in the $\epsilon$ non-interactive LDP model and provide a lower bound of $\Omega(\frac{\sqrt{dk\log d}}{\sqrt{n}\epsilon})$ on the $\ell_2$-norm estimation error for sub-Gaussian data, where $n$ is the sample size and $d$ is the dimension of the space.

regression

Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward Markov Decision Processes

no code implementations18 Oct 2023 Bhargav Ganguly, Yang Xu, Vaneet Aggarwal

Through thorough theoretical analysis, we demonstrate that the quantum advantage in mean estimation leads to exponential advancements in regret guarantees for infinite horizon Reinforcement Learning.

reinforcement-learning

Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm with General Parameterization for Infinite Horizon Discounted Reward Markov Decision Processes

no code implementations18 Oct 2023 Washim Uddin Mondal, Vaneet Aggarwal

In the class of Hessian-free and IS-free algorithms, ANPG beats the best-known sample complexity by a factor of $\mathcal{O}(\epsilon^{-\frac{1}{2}})$ and simultaneously matches their state-of-the-art iteration complexity.

Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision

no code implementations4 Nov 2023 Aditya Malusare, Harish Kothandaraman, Dipesh Tamboli, Nadia A. Lanman, Vaneet Aggarwal

This paper presents the Ensemble Nucleotide Byte-level Encoder-Decoder (ENBED) foundation model, analyzing DNA sequences at byte-level precision with an encoder-decoder Transformer architecture.

Language Modelling Masked Language Modeling

Combinatorial Stochastic-Greedy Bandit

no code implementations13 Dec 2023 Fares Fourati, Christopher John Quinn, Mohamed-Slim Alouini, Vaneet Aggarwal

We propose a novel combinatorial stochastic-greedy bandit (SGB) algorithm for combinatorial multi-armed bandit problems when no extra information other than the joint reward of the selected set of $n$ arms at each time step $t\in [T]$ is observed.

A Generalized Approach to Online Convex Optimization

no code implementations13 Feb 2024 Mohammad Pedramfar, Vaneet Aggarwal

We also show that any such algorithm that requires full-information feedback may be transformed to an algorithm with semi-bandit feedback with comparable regret bound.

Improving Molecule Generation and Drug Discovery with a Knowledge-enhanced Generative Model

no code implementations13 Feb 2024 Aditya Malusare, Vaneet Aggarwal

Recent advancements in generative models have established state-of-the-art benchmarks in generating molecules and novel drug candidates.

Drug Discovery Knowledge Graph Embeddings +1

Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization

no code implementations15 Mar 2024 Mohammad Pedramfar, Yididiya Y. Nadew, Christopher J. Quinn, Vaneet Aggarwal

This paper introduces unified projection-free Frank-Wolfe type algorithms for adversarial continuous DR-submodular optimization, spanning scenarios such as full information and (semi-)bandit feedback, monotone and non-monotone functions, different constraints, and types of stochastic queries.

Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries

no code implementations15 Mar 2024 Swetha Ganesh, Jiayu Chen, Gugan Thoppe, Vaneet Aggarwal

Federated Reinforcement Learning (FRL) allows multiple agents to collaboratively build a decision making policy without sharing raw trajectories.

Decision Making Policy Gradient Methods

Global Optimality without Mixing Time Oracles in Average-reward RL via Multi-level Actor-Critic

no code implementations18 Mar 2024 Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha

In the context of average-reward reinforcement learning, the requirement for oracle knowledge of the mixing time, a measure of the duration a Markov chain under a fixed policy needs to achieve its stationary distribution-poses a significant challenge for the global convergence of policy gradient methods.

Policy Gradient Methods

Cannot find the paper you are looking for? You can Submit a new open access paper.