Search Results for author: Vaneet Aggarwal

Found 126 papers, 24 papers with code

Last-Iterate Convergence of General Parameterized Policies in Constrained MDPs

no code implementations21 Aug 2024 Washim Uddin Mondal, Vaneet Aggarwal

We consider the problem of learning a Constrained Markov Decision Process (CMDP) via general parameterization.

A Scalable Quantum Non-local Neural Network for Image Classification

1 code implementation26 Jul 2024 Sparsh Gupta, Debanjan Konar, Vaneet Aggarwal

Non-local operations play a crucial role in computer vision enabling the capture of long-range dependencies through weighted sums of features across the input, surpassing the constraints of traditional convolution operations that focus solely on local neighborhoods.

Binary Classification Image Classification

Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms

no code implementations17 Jun 2024 Vaneet Aggarwal, Washim Uddin Mondal, Qinbo Bai

This monograph focuses on the exploration of various model-based and model-free approaches for Constrained RL within the context of average reward Markov Decision Processes (MDPs).

Autonomous Driving Decision Making +2

Variational Offline Multi-agent Skill Discovery

no code implementations26 May 2024 Jiayu Chen, Bhargav Ganguly, Tian Lan, Vaneet Aggarwal

Skills are effective temporal abstractions established for sequential decision making tasks, which enable efficient hierarchical learning for long-horizon tasks and facilitate multi-task learning through their transferability.

Decision Making Multi-agent Reinforcement Learning +2

Sample-Efficient Constrained Reinforcement Learning with General Parameterization

no code implementations17 May 2024 Washim Uddin Mondal, Vaneet Aggarwal

We consider a constrained Markov Decision Problem (CMDP) where the goal of an agent is to maximize the expected discounted sum of rewards over an infinite horizon while ensuring that the expected discounted sum of costs exceeds a certain threshold.

reinforcement-learning

Stochastic Q-learning for Large Discrete Action Spaces

no code implementations16 May 2024 Fares Fourati, Vaneet Aggarwal, Mohamed-Slim Alouini

In complex environments with large discrete action spaces, effective decision-making is critical in reinforcement learning (RL).

Decision Making Q-Learning +1

Federated Combinatorial Multi-Agent Multi-Armed Bandits

no code implementations9 May 2024 Fares Fourati, Mohamed-Slim Alouini, Vaneet Aggarwal

Additionally, the algorithm is notably communication-efficient, requiring only a sublinear number of communication rounds, quantified as $\tilde{\mathcal{O}}\left(\psi T^\frac{\beta}{\beta+1}\right)$.

Combinatorial Optimization Data Summarization +2

Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization

no code implementations3 May 2024 Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal

The current state-of-the-art theoretical analysis of Actor-Critic (AC) algorithms significantly lags in addressing the practical aspects of AC implementations.

From Linear to Linearizable Optimization: A Novel Framework with Applications to Stationary and Non-stationary DR-submodular Optimization

no code implementations27 Apr 2024 Mohammad Pedramfar, Vaneet Aggarwal

This paper introduces the notion of upper linearizable/quadratizable functions, a class that extends concavity and DR-submodularity in various settings, including monotone and non-monotone cases over different convex sets.

A Bi-directional Quantum Search Algorithm

1 code implementation24 Apr 2024 Debanjan Konar, Zain Hafeez, Vaneet Aggarwal

Grover's search algorithms, including various partial Grover searches, experience scaling problems as the number of iterations rises with increased qubits, making implementation more computationally expensive.

Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis

no code implementations9 Apr 2024 Guangchen Lan, Dong-Jun Han, Abolfazl Hashemi, Vaneet Aggarwal, Christopher G. Brinton

Moreover, compared to synchronous FedPG, AFedPG improves the time complexity from $\mathcal{O}(\frac{t_{\max}}{N})$ to $\mathcal{O}(\frac{1}{\sum_{i=1}^{N} \frac{1}{t_{i}}})$, where $t_{i}$ denotes the time consumption in each iteration at the agent $i$, and $t_{\max}$ is the largest one.

Variance-Reduced Policy Gradient Approaches for Infinite Horizon Average Reward Markov Decision Processes

no code implementations2 Apr 2024 Swetha Ganesh, Washim Uddin Mondal, Vaneet Aggarwal

The second approach, rooted in Hessian-based techniques, ensures an expected regret of the order $\tilde{\mathcal{O}}(\sqrt{T})$.

Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

no code implementations18 Mar 2024 Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha

In the context of average-reward reinforcement learning, the requirement for oracle knowledge of the mixing time, a measure of the duration a Markov chain under a fixed policy needs to achieve its stationary distribution, poses a significant challenge for the global convergence of policy gradient methods.

Policy Gradient Methods

Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries

no code implementations15 Mar 2024 Swetha Ganesh, Jiayu Chen, Gugan Thoppe, Vaneet Aggarwal

Federated Reinforcement Learning (FRL) allows multiple agents to collaboratively build a decision making policy without sharing raw trajectories.

Decision Making Policy Gradient Methods

Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization

1 code implementation15 Mar 2024 Mohammad Pedramfar, Yididiya Y. Nadew, Christopher J. Quinn, Vaneet Aggarwal

This paper introduces unified projection-free Frank-Wolfe type algorithms for adversarial continuous DR-submodular optimization, spanning scenarios such as full information and (semi-)bandit feedback, monotone and non-monotone functions, different constraints, and types of stochastic queries.

Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer

1 code implementation12 Mar 2024 Dipesh Tamboli, Jiayu Chen, Kiran Pranesh Jotheeswaran, Denny Yu, Vaneet Aggarwal

Sepsis, a life-threatening condition triggered by the body's exaggerated response to infection, demands urgent intervention to prevent severe complications.

Decision Making

A Generalized Approach to Online Convex Optimization

no code implementations13 Feb 2024 Mohammad Pedramfar, Vaneet Aggarwal

We show that any algorithm for online linear optimization with fully adaptive adversaries is an algorithm for online convex optimization.

Improving Molecule Generation and Drug Discovery with a Knowledge-enhanced Generative Model

no code implementations13 Feb 2024 Aditya Malusare, Vaneet Aggarwal

Recent advancements in generative models have established state-of-the-art benchmarks in generating molecules and novel drug candidates.

Drug Discovery Knowledge Graph Embeddings +1

Combinatorial Stochastic-Greedy Bandit

no code implementations13 Dec 2023 Fares Fourati, Christopher John Quinn, Mohamed-Slim Alouini, Vaneet Aggarwal

We propose a novel combinatorial stochastic-greedy bandit (SGB) algorithm for combinatorial multi-armed bandit problems when no extra information other than the joint reward of the selected set of $n$ arms at each time step $t\in [T]$ is observed.

Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision

no code implementations4 Nov 2023 Aditya Malusare, Harish Kothandaraman, Dipesh Tamboli, Nadia A. Lanman, Vaneet Aggarwal

This paper presents the Ensemble Nucleotide Byte-level Encoder-Decoder (ENBED) foundation model, analyzing DNA sequences at byte-level precision with an encoder-decoder Transformer architecture.

Decoder Language Modelling +1

Improved Sample Complexity Analysis of Natural Policy Gradient Algorithm with General Parameterization for Infinite Horizon Discounted Reward Markov Decision Processes

no code implementations18 Oct 2023 Washim Uddin Mondal, Vaneet Aggarwal

In the class of Hessian-free and IS-free algorithms, ANPG beats the best-known sample complexity by a factor of $\mathcal{O}(\epsilon^{-\frac{1}{2}})$ and simultaneously matches their state-of-the-art iteration complexity.

Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward Markov Decision Processes

no code implementations18 Oct 2023 Bhargav Ganguly, Yang Xu, Vaneet Aggarwal

Through thorough theoretical analysis, we demonstrate that the quantum advantage in mean estimation leads to exponential advancements in regret guarantees for infinite horizon Reinforcement Learning.

reinforcement-learning

Improved Analysis of Sparse Linear Regression in Local Differential Privacy Model

no code implementations11 Oct 2023 Liyang Zhu, Meng Ding, Vaneet Aggarwal, Jinhui Xu, Di Wang

To address these issues, we first consider the problem in the $\epsilon$ non-interactive LDP model and provide a lower bound of $\Omega(\frac{\sqrt{dk\log d}}{\sqrt{n}\epsilon})$ on the $\ell_2$-norm estimation error for sub-Gaussian data, where $n$ is the sample size and $d$ is the dimension of the space.

regression

Tensor Ring Optimized Quantum-Enhanced Tensor Neural Networks

1 code implementation2 Oct 2023 Debanjan Konar, Dheeraj Peddireddy, Vaneet Aggarwal, Bijaya K. Panigrahi

Quantum machine learning researchers often rely on incorporating Tensor Networks (TN) into Deep Neural Networks (DNN) and variational optimization.

Binary Classification Quantum Machine Learning +1

Domain Adaptive Few-Shot Open-Set Learning

1 code implementation ICCV 2023 Debabrata Pal, Deeptej More, Sai Bhargav, Dipesh Tamboli, Vaneet Aggarwal, Biplab Banerjee

Few-shot learning has made impressive strides in addressing the crucial challenges of recognizing unknown samples from novel classes in target query sets and managing visual shifts between domains.

cross-domain few-shot learning Few-Shot Learning +1

Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes

no code implementations5 Sep 2023 Qinbo Bai, Washim Uddin Mondal, Vaneet Aggarwal

Remarkably, this paper marks a pioneering effort by presenting the first exploration into regret-bound computation for the general parameterized policy gradient algorithm in the context of average reward scenarios.

Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning

no code implementations28 Aug 2023 Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Offline reinforcement learning aims to utilize datasets of previously gathered environment-action interaction records to learn a policy without access to the real environment.

D4RL Off-policy evaluation +2

Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs

no code implementations21 Jul 2023 Jiayu Chen, Jingdi Chen, Tian Lan, Vaneet Aggarwal

Our key idea is to approximate the joint state space as a Kronecker graph, based on which we can directly estimate its Fiedler vector using the Laplacian spectrum of individual agents' transition graphs.

Representation Learning

Noisy Tensor Ring approximation for computing gradients of Variational Quantum Eigensolver for Combinatorial Optimization

no code implementations8 Jul 2023 Dheeraj Peddireddy, Utkarsh Priyam, Vaneet Aggarwal

While the single qubit gates do not alter the ring structure, the state transformations from the two qubit rotations are evaluated by truncating the singular values thereby preserving the structure of the tensor ring and reducing the computational complexity.

Combinatorial Optimization

On the Global Convergence of Natural Actor-Critic with Two-layer Neural Network Parametrization

no code implementations18 Jun 2023 Mudit Gaur, Amrit Singh Bedi, Di Wang, Vaneet Aggarwal

To achieve that, we propose a Natural Actor-Critic algorithm with 2-Layer critic parametrization (NAC2L).

Decision Making

FERN: Leveraging Graph Attention Networks for Failure Evaluation and Robust Network Design

no code implementations30 May 2023 Chenyi Liu, Vaneet Aggarwal, Tian Lan, Nan Geng, Yuan Yang, Mingwei Xu, Qing Li

By providing a neural network function approximation of this common kernel using graph attention networks, we develop a unified learning-based framework, FERN, for scalable Failure Evaluation and Robust Network design.

Graph Attention

Hierarchical Deep Counterfactual Regret Minimization

1 code implementation27 May 2023 Jiayu Chen, Tian Lan, Vaneet Aggarwal

Imperfect Information Games (IIGs) offer robust models for scenarios where decision-makers face uncertainty or lack complete information.

counterfactual Decision Making

A Unified Approach for Maximizing Continuous DR-submodular Functions

no code implementations NeurIPS 2023 Mohammad Pedramfar, Christopher John Quinn, Vaneet Aggarwal

This paper presents a unified approach for maximizing continuous DR-submodular functions that encompasses a range of settings and oracle access types.

Multi-task Hierarchical Adversarial Inverse Reinforcement Learning

1 code implementation22 May 2023 Jiayu Chen, Dipesh Tamboli, Tian Lan, Vaneet Aggarwal

Multi-task Imitation Learning (MIL) aims to train a policy capable of performing a distribution of tasks based on multi-task expert demonstrations, which is essential for general-purpose robots.

Imitation Learning Multi-Task Learning +1

Reinforcement Learning with Delayed, Composite, and Partially Anonymous Reward

no code implementations4 May 2023 Washim Uddin Mondal, Vaneet Aggarwal

We investigate an infinite-horizon average reward Markov Decision Process (MDP) with delayed, composite, and partially anonymous reward feedback.

Attribute reinforcement-learning

Stochastic Submodular Bandits with Delayed Composite Anonymous Bandit Feedback

no code implementations23 Mar 2023 Mohammad Pedramfar, Vaneet Aggarwal

This paper investigates the problem of combinatorial multiarmed bandits with stochastic submodular (in expectation) rewards and full-bandit delayed feedback, where the delayed feedback is assumed to be composite and anonymous.

Towards Cooperative Federated Learning over Heterogeneous Edge/Fog Networks

no code implementations15 Mar 2023 Su Wang, Seyyedali Hosseinalipour, Vaneet Aggarwal, Christopher G. Brinton, David J. Love, Weifeng Su, Mung Chiang

Federated learning (FL) has been promoted as a popular technique for training machine learning (ML) models over edge/fog networks.

Federated Learning

FilFL: Client Filtering for Optimized Client Participation in Federated Learning

1 code implementation13 Feb 2023 Fares Fourati, Salma Kharrat, Vaneet Aggarwal, Mohamed-Slim Alouini, Marco Canini

We propose a novel approach, client filtering, to improve model generalization and optimize client participation and training.

Federated Learning

Randomized Greedy Learning for Non-monotone Stochastic Submodular Maximization Under Full-bandit Feedback

no code implementations2 Feb 2023 Fares Fourati, Vaneet Aggarwal, Christopher John Quinn, Mohamed-Slim Alouini

We investigate the problem of unconstrained combinatorial multi-armed bandits with full-bandit feedback and stochastic rewards for submodular maximization.

Multi-Armed Bandits

Quantum Heavy-tailed Bandits

no code implementations23 Jan 2023 Yulian Wu, Chaowen Guan, Vaneet Aggarwal, Di Wang

In this paper, we study multi-armed bandits (MAB) and stochastic linear bandits (SLB) with heavy-tailed rewards and quantum reward oracle.

Multi-Armed Bandits

Online Federated Learning via Non-Stationary Detection and Adaptation amidst Concept Drift

no code implementations22 Nov 2022 Bhargav Ganguly, Vaneet Aggarwal

Federated Learning (FL) is an emerging domain in the broader context of artificial intelligence research.

Federated Learning

On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization

no code implementations14 Nov 2022 Mudit Gaur, Vaneet Aggarwal, Mridul Agarwal

Deep Q-learning based algorithms have been applied successfully in many decision making problems, while their theoretical foundations are not as well understood.

Decision Making Q-Learning

Multi-agent Deep Covering Skill Discovery

no code implementations7 Oct 2022 Jiayu Chen, Marina Haliem, Tian Lan, Vaneet Aggarwal

In this case, we propose Multi-agent Deep Covering Option Discovery, which constructs the multi-agent options through minimizing the expected cover time of the multiple agents' joint state space.

Multi-agent Reinforcement Learning reinforcement-learning +1

Option-Aware Adversarial Inverse Reinforcement Learning for Robotic Control

1 code implementation5 Oct 2022 Jiayu Chen, Tian Lan, Vaneet Aggarwal

In this work, we develop a novel HIL algorithm based on Adversarial Inverse Reinforcement Learning and adapt it with the Expectation-Maximization algorithm in order to directly recover a hierarchical policy from the unannotated demonstrations.

Imitation Learning Multi-Task Learning +2

Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)

no code implementations15 Sep 2022 Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri

In a special case where the reward, cost, and state transition functions are independent of the action distribution of the population, we prove that the error can be improved to $e=\mathcal{O}(\sqrt{|\mathcal{X}|}/\sqrt{N})$.

Multi-agent Reinforcement Learning reinforcement-learning +1

On the Near-Optimality of Local Policies in Large Cooperative Multi-Agent Reinforcement Learning

no code implementations7 Sep 2022 Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri

We show that in a cooperative $N$-agent network, one can design locally executable policies for the agents such that the resulting discounted sum of average rewards (value) well approximates the optimal value computed over all (including non-local) policies.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

A Community-Aware Framework for Social Influence Maximization

1 code implementation18 Jul 2022 Abhishek K. Umrawal, Christopher J. Quinn, Vaneet Aggarwal

We propose a community-aware divide-and-conquer framework that involves (i) learning the inherent community structure of the social network, (ii) generating candidate solutions by solving the influence maximization problem for each community, and (iii) selecting the final set of seed nodes using a novel progressive budgeting scheme.

PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning

1 code implementation22 Jun 2022 Hanhan Zhou, Tian Lan, Vaneet Aggarwal

Multi-agent reinforcement learning (MARL) has witnessed significant progress with the development of value function factorization methods.

counterfactual Multi-agent Reinforcement Learning +5

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Conservative Natural Policy Gradient Primal-Dual Algorithm

no code implementations12 Jun 2022 Qinbo Bai, Amrit Singh Bedi, Vaneet Aggarwal

We propose a novel Conservative Natural Policy Gradient Primal-Dual Algorithm (C-NPG-PD) to achieve zero constraint violation while achieving state of the art convergence results for the objective value function.

Multi-Edge Server-Assisted Dynamic Federated Learning with an Optimized Floating Aggregation Point

no code implementations26 Mar 2022 Bhargav Ganguly, Seyyedali Hosseinalipour, Kwang Taik Kim, Christopher G. Brinton, Vaneet Aggarwal, David J. Love, Mung Chiang

CE-FL also introduces floating aggregation point, where the local models generated at the devices and the servers are aggregated at an edge server, which varies from one model training round to another to cope with the network evolution in terms of data distribution and users' mobility.

Distributed Optimization Federated Learning

Can Mean Field Control (MFC) Approximate Cooperative Multi Agent Reinforcement Learning (MARL) with Non-Uniform Interaction?

1 code implementation28 Feb 2022 Washim Uddin Mondal, Vaneet Aggarwal, Satish V. Ukkusuri

We prove that, if the reward of each agent is an affine function of the mean-field seen by that agent, then one can approximate such a non-uniform MARL problem via its associated MFC problem within an error of $e=\mathcal{O}(\frac{1}{\sqrt{N}}[\sqrt{|\mathcal{X}|} + \sqrt{|\mathcal{U}|}])$ where $N$ is the population size and $|\mathcal{X}|$, $|\mathcal{U}|$ are the sizes of state and action spaces respectively.

Multi-agent Reinforcement Learning

Deep Learning based Coverage and Rate Manifold Estimation in Cellular Networks

2 code implementations13 Feb 2022 Washim Uddin Mondal, Praful D. Mankar, Goutam Das, Vaneet Aggarwal, Satish V. Ukkusuri

This article proposes Convolutional Neural Network-based Auto Encoder (CNN-AE) to predict location-dependent rate and coverage probability of a network from its topology.

Parallel Successive Learning for Dynamic Distributed Model Training over Heterogeneous Wireless Networks

no code implementations7 Feb 2022 Seyyedali Hosseinalipour, Su Wang, Nicolo Michelusi, Vaneet Aggarwal, Christopher G. Brinton, David J. Love, Mung Chiang

PSL considers the realistic scenario where global aggregations are conducted with idle times in-between them for resource efficiency improvements, and incorporates data dispersion and model dispersion with local model condensation into FedL.

Federated Learning

Classical Simulation of Variational Quantum Classifiers using Tensor Rings

no code implementations21 Jan 2022 Dheeraj Peddireddy, Vipul Bansal, Vaneet Aggarwal

This manuscript proposes an algorithm that compresses the quantum state within a circuit using a tensor ring representation which allows for the implementation of VQC based algorithms on a classical simulator at a fraction of the usual storage and computational complexity.

BIG-bench Machine Learning Combinatorial Optimization +1

Learning Multi-agent Skills for Tabular Reinforcement Learning using Factor Graphs

no code implementations20 Jan 2022 Jiayu Chen, Jingdi Chen, Tian Lan, Vaneet Aggarwal

Covering skill (a. k. a., option) discovery has been developed to improve the exploration of reinforcement learning in single-agent scenarios with sparse reward signals, through connecting the most distant states in the embedding space provided by the Fiedler vector of the state transition graph.

reinforcement-learning Reinforcement Learning (RL)

Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

1 code implementation4 Jan 2022 Hanhan Zhou, Tian Lan, Vaneet Aggarwal

To this end, we present LSF-SAC, a novel framework that features a variational inference-based information-sharing mechanism as extra state information to assist individual agents in the value function factorization.

Starcraft Starcraft II +1

Learning Circular Hidden Quantum Markov Models: A Tensor Network Approach

no code implementations29 Oct 2021 Mohammad Ali Javidian, Vaneet Aggarwal, Zubin Jacob

In this paper, we propose circular Hidden Quantum Markov Models (c-HQMMs), which can be applied for modeling temporal data in quantum datasets (with classical datasets as a special case).

Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming

no code implementations22 Oct 2021 Alec Koppel, Amrit Singh Bedi, Bhargav Ganguly, Vaneet Aggarwal

We establish that the sample complexity to obtain near-globally optimal solutions matches tight dependencies on the cardinality of the state and action spaces, and exhibits classical scalings with respect to the network in accordance with multi-agent optimization.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach

no code implementations13 Sep 2021 Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal

To achieve that, we advocate the use of randomized primal-dual approach to solve the CMDP problems and propose a conservative stochastic primal-dual algorithm (CSPDA) which is shown to exhibit $\tilde{\mathcal{O}}\left(1/\epsilon^2\right)$ sample complexity to achieve $\epsilon$-optimal cumulative reward with zero constraint violations.

Decision Making reinforcement-learning +1

Concave Utility Reinforcement Learning with Zero-Constraint Violations

no code implementations12 Sep 2021 Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal

We consider the problem of tabular infinite horizon concave utility reinforcement learning (CURL) with convex constraints.

reinforcement-learning Reinforcement Learning (RL)

On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)

no code implementations9 Sep 2021 Washim Uddin Mondal, Mridul Agarwal, Vaneet Aggarwal, Satish V. Ukkusuri

We show that, in these cases, the $K$-class MARL problem can be approximated by MFC with errors given as $e_1=\mathcal{O}(\frac{\sqrt{|\mathcal{X}|}+\sqrt{|\mathcal{U}|}}{N_{\mathrm{pop}}}\sum_{k}\sqrt{N_k})$, $e_2=\mathcal{O}(\left[\sqrt{|\mathcal{X}|}+\sqrt{|\mathcal{U}|}\right]\sum_{k}\frac{1}{\sqrt{N_k}})$ and $e_3=\mathcal{O}\left(\left[\sqrt{|\mathcal{X}|}+\sqrt{|\mathcal{U}|}\right]\left[\frac{A}{N_{\mathrm{pop}}}\sum_{k\in[K]}\sqrt{N_k}+\frac{B}{\sqrt{N_{\mathrm{pop}}}}\right]\right)$, respectively, where $A, B$ are some constants and $|\mathcal{X}|,|\mathcal{U}|$ are the sizes of state and action spaces of each agent.

Multi-agent Reinforcement Learning

DROP: Deep relocating option policy for optimal ride-hailing vehicle repositioning

1 code implementation9 Sep 2021 Xinwu Qian, Shuocheng Guo, Vaneet Aggarwal

This study proposes the deep relocating option policy (DROP) that supervises vehicle agents to escape from oversupply areas and effectively relocate to potentially underserved areas.

An FEA surrogate model with Boundary Oriented Graph Embedding approach

1 code implementation30 Aug 2021 Xingyu Fu, Fengfeng Zhou, Dheeraj Peddireddy, Zhengyang Kang, Martin Byung-Guk Jun, Vaneet Aggarwal

In this work, we present a Boundary Oriented Graph Embedding (BOGE) approach for the Graph Neural Network (GNN) to serve as a general surrogate model for regressing physical fields and solving boundary value problems.

Cantilever Beam Decision Making +3

Markov Decision Processes with Long-Term Average Constraints

no code implementations12 Jun 2021 Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal

We consider the problem of constrained Markov Decision Process (CMDP) where an agent interacts with a unichain Markov Decision Process.

Quantum causal inference in the presence of hidden common causes: An entropic approach

no code implementations24 Apr 2021 Mohammad Ali Javidian, Vaneet Aggarwal, Zubin Jacob

We also demonstrate that the proposed approach outperforms the results of classical causal inference for the Tubingen database when the variables are classical by exploiting quantum dependence between variables through density matrices rather than joint probability distributions.

Causal Inference

AdaPool: A Diurnal-Adaptive Fleet Management Framework using Model-Free Deep Reinforcement Learning and Change Point Detection

no code implementations1 Apr 2021 Marina Haliem, Vaneet Aggarwal, Bharat Bhargava

To mitigate this problem in highly dynamic environments, we (1) adopt an online Dirichlet change point detection (ODCP) algorithm to detect the changes in the distribution of experiences, (2) develop a Deep Q Network (DQN) agent that is capable of recognizing diurnal patterns and making informed dispatching decisions according to the changes in the underlying environment.

Change Point Detection Management +1

Quantum Entropic Causal Inference

no code implementations23 Feb 2021 Mohammad Ali Javidian, Vaneet Aggarwal, Fanglin Bao, Zubin Jacob

This successful inference on a synthetic quantum dataset can have practical applications in identifying originators of malicious activity on future multi-node quantum networks as well as quantum error correction.

Causal Inference

Communication Efficient Parallel Reinforcement Learning

no code implementations22 Feb 2021 Mridul Agarwal, Bhargav Ganguly, Vaneet Aggarwal

We provide \NAM\ which runs at each agent and prove that the total cumulative regret of $M$ agents is upper bounded as $\Tilde{O}(DS\sqrt{MAT})$ for a Markov Decision Process with diameter $D$, number of states $S$, and number of actions $A$.

reinforcement-learning Reinforcement Learning (RL)

Multi-Agent Multi-Armed Bandits with Limited Communication

no code implementations10 Feb 2021 Mridul Agarwal, Vaneet Aggarwal, Kamyar Azizzadenesheli

With our algorithm, LCC-UCB, each agent enjoys a regret of $\tilde{O}\left(\sqrt{({K/N}+ N)T}\right)$, communicates for $O(\log T)$ steps and broadcasts $O(\log K)$ bits in each communication step.

Multi-Armed Bandits

A Supervised Learning Approach for Robust Health Monitoring using Face Videos

no code implementations30 Jan 2021 Mayank Gupta, Lingjun Chen, Denny Yu, Vaneet Aggarwal

Non-contact methods can have additional advantages since they are scalable with any environment where video can be captured, can be used for continuous measurements, and can be used on patients with varying levels of dexterity and independence, from people with physical impairments to infants (e. g., baby camera).

Model Free Reinforcement Learning Algorithm for Stationary Mean field Equilibrium for Multiple Types of Agents

no code implementations31 Dec 2020 Arnob Ghosh, Vaneet Aggarwal

We consider a multi-agent Markov strategic interaction over an infinite horizon where agents can be of multiple types.

Reinforcement Learning (RL)

A multi-agent evolutionary robotics framework to train spiking neural networks

no code implementations7 Dec 2020 Souvik Das, Anirudh Shankar, Vaneet Aggarwal

Rules of the framework select certain bots and their SNNs for reproduction and others for elimination based on their efficacy in capturing food in a competitive environment.

PassGoodPool: Joint Passengers and Goods Fleet Management with Reinforcement Learning aided Pricing, Matching, and Route Planning

no code implementations17 Nov 2020 Kaushik Manchella, Marina Haliem, Vaneet Aggarwal, Bharat Bhargava

The ubiquitous growth of mobility-on-demand services for passenger and goods delivery has brought various challenges and opportunities within the realm of transportation systems.

Decision Making Management +1

Blind Decision Making: Reinforcement Learning with Delayed Observations

no code implementations16 Nov 2020 Mridul Agarwal, Vaneet Aggarwal

This paper proposes an approach, where the delay in the knowledge of the state can be used, and the decisions are made based on the available information which may not include the current state information.

Decision Making reinforcement-learning +1

A Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching using Deep Reinforcement Learning

no code implementations5 Oct 2020 Marina Haliem, Ganapathy Mani, Vaneet Aggarwal, Bharat Bhargava

In this paper, we present a dynamic, demand aware, and pricing-based vehicle-passenger matching and route planning framework that (1) dynamically generates optimal routes for each vehicle based on online demand, pricing associated with each ride, vehicle capacities and locations.

Decision Making Reinforcement Learning (RL)

Scheduling and Power Control for Wireless Multicast Systems via Deep Reinforcement Learning

no code implementations27 Sep 2020 Ramkumar Raghu, Mahadesh Panju, Vaneet Aggarwal, Vinod Sharma

In this paper, we use deep reinforcement learning where we use function approximation of the Q-function via a deep neural network to obtain a power control policy that matches the optimal policy for a small network.

reinforcement-learning Reinforcement Learning (RL) +2

FlexPool: A Distributed Model-Free Deep Reinforcement Learning Algorithm for Joint Passengers & Goods Transportation

no code implementations27 Jul 2020 Kaushik Manchella, Abhishek K. Umrawal, Vaneet Aggarwal

Through simulations on a realistic multi-agent urban mobility platform, we demonstrate that FlexPool outperforms other model-free settings in serving the demands from passengers & goods.

Reinforcement Learning (RL)

Multi-Stage Hybrid Federated Learning over Large-Scale D2D-Enabled Fog Networks

1 code implementation18 Jul 2020 Seyyedali Hosseinalipour, Sheikh Shams Azam, Christopher G. Brinton, Nicolo Michelusi, Vaneet Aggarwal, David J. Love, Huaiyu Dai

We derive the upper bound of convergence for MH-FL with respect to parameters of the network topology (e. g., the spectral radius) and the learning algorithm (e. g., the number of D2D rounds in different clusters).

Federated Learning

Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints

no code implementations10 Jun 2020 Qinbo Bai, Vaneet Aggarwal, Ather Gattami

This paper uses concepts from constrained optimization and Q-learning to propose an algorithm for CMDP with long-term constraints.

Q-Learning

Efficient Large-Scale Gaussian Process Bandits by Believing only Informative Actions

no code implementations L4DC 2020 Amrit Singh Bedi, Dheeraj Peddireddy, Vaneet Aggarwal, Alec Koppel

Experimentally, we observe state of the art accuracy and complexity tradeoffs for GP bandit algorithms on various hyper-parameter tuning tasks, suggesting the merits of managing the complexity of GPs in bandit settings

Bayesian Optimization

From Federated to Fog Learning: Distributed Machine Learning over Heterogeneous Wireless Networks

no code implementations7 Jun 2020 Seyyedali Hosseinalipour, Christopher G. Brinton, Vaneet Aggarwal, Huaiyu Dai, Mung Chiang

There are several challenges with employing conventional federated learning in contemporary networks, due to the significant heterogeneity in compute and communication capabilities that exist across devices.

BIG-bench Machine Learning Federated Learning +1

Regret and Belief Complexity Trade-off in Gaussian Process Bandits via Information Thresholding

no code implementations23 Mar 2020 Amrit Singh Bedi, Dheeraj Peddireddy, Vaneet Aggarwal, Brian M. Sadler, Alec Koppel

Doing so permits us to precisely characterize the trade-off between regret bounds of GP bandit algorithms and complexity of the posterior distributions depending on the compression parameter $\epsilon$ for both discrete and continuous action sets.

Bayesian Optimization Decision Making +1

Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints

no code implementations11 Mar 2020 Qinbo Bai, Vaneet Aggarwal, Ather Gattami

The proposed algorithm is proved to achieve an $(\epsilon, p)$-PAC policy when the episode $K\geq\Omega(\frac{I^2H^6SA\ell}{\epsilon^2})$, where $S$ and $A$ are the number of states and actions, respectively.

Q-Learning Scheduling

A Distributed Model-Free Algorithm for Multi-hop Ride-sharing using Deep Reinforcement Learning

no code implementations30 Oct 2019 Ashutosh Singh, Abubakr Alabbasi, Vaneet Aggarwal

The growth of autonomous vehicles, ridesharing systems, and self driving technology will bring a shift in the way ride hailing platforms plan out their services.

Autonomous Vehicles Reinforcement Learning (RL)

Reinforcement Learning for Joint Optimization of Multiple Rewards

no code implementations6 Sep 2019 Mridul Agarwal, Vaneet Aggarwal

Finding optimal policies which maximize long term rewards of Markov Decision Processes requires the use of dynamic programming and backward induction to solve the Bellman optimality equation.

Decision Making Fairness +3

Encoders and Decoders for Quantum Expander Codes Using Machine Learning

no code implementations6 Sep 2019 Sathwik Chadaga, Mridul Agarwal, Vaneet Aggarwal

However, large-scale design of quantum encoders and decoders have to depend on the channel characteristics and require look-up tables which require memory that is exponential in the number of qubits.

BIG-bench Machine Learning Decoder +1

GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning

no code implementations30 Aug 2019 Anis Elgabli, Jihong Park, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal

When the data is distributed across multiple servers, lowering the communication cost between the servers (or workers) while solving the distributed learning problem is an important problem and is the focus of this paper.

BIG-bench Machine Learning

Reinforcement Learning for Mean Field Game

no code implementations30 May 2019 Mridul Agarwal, Vaneet Aggarwal, Arnob Ghosh, Nilay Tiwari

This paper focuses on finding a mean-field equilibrium (MFE) in an action coupled stochastic game setting in an episodic framework.

reinforcement-learning Reinforcement Learning (RL)

DeepPool: Distributed Model-free Algorithm for Ride-sharing using Deep Reinforcement Learning

no code implementations9 Mar 2019 Abubakr Alabbasi, Arnob Ghosh, Vaneet Aggarwal

The success of modern ride-sharing platforms crucially depends on the profit of the ride-sharing fleet operating companies, and how efficiently the resources are managed.

reinforcement-learning Reinforcement Learning (RL)

A Proximal Jacobian ADMM Approach for Fast Massive MIMO Signal Detection in Low-Latency Communications

1 code implementation2 Mar 2019 Anis Elgabli, Ali Elghariani, Vaneet Aggarwal, *Mehdi Bennis, Mark R. Bell

We introduce an objective function that is a sum of strictly convex and separable functions based on decomposing the received vector into multiple vectors.

Information Theory Information Theory

Stochastic Top-$K$ Subset Bandits with Linear Space and Non-Linear Feedback

no code implementations29 Nov 2018 Mridul Agarwal, Vaneet Aggarwal, Christopher J. Quinn, Abhishek K. Umrawal

Many real-world problems like Social Influence Maximization face the dilemma of choosing the best $K$ out of $N$ options at a given time instant.

Multi-Armed Bandits

Covfefe: A Computer Vision Approach For Estimating Force Exertion

no code implementations25 Sep 2018 Vaneet Aggarwal, Hamed Asadi, Mayank Gupta, Jae Joong Lee, Denny Yu

We note that the PPG signals can be obtained from the face videos, thus giving an efficient classification algorithm for the force exertion levels using face videos.

General Classification Photoplethysmography (PPG)

FastScan: Robust Low-Complexity Rate Adaptation Algorithm for Video Streaming over HTTP

1 code implementation7 Jun 2018 Anis Elgabli, Vaneet Aggarwal

For example, on an experiment conducted over 100 real cellular bandwidth traces of a public dataset that spans different bandwidth regimes, our proposed algorithm (FastScan) achieves the minimum re-buffering (stall) time and the maximum average playback rate in every single trace as compared to the original dash. js rate adaptation scheme, Festive, BBA, RB, and FastMPC algorithms.

Networking and Internet Architecture Multimedia

LBP: Robust Rate Adaptation Algorithm for SVC Video Streaming

no code implementations30 Apr 2018 Anis Elgabli, Vaneet Aggarwal, Shuai Hao, Feng Qian, Subhabrata Sen

The objective is to optimize a novel QoE metric that models a combination of the three objectives of minimizing the stall/skip duration of the video, maximizing the playback quality of every chunk, and minimizing the number of quality switches.

Networking and Internet Architecture Multimedia

Principal Component Analysis with Tensor Train Subspace

no code implementations13 Mar 2018 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Tensor train is a hierarchical tensor network structure that helps alleviate the curse of dimensionality by parameterizing large-scale multidimensional data via a set of network of low-rank tensors.

Wide Compression: Tensor Ring Nets

no code implementations CVPR 2018 Wenqi Wang, Yifan Sun, Brian Eriksson, Wenlin Wang, Vaneet Aggarwal

Deep neural networks have demonstrated state-of-the-art performance in a variety of real-world applications.

Image Classification

On Deterministic Sampling Patterns for Robust Low-Rank Matrix Completion

no code implementations5 Dec 2017 Morteza Ashraphijuo, Vaneet Aggarwal, Xiaodong Wang

In this letter, we study the deterministic sampling patterns for the completion of low rank matrix, when corrupted with a sparse noise, also known as robust matrix completion.

Low-Rank Matrix Completion valid

Tensor Train Neighborhood Preserving Embedding

no code implementations3 Dec 2017 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

In this paper, we propose a Tensor Train Neighborhood Preserving Embedding (TTNPE) to embed multi-dimensional tensor data into low dimensional tensor subspace.

Classification Dimensionality Reduction +1

Efficient Low Rank Tensor Ring Completion

no code implementations ICCV 2017 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Using the matrix product state (MPS) representation of the recently proposed tensor ring decompositions, in this paper we propose a tensor completion algorithm, which is an alternating minimization algorithm that alternates over the factors in the MPS representation.

Matrix Completion

Rank Determination for Low-Rank Data Completion

no code implementations3 Jul 2017 Morteza Ashraphijuo, Xiaodong Wang, Vaneet Aggarwal

Moreover, for both single-view matrix and CP tensor, we are able to show that the obtained upper bound is exactly equal to the unknown rank if the lowest-rank completion is given.

Deterministic and Probabilistic Conditions for Finite Completability of Low-rank Multi-View Data

no code implementations3 Jan 2017 Morteza Ashraphijuo, Xiaodong Wang, Vaneet Aggarwal

We provide a deterministic necessary and sufficient condition on the sampling pattern for finite completability.

Matrix Completion

Deterministic and Probabilistic Conditions for Finite Completability of Low-Tucker-Rank Tensor

no code implementations6 Dec 2016 Morteza Ashraphijuo, Vaneet Aggarwal, Xiaodong Wang

We investigate the fundamental conditions on the sampling pattern, i. e., locations of the sampled entries, for finite completability of a low-rank tensor given some components of its Tucker rank.

Unsupervised clustering under the Union of Polyhedral Cones (UOPC) model

no code implementations15 Oct 2016 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Similar to the Union of Subspaces (UOS) model where each data from each subspace is generated from a (unknown) basis, in the UOPC model each data from each cone is assumed to be generated from a finite number of (unknown) \emph{extreme rays}. To cluster data under this model, we consider several algorithms - (a) Sparse Subspace Clustering by Non-negative constraints Lasso (NCL), (b) Least squares approximation (LSA), and (c) K-nearest neighbor (KNN) algorithm to arrive at affinity between data points.

Clustering

Tensor Completion by Alternating Minimization under the Tensor Train (TT) Model

no code implementations19 Sep 2016 Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron

Using the matrix product state (MPS) representation of tensor train decompositions, in this paper we propose a tensor completion algorithm which alternates over the matrices (tensors) in the MPS representation.

Matrix Completion

On Deterministic Conditions for Subspace Clustering under Missing Data

no code implementations11 Jul 2016 Wenqi Wang, Shuchin Aeron, Vaneet Aggarwal

In this paper we present deterministic conditions for success of sparse subspace clustering (SSC) under missing data, when data is assumed to come from a Union of Subspaces (UoS) model.

Clustering

On deterministic conditions for subspace clustering under missing data

no code implementations15 Apr 2016 Wenqi Wang, Shuchin Aeron, Vaneet Aggarwal

We provide extensive set of simulation results for clustering as well as completion of data under missing entries, under the UoS model.

Clustering

Information-theoretic Bounds on Matrix Completion under Union of Subspaces Model

no code implementations14 Aug 2015 Vaneet Aggarwal, Shuchin Aeron

In this short note we extend some of the recent results on matrix completion under the assumption that the columns of the matrix can be grouped (clustered) into subspaces (not necessarily disjoint or independent).

Clustering Matrix Completion

Adaptive Sampling of RF Fingerprints for Fine-grained Indoor Localization

no code implementations10 Aug 2015 Xiao-Yang Liu, Shuchin Aeron, Vaneet Aggarwal, Xiaodong Wang, Min-You Wu

In contrast to several existing work that rely on random sampling, this paper shows that adaptivity in sampling can lead to significant improvements in localization accuracy.

Indoor Localization

Cannot find the paper you are looking for? You can Submit a new open access paper.