Search Results for author: Bhargav Ganguly

Found 7 papers, 1 papers with code

Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions

1 code implementation21 Feb 2024 Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal

This work offers a hands-on reference for the research progress in deep generative models for offline policy learning, and aims to inspire improved DGM-based offline RL or IL algorithms.

Imitation Learning Offline RL

Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward Markov Decision Processes

no code implementations18 Oct 2023 Bhargav Ganguly, Yang Xu, Vaneet Aggarwal

Through thorough theoretical analysis, we demonstrate that the quantum advantage in mean estimation leads to exponential advancements in regret guarantees for infinite horizon Reinforcement Learning.

reinforcement-learning

Online Federated Learning via Non-Stationary Detection and Adaptation amidst Concept Drift

no code implementations22 Nov 2022 Bhargav Ganguly, Vaneet Aggarwal

Federated Learning (FL) is an emerging domain in the broader context of artificial intelligence research.

Federated Learning

Multi-Edge Server-Assisted Dynamic Federated Learning with an Optimized Floating Aggregation Point

no code implementations26 Mar 2022 Bhargav Ganguly, Seyyedali Hosseinalipour, Kwang Taik Kim, Christopher G. Brinton, Vaneet Aggarwal, David J. Love, Mung Chiang

CE-FL also introduces floating aggregation point, where the local models generated at the devices and the servers are aggregated at an edge server, which varies from one model training round to another to cope with the network evolution in terms of data distribution and users' mobility.

Distributed Optimization Federated Learning

Convergence Rates of Average-Reward Multi-agent Reinforcement Learning via Randomized Linear Programming

no code implementations22 Oct 2021 Alec Koppel, Amrit Singh Bedi, Bhargav Ganguly, Vaneet Aggarwal

We establish that the sample complexity to obtain near-globally optimal solutions matches tight dependencies on the cardinality of the state and action spaces, and exhibits classical scalings with respect to the network in accordance with multi-agent optimization.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Communication Efficient Parallel Reinforcement Learning

no code implementations22 Feb 2021 Mridul Agarwal, Bhargav Ganguly, Vaneet Aggarwal

We provide \NAM\ which runs at each agent and prove that the total cumulative regret of $M$ agents is upper bounded as $\Tilde{O}(DS\sqrt{MAT})$ for a Markov Decision Process with diameter $D$, number of states $S$, and number of actions $A$.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.