Search Results for author: Arnob Ghosh

Found 10 papers, 0 papers with code

Adversarially Trained Actor Critic for offline CMDPs

no code implementations1 Jan 2024 Honghao Wei, Xiyue Peng, Xin Liu, Arnob Ghosh

Theoretically, we demonstrate that when the actor employs a no-regret optimization oracle, SATAC achieves two guarantees: (i) For the first time in the offline RL setting, we establish that SATAC can produce a policy that outperforms the behavior policy while maintaining the same level of safety, which is critical to designing an algorithm for offline RL.

Continuous Control Offline RL +1

Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning

no code implementations1 Jun 2023 Peizhong Ju, Arnob Ghosh, Ness B. Shroff

Fairness plays a crucial role in various multi-agent systems (e. g., communication networks, financial markets, etc.).

Fairness Offline RL +2

Provably Efficient Model-Free Algorithms for Non-stationary CMDPs

no code implementations10 Mar 2023 Honghao Wei, Arnob Ghosh, Ness Shroff, Lei Ying, Xingyu Zhou

We study model-free reinforcement learning (RL) algorithms in episodic non-stationary constrained Markov Decision Processes (CMDPs), in which an agent aims to maximize the expected cumulative reward subject to a cumulative constraint on the expected utility (cost).

Reinforcement Learning (RL)

Provably Efficient Model-free RL in Leader-Follower MDP with Linear Function Approximation

no code implementations28 Nov 2022 Arnob Ghosh

We focus on a setup where both the leader and followers are {\em non-myopic}, i. e., they both seek to maximize their rewards over the entire episode and consider a linear MDP which can model continuous state-space which is very common in many RL applications.

Provably Efficient Model-Free Constrained RL with Linear Function Approximation

no code implementations23 Jun 2022 Arnob Ghosh, Xingyu Zhou, Ness Shroff

To this end, we consider the episodic constrained Markov decision processes with linear function approximation, where the transition dynamics and the reward function can be represented as a linear function of some known feature mapping.

Traffic Control in a Mixed Autonomy Scenario at Urban Intersections: An Optimization-based Framework

no code implementations28 Aug 2021 Arnob Ghosh, Thomas Parisini

We consider an intersection zone where autonomous vehicles (AVs) and human-driven vehicles (HDVs) can be present.

Autonomous Vehicles

Control of a Mixed Autonomy Signalised Urban Intersection: An Action-Delayed Reinforcement Learning Approach

no code implementations24 Jun 2021 Erica Salvato, Arnob Ghosh, Gianfranco Fenu, Thomas Parisini

We consider a mixed autonomy scenario where the traffic intersection controller decides whether the traffic light will be green or red at each lane for multiple traffic-light blocks.

Reinforcement Learning (RL)

Model Free Reinforcement Learning Algorithm for Stationary Mean field Equilibrium for Multiple Types of Agents

no code implementations31 Dec 2020 Arnob Ghosh, Vaneet Aggarwal

We consider a multi-agent Markov strategic interaction over an infinite horizon where agents can be of multiple types.

Reinforcement Learning (RL)

Reinforcement Learning for Mean Field Game

no code implementations30 May 2019 Mridul Agarwal, Vaneet Aggarwal, Arnob Ghosh, Nilay Tiwari

This paper focuses on finding a mean-field equilibrium (MFE) in an action coupled stochastic game setting in an episodic framework.

reinforcement-learning Reinforcement Learning (RL)

DeepPool: Distributed Model-free Algorithm for Ride-sharing using Deep Reinforcement Learning

no code implementations9 Mar 2019 Abubakr Alabbasi, Arnob Ghosh, Vaneet Aggarwal

The success of modern ride-sharing platforms crucially depends on the profit of the ride-sharing fleet operating companies, and how efficiently the resources are managed.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.