no code implementations • 1 Jan 2024 • Honghao Wei, Xiyue Peng, Xin Liu, Arnob Ghosh
Theoretically, we demonstrate that when the actor employs a no-regret optimization oracle, SATAC achieves two guarantees: (i) For the first time in the offline RL setting, we establish that SATAC can produce a policy that outperforms the behavior policy while maintaining the same level of safety, which is critical to designing an algorithm for offline RL.
no code implementations • 1 Jun 2023 • Peizhong Ju, Arnob Ghosh, Ness B. Shroff
Fairness plays a crucial role in various multi-agent systems (e.g., communication networks and financial markets).
no code implementations • 10 Mar 2023 • Honghao Wei, Arnob Ghosh, Ness Shroff, Lei Ying, Xingyu Zhou
We study model-free reinforcement learning (RL) algorithms in episodic non-stationary constrained Markov Decision Processes (CMDPs), in which an agent aims to maximize the expected cumulative reward subject to a cumulative constraint on the expected utility (cost).
no code implementations • 28 Nov 2022 • Arnob Ghosh
We focus on a setup where both the leader and followers are non-myopic, i.e., they both seek to maximize their rewards over the entire episode, and consider a linear MDP, which can model the continuous state spaces that are common in many RL applications.
no code implementations • 23 Jun 2022 • Arnob Ghosh, Xingyu Zhou, Ness Shroff
To this end, we consider episodic constrained Markov decision processes with linear function approximation, where the transition dynamics and the reward function can be represented as linear functions of a known feature mapping.
no code implementations • 28 Aug 2021 • Arnob Ghosh, Thomas Parisini
We consider an intersection zone where autonomous vehicles (AVs) and human-driven vehicles (HDVs) can be present.
no code implementations • 24 Jun 2021 • Erica Salvato, Arnob Ghosh, Gianfranco Fenu, Thomas Parisini
We consider a mixed autonomy scenario where the traffic intersection controller decides whether the traffic light will be green or red at each lane for multiple traffic-light blocks.
no code implementations • 31 Dec 2020 • Arnob Ghosh, Vaneet Aggarwal
We consider a multi-agent Markov strategic interaction over an infinite horizon where agents can be of multiple types.
no code implementations • 30 May 2019 • Mridul Agarwal, Vaneet Aggarwal, Arnob Ghosh, Nilay Tiwari
This paper focuses on finding a mean-field equilibrium (MFE) in an action-coupled stochastic game setting in an episodic framework.
no code implementations • 9 Mar 2019 • Abubakr Alabbasi, Arnob Ghosh, Vaneet Aggarwal
The success of modern ride-sharing platforms crucially depends on the profit of the ride-sharing fleet operating companies, and how efficiently the resources are managed.