no code implementations • 18 Nov 2023 • Sihan Zeng, Sujay Bhatt, Eleonora Kreacic, Parisa Hassanzadeh, Alec Koppel, Sumitra Ganesh
We consider the design of mechanisms that allocate limited resources among self-interested agents using neural networks.
no code implementations • 24 Aug 2022 • Muhammad Aneeq uz Zaman, Alec Koppel, Sujay Bhatt, Tamer Başar
Given that the underlying Markov Decision Process (MDP) of the agent is communicating, we provide finite sample convergence guarantees in terms of convergence of the mean-field and control policy to the mean-field equilibrium.
no code implementations • 5 Aug 2022 • Sujay Bhatt, Guanhua Fang, Ping Li, Gennady Samorodnitsky
In this paper, we provide an extension of confidence sequences for settings where the variance of the data-generating distribution does not exist or is infinite.
no code implementations • 29 Sep 2021 • Muhammad Aneeq uz Zaman, Sujay Bhatt, Tamer Başar
In this paper, we propose a game between an exogenous adversary and a network of agents connected via a multigraph.
no code implementations • 9 Sep 2021 • Sujay Bhatt, Ping Li, Gennady Samorodnitsky
We consider a multi-armed bandit problem motivated by situations where only the extreme values, as opposed to expected values in the classical bandit setting, are of interest.
no code implementations • 9 Apr 2020 • Sujay Bhatt, Alec Koppel, Vikram Krishnamurthy
This paper considers policy search in continuous state-action reinforcement learning problems.