no code implementations • 16 Aug 2024 • Tejas Pagare, Avishek Ghosh
In this paper, we study a decentralized two-sided matching market, where we do not assume that the preference ranking over players are known to the arms apriori.
no code implementations • 7 Apr 2023 • Tejas Pagare, Vivek Borkar, Konstantin Avrachenkov
We extend the provably convergent Full Gradient DQN algorithm for discounted reward Markov decision processes from Avrachenkov et al. (2021) to average reward problems.