1 code implementation • 3 Mar 2021 • Varun Bhatt, Michael Buro
In this paper, we first study learning in matrix-based signaling games to empirically show that decentralized methods can converge to a suboptimal policy.
no code implementations • 7 Feb 2021 • Shivaram Kalyanakrishnan, Siddharth Aravindan, Vishwajeet Bagdawat, Varun Bhatt, Harshith Goka, Archit Gupta, Kalpesh Krishna, Vihari Piratla
In this paper, we investigate the role of the parameter $d$ in RL; $d$ is called the "frame-skip" parameter, since states in the Atari domain are images.
no code implementations • 9 Mar 2020 • Varun Bhatt, Shalini Shrivastava, Tanmay Chavan, Udayan Ganguly
The in-memory computing paradigm with emerging memory devices has recently been shown to be a promising way to accelerate deep learning.