no code implementations • 21 Dec 2020 • Jacopo Castellini, Sam Devlin, Frans A. Oliehoek, Rahul Savani
Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning.
no code implementations • 4 Apr 2019 • Jacopo Castellini
One of the main problems encountered so far with recurrent neural networks is that they struggle to retain long-time information dependencies in their recurrent connections.