Search Results for author: Esra'a Saleh

Found 2 papers, 0 papers with code

Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration

no code implementations4 Jun 2022 Dustin Morrill, Esra'a Saleh, Michael Bowling, Amy Greenwald

Neural replicator dynamics (NeuRD) is an alternative to the foundational softmax policy gradient (SPG) algorithm motivated by online learning and evolutionary game theory.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.