1 code implementation • NeurIPS 2023 • Revan MacQueen, James R. Wright
Self-play is a technique for machine learning in multi-agent systems where a learning algorithm learns by interacting with copies of itself.
no code implementations • 29 Sep 2021 • Erfan Miahi, Revan MacQueen, Alex Ayoub, Abbas Masoumzadeh, Martha White
Soft-greedy operators, namely $\varepsilon$-greedy and softmax, remain a common choice to induce a basic level of exploration for action-value methods in reinforcement learning.