no code implementations • 26 Jan 2024 • Jiachen Xi, Alfredo Garcia, Petar Momcilovic
Several successful reinforcement learning algorithms make use of regularization to promote multi-modal policies that exhibit enhanced exploration and robustness.
Q-Learning