no code implementations • 13 May 2022 • Hanna Krasowski, Jakob Thumm, Marlon Müller, Lukas Schäfer, Xiao Wang, Matthias Althoff
We categorize the methods based on how they adapt the action: action replacement, action projection, and action masking.
Benchmarking reinforcement-learning +2