no code implementations • 1 Jan 2021 • Daniel Wontae Nam, Younghoon Kim, Chan Youn Park
Recent distributional reinforcement learning methods, despite their successes, still contain fundamental problems that can lead to inaccurate representations of value distributions, such as distributional instability, action type restriction, and biased approximation.