no code implementations • 7 Mar 2020 • Konatsu Miyamoto, Masaya Suzuki, Yuma Kigami, Kodai Satake
In this paper, as a study of reinforcement learning, we examine the convergence of the Q function when rewards are unbounded, for example when they follow a Gaussian distribution.
Reinforcement Learning (RL)
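
As a rough illustration of the setting the abstract describes, the sketch below runs tabular Q-learning on a hypothetical one-state, two-action problem whose rewards are drawn from Gaussian distributions, and compares the estimates with the closed-form optimal values. It is not the authors' construction or proof. The function names, the reward means, the discount factor, and the step-size schedule `n ** -0.7` are all assumptions chosen for the example.

```python
import random


def q_learning_gaussian(means, sigma=1.0, gamma=0.5, steps=200_000, seed=0):
    """Tabular Q-learning on a one-state MDP with Gaussian (unbounded) rewards.

    Hypothetical toy setup: action a yields a reward drawn from
    N(means[a], sigma^2), and the process always returns to the same state.
    """
    rng = random.Random(seed)
    q = [0.0] * len(means)
    visits = [0] * len(means)
    for _ in range(steps):
        a = rng.randrange(len(means))       # uniform random exploration
        r = rng.gauss(means[a], sigma)      # reward with unbounded support
        visits[a] += 1
        # Assumed step size: sum(alpha) diverges, sum(alpha^2) converges.
        alpha = visits[a] ** -0.7
        target = r + gamma * max(q)         # one-step bootstrapped target
        q[a] += alpha * (target - q[a])
    return q


def q_star(means, gamma=0.5):
    """Closed-form optimal Q-values for the same one-state toy problem."""
    v = max(means) / (1.0 - gamma)          # V* solves V = max(mean) + gamma * V
    return [m + gamma * v for m in means]


if __name__ == "__main__":
    means = [1.0, 0.0]
    print("estimated:", q_learning_gaussian(means))
    print("optimal:  ", q_star(means))
```

Individual rewards here can be arbitrarily large in magnitude, yet with a decaying step size the estimates in this toy case settle near the closed-form values. That is the kind of behaviour a convergence result for Gaussian rewards would describe.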