1 code implementation • 9 Nov 2022 • Jernej Hribar, Luke Hackett, Ivana Dusparic
In this paper, we build on advances introduced by the Deep Q-Networks (DQN) approach to extend the multi-objective tabular Reinforcement Learning (RL) algorithm W-learning to large state spaces.