no code implementations • 6 Jan 2024 • Kewen Ding, Peter Vamplew, Cameron Foale, Richard Dazeley
One common approach to solve multi-objective reinforcement learning (MORL) problems is to extend conventional Q-learning by using vector Q-values in combination with a utility function.
no code implementations • 16 Nov 2022 • Kewen Ding
A variant of MORL Q-Learning incorporating global statistics is shown to outperform the baseline method in original Space Traders problem, but remains below 100 percent effectiveness in finding the find desired SER-optimal policy at the end of training.