no code implementations • 11 Feb 2024 • Willem Röpke, Mathieu Reymond, Patrick Mannion, Diederik M. Roijers, Ann Nowé, Roxana Rădulescu
A significant challenge in multi-objective reinforcement learning is obtaining a Pareto front of policies that attain optimal performance under different preferences.
Multi-Objective Reinforcement Learning reinforcement-learning
no code implementations • 23 Nov 2022 • Conor F. Hayes, Mathieu Reymond, Diederik M. Roijers, Enda Howley, Patrick Mannion
Both algorithms outperform the state-of-the-art in multi-objective reinforcement learning for the expected utility of the returns.
Multi-Objective Reinforcement Learning reinforcement-learning +1
1 code implementation • 11 Apr 2022 • Mathieu Reymond, Eugenio Bargiacchi, Ann Nowé
In multi-objective optimization, learning all the policies that reach Pareto-efficient solutions is an expensive process.
no code implementations • 11 Apr 2022 • Mathieu Reymond, Conor F. Hayes, Lander Willem, Roxana Rădulescu, Steven Abrams, Diederik M. Roijers, Enda Howley, Patrick Mannion, Niel Hens, Ann Nowé, Pieter Libin
As decision making in the context of epidemic mitigation is hard, reinforcement learning provides a methodology to automatically learn prevention strategies in combination with complex epidemic models.
no code implementations • 23 Dec 2021 • Raphaël Avalos, Mathieu Reymond, Ann Nowé, Diederik M. Roijers
Many recent successful off-policy multi-agent reinforcement learning (MARL) algorithms for cooperative partially observable environments focus on finding factorized value functions, leading to convoluted network structures.
1 code implementation • 17 Mar 2021 • Conor F. Hayes, Roxana Rădulescu, Eugenio Bargiacchi, Johan Källström, Matthew Macfarlane, Mathieu Reymond, Timothy Verstraeten, Luisa M. Zintgraf, Richard Dazeley, Fredrik Heintz, Enda Howley, Athirai A. Irissappane, Patrick Mannion, Ann Nowé, Gabriel Ramos, Marcello Restelli, Peter Vamplew, Diederik M. Roijers
Real-world decision-making tasks are generally complex, requiring trade-offs between multiple, often conflicting, objectives.
no code implementations • 1 Feb 2021 • Conor F. Hayes, Mathieu Reymond, Diederik M. Roijers, Enda Howley, Patrick Mannion
In many risk-aware and multi-objective reinforcement learning settings, the utility of the user is derived from the single execution of a policy.