no code implementations • 24 Jun 2020 • Hassam Ullah Sheikh, Ladislau Bölöni
Recently, the Maxmin and Ensemble Q-learning algorithms have used different estimates provided by the ensembles of learners to reduce the overestimation bias.
no code implementations • 24 Mar 2020 • Hassam Ullah Sheikh, Ladislau Bölöni
This is a challenging task for current state-of-the-art multi-agent reinforcement algorithms that are designed to either maximize the global reward of the team or the individual local rewards.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 24 Aug 2019 • Hassam Ullah Sheikh, Ladislau Bölöni
We explore a collaborative and cooperative multi-agent reinforcement learning setting where a team of reinforcement learning agents attempt to solve a single cooperative task in a multi-scenario setting.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 28 Jan 2019 • Hassam Ullah Sheikh, Ladislau Bölöni
We are considering a scenario where a team of bodyguard robots provides physical protection to a VIP in a crowded public space.