no code implementations • NeurIPS 2019 • Bastian Alt, Adrian Šošić, Heinz Koeppl
Many decision-making problems naturally exhibit pronounced structures inherited from the characteristics of the underlying environment.
1 code implementation • 17 Jul 2018 • Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann
However, concatenation scales poorly to swarm systems with a large number of homogeneous agents as it does not exploit the fundamental properties inherent to these systems: (i) the agents in the swarm are interchangeable and (ii) the exact number of agents in the swarm is irrelevant.
no code implementations • 1 Mar 2018 • Adrian Šošić, Elmar Rueckert, Jan Peters, Abdelhak M. Zoubir, Heinz Koeppl
Advances in the field of inverse reinforcement learning (IRL) have led to sophisticated inference frameworks that relax the original modeling assumption of observing an agent behavior that reflects only a single intention.
no code implementations • 21 Sep 2017 • Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann
Swarm systems constitute a challenging problem for reinforcement learning (RL) as the algorithm needs to learn decentralized control policies that can cope with limited local sensing and communication abilities of the agents.
1 code implementation • 18 Sep 2017 • Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann
Here, we follow a guided approach where a critic has central access to the global state during learning, which simplifies the policy evaluation problem from a reinforcement learning point of view.
no code implementations • 4 May 2016 • Adrian Šošić, Abdelhak M. Zoubir, Heinz Koeppl
Learning from demonstration (LfD) is the process of building behavioral models of a task from demonstrations provided by an expert.
no code implementations • 17 Feb 2016 • Adrian Šošić, Wasiur R. KhudaBukhsh, Abdelhak M. Zoubir, Heinz Koeppl
Inverse reinforcement learning (IRL) has become a useful tool for learning behavioral models from demonstration data.