no code implementations • 9 Jul 2024 • Fotios Lygerakis, Elmar Rueckert
Traditional Variational Autoencoders (VAEs) are constrained by the limitations of the Evidence Lower Bound (ELBO) formulation, particularly when utilizing simplistic, non-analytic, or unknown prior distributions.
1 code implementation • 30 Jan 2024 • Fotios Lygerakis, Vedant Dave, Elmar Rueckert
One of the most critical aspects of multimodal Reinforcement Learning (RL) is the effective integration of different observation modalities.
no code implementations • 23 Jan 2024 • Nikolaus Feith, Elmar Rueckert
Interactive Machine Learning (IML) seeks to integrate human expertise into machine learning processes.
1 code implementation • 22 Jan 2024 • Vedant Dave, Fotios Lygerakis, Elmar Rueckert
In evaluating our methodology, we focus on two distinct tasks: material classification and grasping success prediction.
1 code implementation • 6 Sep 2023 • Fotios Lygerakis, Elmar Rueckert
The Variational Autoencoder (VAE) is known to suffer from the phenomenon of \textit{posterior collapse}, where the latent representations generated by the model become independent of the inputs.
1 code implementation • 23 Feb 2022 • Honghu Xue, Benedikt Hein, Mohamed Bakr, Georg Schildbach, Bengt Abel, Elmar Rueckert
We propose a deep reinforcement learning approach for solving a mapless navigation problem in warehouse scenarios.
no code implementations • 5 Jul 2021 • Honghu Xue, Rebecca Herzog, Till M Berger, Tobias Bäumer, Anne Weissbach, Elmar Rueckert
The benefit of ProMPs is that the features are directly learned from the data and ProMPs can capture important features describing the trajectory shape, which can easily be extended to other tasks.
no code implementations • 17 May 2021 • Daniel Tanneberg, Elmar Rueckert, Jan Peters
A key feature of intelligent behaviour is the ability to learn abstract strategies that scale and transfer to unfamiliar problems.
no code implementations • 26 Mar 2021 • Daniel Tanneberg, Kai Ploeger, Elmar Rueckert, Jan Peters
Integrating robots in complex everyday environments requires a multitude of problems to be solved.
no code implementations • ICLR 2020 • Nils Rottmann, Tjasa Kunavar, Jan Babic, Jan Peters, Elmar Rueckert
In order to reach similar performance, we developed a hierarchical Bayesian optimization algorithm that replicates the cognitive inference and memorization process for avoiding failures in motor control tasks.
no code implementations • 30 Oct 2019 • Daniel Tanneberg, Elmar Rueckert, Jan Peters
A key feature of intelligent behavior is the ability to learn abstract strategies that transfer to unfamiliar problems.
no code implementations • 25 Sep 2019 • Daniel Tanneberg, Elmar Rueckert, Jan Peters
A key feature of intelligent behavior is the ability to learn abstract strategies that transfer to unfamiliar problems.
no code implementations • 11 Aug 2019 • Svenja Stark, Jan Peters, Elmar Rueckert
Accordingly, for learning a new task, time could be saved by restricting the parameter search space by initializing it with the solution of a similar task.
no code implementations • 28 Apr 2019 • Zinan Liu, Kai Ploeger, Svenja Stark, Elmar Rueckert, Jan Peters
In quadruped gait learning, policy search methods that scale high dimensional continuous action spaces are commonly used.
no code implementations • 1 Mar 2018 • Adrian Šošić, Elmar Rueckert, Jan Peters, Abdelhak M. Zoubir, Heinz Koeppl
Advances in the field of inverse reinforcement learning (IRL) have led to sophisticated inference frameworks that relax the original modeling assumption of observing an agent behavior that reflects only a single intention.
no code implementations • 22 Feb 2018 • Daniel Tanneberg, Jan Peters, Elmar Rueckert
By using learning signals which mimic the intrinsic motivation signalcognitive dissonance in addition with a mental replay strategy to intensify experiences, the stochastic recurrent network can learn from few physical interactions and adapts to novel environments in seconds.