Search Results for author: Gerhard Neumann

Found 67 papers, 25 papers with code

Vlearn: Off-Policy Learning with Efficient State-Value Function Estimation

no code implementations7 Mar 2024 Fabian Otto, Philipp Becker, Vien Ang Ngo, Gerhard Neumann

Existing off-policy reinforcement learning algorithms typically necessitate an explicit state-action-value function representation, which becomes problematic in high-dimensional action spaces.

Efficient Exploration

Physics-informed MeshGraphNets (PI-MGNs): Neural finite element solvers for non-stationary and nonlinear simulations on arbitrary meshes

no code implementations16 Feb 2024 Tobias Würth, Niklas Freymuth, Clemens Zimmerling, Gerhard Neumann, Luise Kärger

This work introduces PI-MGNs, a hybrid approach that combines PINNs and MGNs to quickly and accurately solve non-stationary and nonlinear partial differential equations (PDEs) on arbitrary meshes.

Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning

1 code implementation21 Jan 2024 Ge Li, Hongyi Zhou, Dominik Roth, Serge Thilges, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann

Current advancements in reinforcement learning (RL) have predominantly focused on learning step-based policies that generate actions for each perceived state.

Reinforcement Learning (RL)

Neural Contractive Dynamical Systems

no code implementations17 Jan 2024 Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Nadia Figueroa, Gerhard Neumann, Leonel Rozo

Stability guarantees are crucial when ensuring a fully autonomous robot does not take undesirable or potentially harmful actions.

Domain-Specific Fine-Tuning of Large Language Models for Interactive Robot Programming

no code implementations21 Dec 2023 Benjamin Alt, Urs Keßner, Aleksandar Taranovic, Darko Katic, Andreas Hermann, Rainer Jäkel, Gerhard Neumann

Industrial robots are applied in a widening range of industries, but robot programming mostly remains a task limited to programming experts.

Industrial Robots

Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects

no code implementations15 Dec 2023 Paul Maria Scheikl, Nicolas Schreiber, Christoph Haas, Niklas Freymuth, Gerhard Neumann, Rudolf Lioutikov, Franziska Mathis-Ullrich

Policy learning in robot-assisted surgery (RAS) lacks data efficient and versatile methods that exhibit the desired motion quality for delicate surgical interventions.

Imitation Learning

Registered and Segmented Deformable Object Reconstruction from a Single View Point Cloud

no code implementations13 Nov 2023 Pit Henrich, Balázs Gyenes, Paul Maria Scheikl, Gerhard Neumann, Franziska Mathis-Ullrich

In deformable object manipulation, we often want to interact with specific segments of an object that are only defined in non-deformed models of the object.

Deformable Object Manipulation Object +1

Latent Task-Specific Graph Network Simulators

1 code implementation9 Nov 2023 Philipp Dahlinger, Niklas Freymuth, Michael Volpp, Tai Hoang, Gerhard Neumann

Movement primitives further allow us to accommodate various types of context data, as demonstrated through the utilization of point clouds during inference.

Meta-Learning Trajectory Prediction

Information-Theoretic Trust Regions for Stochastic Gradient-Based Optimization

1 code implementation31 Oct 2023 Philipp Dahlinger, Philipp Becker, Maximilian Hüttenrauch, Gerhard Neumann

Before each update, it solves the trust region problem for an optimal step size, resulting in a more stable and faster optimization process.

Multi Time Scale World Models

1 code implementation NeurIPS 2023 Vaisakh Shaj, Saleh Gholam Zadeh, Ozan Demir, Luiz Ricardo Douat, Gerhard Neumann

Intelligent agents use internal world models to reason and make predictions about different courses of their actions at many scales.

SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects

no code implementations31 Aug 2023 Ning Gao, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of the critical aspects.

6D Pose Estimation Object

Enhancing Interpretable Object Abstraction via Clustering-based Slot Initialization

no code implementations22 Aug 2023 Ning Gao, Bernard Hohmann, Gerhard Neumann

In our work, we initialize the slot representations with clustering algorithms conditioned on the perceptual input features.

Clustering Novel View Synthesis +2

DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes

no code implementations1 Aug 2023 Philipp Blättner, Johannes Brand, Gerhard Neumann, Ngo Anh Vien

The results demonstrate the effectiveness of the proposed approach in predicting versatile and dense grasps, and in advancing the field of multi-fingered robotic grasping.

Computational Efficiency Grasp Generation +1

MP3: Movement Primitive-Based (Re-)Planning Policy

no code implementations22 Jun 2023 Fabian Otto, Hongyi Zhou, Onur Celik, Ge Li, Rudolf Lioutikov, Gerhard Neumann

We introduce a novel deep reinforcement learning (RL) approach called Movement Primitive-based Planning Policy (MP3).

Reinforcement Learning (RL)

Swarm Reinforcement Learning For Adaptive Mesh Refinement

1 code implementation NeurIPS 2023 Niklas Freymuth, Philipp Dahlinger, Tobias Würth, Simon Reisch, Luise Kärger, Gerhard Neumann

Adaptive Mesh Refinement (AMR) enhances the Finite Element Method, an important technique for simulating complex problems in engineering, by dynamically refining mesh regions, enabling a favorable trade-off between computational speed and simulation accuracy.

reinforcement-learning

Grounding Graph Network Simulators using Physical Sensor Observations

1 code implementation23 Feb 2023 Jonas Linkerhägner, Niklas Freymuth, Paul Maria Scheikl, Franziska Mathis-Ullrich, Gerhard Neumann

Our method results in utilization of additional point cloud information to accurately predict stable simulations where existing Graph Network Simulators fail.

Imputation Motion Planning +1

Inferring Versatile Behavior from Demonstrations by Matching Geometric Descriptors

1 code implementation17 Oct 2022 Niklas Freymuth, Nicolas Schreiber, Philipp Becker, Aleksandar Taranovic, Gerhard Neumann

We find that the geometric descriptors greatly help in generalizing to new task configurations and that combining them with our distribution-matching objective is crucial for representing and reproducing versatile behavior.

Imitation Learning

On Uncertainty in Deep State Space Models for Model-Based Reinforcement Learning

1 code implementation17 Oct 2022 Philipp Becker, Gerhard Neumann

We show that RSSMs use a suboptimal inference scheme and that models trained using this inference overestimate the aleatoric uncertainty of the ground truth system.

Model-based Reinforcement Learning reinforcement-learning +2

ProDMPs: A Unified Perspective on Dynamic and Probabilistic Movement Primitives

no code implementations4 Oct 2022 Ge Li, Zeqi Jin, Michael Volpp, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann

MPs can be broadly categorized into two types: (a) dynamics-based approaches that generate smooth trajectories from any initial state, e. g., Dynamic Movement Primitives (DMPs), and (b) probabilistic approaches that capture higher-order statistics of the motion, e. g., Probabilistic Movement Primitives (ProMPs).

Numerical Integration

A Unified Perspective on Natural Gradient Variational Inference with Gaussian Mixture Models

1 code implementation23 Sep 2022 Oleg Arenz, Philipp Dahlinger, Zihan Ye, Michael Volpp, Gerhard Neumann

The two currently most effective methods for GMM-based variational inference, VIPS and iBayes-GMM, both employ independent natural gradient updates for the individual components and their weights.

Variational Inference

MV6D: Multi-View 6D Pose Estimation on RGB-D Frames Using a Deep Point-wise Voting Network

no code implementations1 Aug 2022 Fabian Duffhauss, Tobias Demmler, Gerhard Neumann

We overcome this issue with our novel multi-view 6D pose estimation method called MV6D which accurately predicts the 6D poses of all objects in a cluttered scene based on RGB-D images from multiple perspectives.

6D Pose Estimation Semantic Segmentation

End-to-End Learning of Hybrid Inverse Dynamics Models for Precise and Compliant Impedance Control

no code implementations27 May 2022 Moritz Reuss, Niels van Duijkeren, Robert Krug, Philipp Becker, Vaisakh Shaj, Gerhard Neumann

These models need to precisely capture the robot dynamics, which consist of well-understood components, e. g., rigid body dynamics, and effects that remain challenging to capture, e. g., stick-slip friction and mechanical flexibilities.

Friction

Regret-Aware Black-Box Optimization with Natural Gradients, Trust-Regions and Entropy Control

no code implementations24 May 2022 Maximilian Hüttenrauch, Gerhard Neumann

In contrast, stochastic optimizers that are motivated by policy gradients, such as the Model-based Relative Entropy Stochastic Search (MORE) algorithm, directly optimize the expected fitness function without the use of rankings.

Scheduling

Meta-Learning Regrasping Strategies for Physical-Agnostic Objects

no code implementations23 May 2022 Ning Gao, Jingyu Zhang, Ruijie Chen, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

Grasping inhomogeneous objects in real-world applications remains a challenging task due to the unknown physical properties such as mass distribution and coefficient of friction.

Friction Meta-Learning

Reactive Motion Generation on Learned Riemannian Manifolds

no code implementations15 Mar 2022 Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Gerhard Neumann, Leonel Rozo

We argue that Riemannian manifolds may be learned via human demonstrations in which geodesics are natural motion skills.

What Matters For Meta-Learning Vision Regression Tasks?

2 code implementations CVPR 2022 Ning Gao, Hanna Ziesche, Ngo Anh Vien, Michael Volpp, Gerhard Neumann

To this end, we (i) exhaustively evaluate common meta-learning techniques on these tasks, and (ii) quantitatively analyze the effect of various deep learning techniques commonly used in recent meta-learning algorithms in order to strengthen the generalization capability: data augmentation, domain randomization, task augmentation and meta-regularization.

Contrastive Learning Data Augmentation +4

Specializing Versatile Skill Libraries using Local Mixture of Experts

1 code implementation8 Dec 2021 Onur Celik, Dongzhuoran Zhou, Ge Li, Philipp Becker, Gerhard Neumann

This local and incremental learning results in a modular MoE model of high accuracy and versatility, where both properties can be scaled by adding more components on the fly.

Incremental Learning Reinforcement Learning (RL)

Switching Recurrent Kalman Networks

no code implementations16 Nov 2021 Giao Nguyen-Quynh, Philipp Becker, Chen Qiu, Maja Rudolph, Gerhard Neumann

In addition, driving data can often be multimodal in distribution, meaning that there are distinct predictions that are likely, but averaging can hurt model performance.

Autonomous Driving Time Series +1

Versatile Inverse Reinforcement Learning via Cumulative Rewards

no code implementations15 Nov 2021 Niklas Freymuth, Philipp Becker, Gerhard Neumann

Inverse Reinforcement Learning infers a reward function from expert demonstrations, aiming to encode the behavior and intentions of the expert.

reinforcement-learning Reinforcement Learning (RL)

A First-Order Method for Estimating Natural Gradients for Variational Inference with Gaussians and Gaussian Mixture Models

no code implementations29 Sep 2021 Oleg Arenz, Zihan Ye, Philipp Dahlinger, Gerhard Neumann

Effective approaches for Gaussian variational inference are MORE, VOGN, and VON, which are zero-order, first-order, and second-order, respectively.

Variational Inference

A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning

no code implementations6 Aug 2021 Abdalkarim Mohtasib, Gerhard Neumann, Heriberto Cuayahuitl

We argue that it is crucial to automate the reward learning process so that new skills can be taught to robots by their users.

reinforcement-learning Reinforcement Learning (RL)

Differentiable Robust LQR Layers

no code implementations10 Jun 2021 Ngo Anh Vien, Gerhard Neumann

This paper proposes a differentiable robust LQR layer for reinforcement learning and imitation learning under model uncertainty and stochastic dynamics.

Imitation Learning Inductive Bias

Residual Feedback Learning for Contact-Rich Manipulation Tasks with Uncertainty

no code implementations8 Jun 2021 Alireza Ranjbar, Ngo Anh Vien, Hanna Ziesche, Joschka Boedecker, Gerhard Neumann

We propose a new formulation that addresses these limitations by also modifying the feedback signals to the controller with an RL policy and show superior performance of our approach on a contact-rich peg-insertion task under position and orientation uncertainty.

Position Reinforcement Learning (RL)

Learning Riemannian Manifolds for Geodesic Motion Skills

no code implementations8 Jun 2021 Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Gerhard Neumann, Leonel Rozo

For robots to work alongside humans and perform in unstructured environments, they must learn new motion skills and adapt them to unseen situations on the fly.

Bayesian Context Aggregation for Neural Processes

no code implementations ICLR 2021 Michael Volpp, Fabian Flürenbrock, Lukas Grossberger, Christian Daniel, Gerhard Neumann

Recently, casting probabilistic regression as a multi-task learning problem in terms of conditional latent variable (CLV) models such as the Neural Process (NP) has shown promising results.

Bayesian Inference Multi-Task Learning +1

Action-Conditional Recurrent Kalman Networks For Forward and Inverse Dynamics Learning

2 code implementations20 Oct 2020 Vaisakh Shaj, Philipp Becker, Dieter Buchler, Harit Pandya, Niels van Duijkeren, C. James Taylor, Marc Hanheide, Gerhard Neumann

We adopt a recent probabilistic recurrent neural network architecture, called Re-current Kalman Networks (RKNs), to model learning by conditioning its transition dynamics on the control actions.

Friction

Imitation Learning for Autonomous Trajectory Learning of Robot Arms in Space

no code implementations10 Aug 2020 RB Ashith Shyam, Zhou Hao, Umberto Montanaro, Gerhard Neumann

Since actual hardware implementation of microgravity environment is extremely expensive, the demonstration data for trajectory learning is generated using a model predictive controller (MPC) in a physics based simulator.

Imitation Learning Trajectory Planning

Non-Adversarial Imitation Learning and its Connections to Adversarial Methods

1 code implementation8 Aug 2020 Oleg Arenz, Gerhard Neumann

We also show that our non-adversarial formulation can be used to derive novel algorithms by presenting a method for offline imitation learning that is inspired by the recent ValueDice algorithm, but does not rely on small policy updates for convergence.

Imitation Learning

Expected Information Maximization: Using the I-Projection for Mixture Density Estimation

1 code implementation ICLR 2020 Philipp Becker, Oleg Arenz, Gerhard Neumann

Such behavior is appealing whenever we deal with highly multi-modal data where modelling single modes correctly is more important than covering all the modes.

Density Estimation Traffic Prediction

Trust-Region Variational Inference with Gaussian Mixture Models

no code implementations10 Jul 2019 Oleg Arenz, Mingjun Zhong, Gerhard Neumann

For efficient improvement of the GMM approximation, we derive a lower bound on the corresponding optimization objective enabling us to update the components independently.

Variational Inference

Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces

3 code implementations17 May 2019 Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, Gerhard Neumann

In order to integrate uncertainty estimates into deep time-series modelling, Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep learning models, however, such approaches typically rely on approximate inference techniques such as variational inference which makes learning more complex and often less scalable due to approximation errors.

Image Imputation Imputation +4

An Algorithmic Perspective on Imitation Learning

no code implementations16 Nov 2018 Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J. Andrew Bagnell, Pieter Abbeel, Jan Peters

This process of learning from demonstrations, and the study of algorithms to do so, is called imitation learning.

Imitation Learning Learning Theory

Adaptation and Robust Learning of Probabilistic Movement Primitives

1 code implementation31 Aug 2018 Sebastian Gomez-Gonzalez, Gerhard Neumann, Bernhard Schölkopf, Jan Peters

However, to be able to capture variability and correlations between different joints, a probabilistic movement primitive requires the estimation of a larger number of parameters compared to their deterministic counterparts, that focus on modeling only the mean behavior.

Towards Fine Grained Network Flow Prediction

no code implementations20 Aug 2018 Patrick Jahnke, Emmanuel Stapf, Jonas Mieseler, Gerhard Neumann, Patrick Eugster

In this space, into which we transform the input data via a Short-Time Fourier Transform (STFT), the peak structures of flows can be predicted after gleaning their key characteristics, with a Principal Component Analysis (PCA), from past and ongoing flows that stem from the same socket-to-socket connection.

Traffic Prediction

Deep Reinforcement Learning for Swarm Systems

1 code implementation17 Jul 2018 Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

However, concatenation scales poorly to swarm systems with a large number of homogeneous agents as it does not exploit the fundamental properties inherent to these systems: (i) the agents in the swarm are interchangeable and (ii) the exact number of agents in the swarm is irrelevant.

Decision Making reinforcement-learning +1

Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning

no code implementations21 Sep 2017 Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

Swarm systems constitute a challenging problem for reinforcement learning (RL) as the algorithm needs to learn decentralized control policies that can cope with limited local sensing and communication abilities of the agents.

reinforcement-learning Reinforcement Learning (RL)

Guided Deep Reinforcement Learning for Swarm Systems

1 code implementation18 Sep 2017 Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

Here, we follow a guided approach where a critic has central access to the global state during learning, which simplifies the policy evaluation problem from a reinforcement learning point of view.

reinforcement-learning Reinforcement Learning (RL)

Local Bayesian Optimization of Motor Skills

no code implementations ICML 2017 Riad Akrour, Dmitry Sorokin, Jan Peters, Gerhard Neumann

Bayesian optimization is renowned for its sample efficiency but its application to higher dimensional tasks is impeded by its focus on global optimization.

Bayesian Optimization Imitation Learning

Catching heuristics are optimal control policies

no code implementations NeurIPS 2016 Boris Belousov, Gerhard Neumann, Constantin A. Rothkopf, Jan R. Peters

In this paper, we show that interception strategies appearing to be heuristics can be understood as computational solutions to the optimal control problem faced by a ball-catching agent acting under uncertainty.

Policy Search with High-Dimensional Context Variables

no code implementations10 Nov 2016 Voot Tangkaratt, Herke van Hoof, Simone Parisi, Gerhard Neumann, Jan Peters, Masashi Sugiyama

A naive application of unsupervised dimensionality reduction methods to the context variables, such as principal component analysis, is insufficient as task-relevant input may be ignored.

Dimensionality Reduction Vocal Bursts Intensity Prediction

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement

no code implementations29 Jun 2016 Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Jan Peters, Gerhard Neumann

In order to show the monotonic improvement of our algorithm, we additionally conduct a theoretical analysis of our policy update scheme to derive a lower bound of the change in policy return between successive iterations.

Probabilistic Movement Primitives

no code implementations NeurIPS 2013 Alexandros Paraschos, Christian Daniel, Jan R. Peters, Gerhard Neumann

In order to use such a trajectory distribution for robot movement control, we analytically derive a stochastic feedback controller which reproduces the given trajectory distribution.

Fitted Q-iteration by Advantage Weighted Regression

no code implementations NeurIPS 2008 Gerhard Neumann, Jan R. Peters

Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the resulting policy.

regression

Cannot find the paper you are looking for? You can Submit a new open access paper.