Search Results for author: Gerhard Neumann

Found 67 papers, 25 papers with code

Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts

no code implementations • 11 Mar 2024 • Onur Celik, Aleksandar Taranovic, Gerhard Neumann

Reinforcement learning (RL) is a powerful approach for acquiring a good-performing policy.

Paper
Add Code

Vlearn: Off-Policy Learning with Efficient State-Value Function Estimation

no code implementations • 7 Mar 2024 • Fabian Otto, Philipp Becker, Vien Ang Ngo, Gerhard Neumann

Existing off-policy reinforcement learning algorithms typically necessitate an explicit state-action-value function representation, which becomes problematic in high-dimensional action spaces.

Efficient Exploration

Paper
Add Code

Physics-informed MeshGraphNets (PI-MGNs): Neural finite element solvers for non-stationary and nonlinear simulations on arbitrary meshes

no code implementations • 16 Feb 2024 • Tobias Würth, Niklas Freymuth, Clemens Zimmerling, Gerhard Neumann, Luise Kärger

This work introduces PI-MGNs, a hybrid approach that combines PINNs and MGNs to quickly and accurately solve non-stationary and nonlinear partial differential equations (PDEs) on arbitrary meshes.

Paper
Add Code

Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning

1 code implementation • 21 Jan 2024 • Ge Li, Hongyi Zhou, Dominik Roth, Serge Thilges, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann

Current advancements in reinforcement learning (RL) have predominantly focused on learning step-based policies that generate actions for each perceived state.

Reinforcement Learning (RL)

Paper
Code

Neural Contractive Dynamical Systems

no code implementations • 17 Jan 2024 • Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Nadia Figueroa, Gerhard Neumann, Leonel Rozo

Stability guarantees are crucial when ensuring a fully autonomous robot does not take undesirable or potentially harmful actions.

Paper
Add Code

Domain-Specific Fine-Tuning of Large Language Models for Interactive Robot Programming

no code implementations • 21 Dec 2023 • Benjamin Alt, Urs Keßner, Aleksandar Taranovic, Darko Katic, Andreas Hermann, Rainer Jäkel, Gerhard Neumann

Industrial robots are applied in a widening range of industries, but robot programming mostly remains a task limited to programming experts.

Industrial Robots

Paper
Add Code

Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects

no code implementations • 15 Dec 2023 • Paul Maria Scheikl, Nicolas Schreiber, Christoph Haas, Niklas Freymuth, Gerhard Neumann, Rudolf Lioutikov, Franziska Mathis-Ullrich

Policy learning in robot-assisted surgery (RAS) lacks data efficient and versatile methods that exhibit the desired motion quality for delicate surgical interventions.

Imitation Learning

Paper
Add Code

Registered and Segmented Deformable Object Reconstruction from a Single View Point Cloud

no code implementations • 13 Nov 2023 • Pit Henrich, Balázs Gyenes, Paul Maria Scheikl, Gerhard Neumann, Franziska Mathis-Ullrich

In deformable object manipulation, we often want to interact with specific segments of an object that are only defined in non-deformed models of the object.

Deformable Object Manipulation Object +1

Paper
Add Code

Latent Task-Specific Graph Network Simulators

1 code implementation • 9 Nov 2023 • Philipp Dahlinger, Niklas Freymuth, Michael Volpp, Tai Hoang, Gerhard Neumann

Movement primitives further allow us to accommodate various types of context data, as demonstrated through the utilization of point clouds during inference.

Meta-Learning Trajectory Prediction

Paper
Code

Information-Theoretic Trust Regions for Stochastic Gradient-Based Optimization

1 code implementation • 31 Oct 2023 • Philipp Dahlinger, Philipp Becker, Maximilian Hüttenrauch, Gerhard Neumann

Before each update, it solves the trust region problem for an optimal step size, resulting in a more stable and faster optimization process.

Paper
Code

Multi Time Scale World Models

1 code implementation • NeurIPS 2023 • Vaisakh Shaj, Saleh Gholam Zadeh, Ozan Demir, Luiz Ricardo Douat, Gerhard Neumann

Intelligent agents use internal world models to reason and make predictions about different courses of their actions at many scales.

Paper
Code

SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects

no code implementations • 31 Aug 2023 • Ning Gao, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of the critical aspects.

6D Pose Estimation Object

Paper
Add Code

Enhancing Interpretable Object Abstraction via Clustering-based Slot Initialization

no code implementations • 22 Aug 2023 • Ning Gao, Bernard Hohmann, Gerhard Neumann

In our work, we initialize the slot representations with clustering algorithms conditioned on the perceptual input features.

Clustering Novel View Synthesis +2

Paper
Add Code

DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes

no code implementations • 1 Aug 2023 • Philipp Blättner, Johannes Brand, Gerhard Neumann, Ngo Anh Vien

The results demonstrate the effectiveness of the proposed approach in predicting versatile and dense grasps, and in advancing the field of multi-fingered robotic grasping.

Computational Efficiency Grasp Generation +1

Paper
Add Code

SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation

1 code implementation • 1 Jul 2023 • Fabian Duffhauss, Sebastian Koch, Hanna Ziesche, Ngo Anh Vien, Gerhard Neumann

Detecting objects and estimating their 6D poses is essential for automated systems to interact safely with the environment.

6D Pose Estimation 6D Pose Estimation using RGB +3

Paper
Code

MP3: Movement Primitive-Based (Re-)Planning Policy

no code implementations • 22 Jun 2023 • Fabian Otto, Hongyi Zhou, Onur Celik, Ge Li, Rudolf Lioutikov, Gerhard Neumann

We introduce a novel deep reinforcement learning (RL) approach called Movement Primitive-based Planning Policy (MP3).

Reinforcement Learning (RL)

Paper
Add Code

Curriculum-Based Imitation of Versatile Skills

1 code implementation • 11 Apr 2023 • Maximilian Xiling Li, Onur Celik, Philipp Becker, Denis Blessing, Rudolf Lioutikov, Gerhard Neumann

Learning skills by imitation is a promising concept for the intuitive teaching of robots.

Imitation Learning

Paper
Code

Swarm Reinforcement Learning For Adaptive Mesh Refinement

1 code implementation • NeurIPS 2023 • Niklas Freymuth, Philipp Dahlinger, Tobias Würth, Simon Reisch, Luise Kärger, Gerhard Neumann

Adaptive Mesh Refinement (AMR) enhances the Finite Element Method, an important technique for simulating complex problems in engineering, by dynamically refining mesh regions, enabling a favorable trade-off between computational speed and simulation accuracy.

reinforcement-learning

Paper
Code

Information Maximizing Curriculum: A Curriculum-Based Approach for Imitating Diverse Skills

1 code implementation • 27 Mar 2023 • Denis Blessing, Onur Celik, Xiaogang Jia, Moritz Reuss, Maximilian Xiling Li, Rudolf Lioutikov, Gerhard Neumann

Imitation learning uses data for training policies to solve complex tasks.

Imitation Learning

Paper
Code

Grounding Graph Network Simulators using Physical Sensor Observations

1 code implementation • 23 Feb 2023 • Jonas Linkerhägner, Niklas Freymuth, Paul Maria Scheikl, Franziska Mathis-Ullrich, Gerhard Neumann

Our method results in utilization of additional point cloud information to accurately predict stable simulations where existing Graph Network Simulators fail.

Imputation Motion Planning +1

Paper
Code

Joint Representations for Reinforcement Learning with Multiple Sensors

no code implementations • 10 Feb 2023 • Philipp Becker, Sebastian Markgraf, Fabian Otto, Gerhard Neumann

Combining inputs from multiple sensor modalities effectively in reinforcement learning (RL) is an open problem.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Deep Black-Box Reinforcement Learning with Movement Primitives

1 code implementation • 18 Oct 2022 • Fabian Otto, Onur Celik, Hongyi Zhou, Hanna Ziesche, Ngo Anh Vien, Gerhard Neumann

In this paper, we present a new algorithm for deep ERL.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Inferring Versatile Behavior from Demonstrations by Matching Geometric Descriptors

1 code implementation • 17 Oct 2022 • Niklas Freymuth, Nicolas Schreiber, Philipp Becker, Aleksandar Taranovic, Gerhard Neumann

We find that the geometric descriptors greatly help in generalizing to new task configurations and that combining them with our distribution-matching objective is crucial for representing and reproducing versatile behavior.

Imitation Learning

Paper
Code

On Uncertainty in Deep State Space Models for Model-Based Reinforcement Learning

1 code implementation • 17 Oct 2022 • Philipp Becker, Gerhard Neumann

We show that RSSMs use a suboptimal inference scheme and that models trained using this inference overestimate the aleatoric uncertainty of the ground truth system.

Model-based Reinforcement Learning reinforcement-learning +2

Paper
Code

ProDMPs: A Unified Perspective on Dynamic and Probabilistic Movement Primitives

no code implementations • 4 Oct 2022 • Ge Li, Zeqi Jin, Michael Volpp, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann

MPs can be broadly categorized into two types: (a) dynamics-based approaches that generate smooth trajectories from any initial state, e. g., Dynamic Movement Primitives (DMPs), and (b) probabilistic approaches that capture higher-order statistics of the motion, e. g., Probabilistic Movement Primitives (ProMPs).

Numerical Integration

Paper
Add Code

A Unified Perspective on Natural Gradient Variational Inference with Gaussian Mixture Models

1 code implementation • 23 Sep 2022 • Oleg Arenz, Philipp Dahlinger, Zihan Ye, Michael Volpp, Gerhard Neumann

The two currently most effective methods for GMM-based variational inference, VIPS and iBayes-GMM, both employ independent natural gradient updates for the individual components and their weights.

Variational Inference

Paper
Code

FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion

no code implementations • 22 Sep 2022 • Fabian Duffhauss, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

Sensor fusion can significantly improve the performance of many computer vision tasks.

Sensor Fusion

Paper
Add Code

MV6D: Multi-View 6D Pose Estimation on RGB-D Frames Using a Deep Point-wise Voting Network

no code implementations • 1 Aug 2022 • Fabian Duffhauss, Tobias Demmler, Gerhard Neumann

We overcome this issue with our novel multi-view 6D pose estimation method called MV6D which accurately predicts the 6D poses of all objects in a cluttered scene based on RGB-D images from multiple perspectives.

6D Pose Estimation Semantic Segmentation

Paper
Add Code

Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination

no code implementations • 31 Jul 2022 • Abdalkarim Mohtasib, Gerhard Neumann, Heriberto Cuayahuitl

Learning robotic tasks in the real world is still highly challenging and effective practical solutions remain to be found.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Hidden Parameter Recurrent State Space Models For Changing Dynamics Scenarios

1 code implementation • ICLR 2022 • Vaisakh Shaj, Dieter Buchler, Rohit Sonker, Philipp Becker, Gerhard Neumann

Recurrent State-space models (RSSMs) are highly expressive models for learning patterns in time series data and system identification.

Time Series Time Series Analysis +1

Paper
Code

Category-Agnostic 6D Pose Estimation with Conditional Neural Processes

no code implementations • 14 Jun 2022 • Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann

We present a novel meta-learning approach for 6D pose estimation on unknown objects.

6D Pose Estimation Meta-Learning +1

Paper
Add Code

End-to-End Learning of Hybrid Inverse Dynamics Models for Precise and Compliant Impedance Control

no code implementations • 27 May 2022 • Moritz Reuss, Niels van Duijkeren, Robert Krug, Philipp Becker, Vaisakh Shaj, Gerhard Neumann

These models need to precisely capture the robot dynamics, which consist of well-understood components, e. g., rigid body dynamics, and effects that remain challenging to capture, e. g., stick-slip friction and mechanical flexibilities.

Friction

Paper
Add Code

Regret-Aware Black-Box Optimization with Natural Gradients, Trust-Regions and Entropy Control

no code implementations • 24 May 2022 • Maximilian Hüttenrauch, Gerhard Neumann

In contrast, stochastic optimizers that are motivated by policy gradients, such as the Model-based Relative Entropy Stochastic Search (MORE) algorithm, directly optimize the expected fitness function without the use of rankings.

Scheduling

Paper
Add Code

Meta-Learning Regrasping Strategies for Physical-Agnostic Objects

no code implementations • 23 May 2022 • Ning Gao, Jingyu Zhang, Ruijie Chen, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

Grasping inhomogeneous objects in real-world applications remains a challenging task due to the unknown physical properties such as mass distribution and coefficient of friction.

Friction Meta-Learning

Paper
Add Code

Reactive Motion Generation on Learned Riemannian Manifolds

no code implementations • 15 Mar 2022 • Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Gerhard Neumann, Leonel Rozo

We argue that Riemannian manifolds may be learned via human demonstrations in which geodesics are natural motion skills.

Paper
Add Code

What Matters For Meta-Learning Vision Regression Tasks?

2 code implementations • CVPR 2022 • Ning Gao, Hanna Ziesche, Ngo Anh Vien, Michael Volpp, Gerhard Neumann

To this end, we (i) exhaustively evaluate common meta-learning techniques on these tasks, and (ii) quantitatively analyze the effect of various deep learning techniques commonly used in recent meta-learning algorithms in order to strengthen the generalization capability: data augmentation, domain randomization, task augmentation and meta-regularization.

Contrastive Learning Data Augmentation +4

Paper
Code

Specializing Versatile Skill Libraries using Local Mixture of Experts

1 code implementation • 8 Dec 2021 • Onur Celik, Dongzhuoran Zhou, Ge Li, Philipp Becker, Gerhard Neumann

This local and incremental learning results in a modular MoE model of high accuracy and versatility, where both properties can be scaled by adding more components on the fly.

Incremental Learning Reinforcement Learning (RL)

Paper
Code

Switching Recurrent Kalman Networks

no code implementations • 16 Nov 2021 • Giao Nguyen-Quynh, Philipp Becker, Chen Qiu, Maja Rudolph, Gerhard Neumann

In addition, driving data can often be multimodal in distribution, meaning that there are distinct predictions that are likely, but averaging can hurt model performance.

Autonomous Driving Time Series +1

Paper
Add Code

Versatile Inverse Reinforcement Learning via Cumulative Rewards

no code implementations • 15 Nov 2021 • Niklas Freymuth, Philipp Becker, Gerhard Neumann

Inverse Reinforcement Learning infers a reward function from expert demonstrations, aiming to encode the behavior and intentions of the expert.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A First-Order Method for Estimating Natural Gradients for Variational Inference with Gaussians and Gaussian Mixture Models

no code implementations • 29 Sep 2021 • Oleg Arenz, Zihan Ye, Philipp Dahlinger, Gerhard Neumann

Effective approaches for Gaussian variational inference are MORE, VOGN, and VON, which are zero-order, first-order, and second-order, respectively.

Variational Inference

Paper
Add Code

A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning

no code implementations • 6 Aug 2021 • Abdalkarim Mohtasib, Gerhard Neumann, Heriberto Cuayahuitl

We argue that it is crucial to automate the reward learning process so that new skills can be taught to robots by their users.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Differentiable Robust LQR Layers

no code implementations • 10 Jun 2021 • Ngo Anh Vien, Gerhard Neumann

This paper proposes a differentiable robust LQR layer for reinforcement learning and imitation learning under model uncertainty and stochastic dynamics.

Imitation Learning Inductive Bias

Paper
Add Code

Residual Feedback Learning for Contact-Rich Manipulation Tasks with Uncertainty

no code implementations • 8 Jun 2021 • Alireza Ranjbar, Ngo Anh Vien, Hanna Ziesche, Joschka Boedecker, Gerhard Neumann

We propose a new formulation that addresses these limitations by also modifying the feedback signals to the controller with an RL policy and show superior performance of our approach on a contact-rich peg-insertion task under position and orientation uncertainty.

Position Reinforcement Learning (RL)

Paper
Add Code

Learning Riemannian Manifolds for Geodesic Motion Skills

no code implementations • 8 Jun 2021 • Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Gerhard Neumann, Leonel Rozo

For robots to work alongside humans and perform in unstructured environments, they must learn new motion skills and adapt them to unseen situations on the fly.

Paper
Add Code

Differentiable Trust Region Layers for Deep Reinforcement Learning

1 code implementation • ICLR 2021 • Fabian Otto, Philipp Becker, Ngo Anh Vien, Hanna Carolin Ziesche, Gerhard Neumann

However, enforcing such trust regions in deep reinforcement learning is difficult.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Bayesian Context Aggregation for Neural Processes

no code implementations • ICLR 2021 • Michael Volpp, Fabian Flürenbrock, Lukas Grossberger, Christian Daniel, Gerhard Neumann

Recently, casting probabilistic regression as a multi-task learning problem in terms of conditional latent variable (CLV) models such as the Neural Process (NP) has shown promising results.

Bayesian Inference Multi-Task Learning +1

Paper
Add Code

Action-Conditional Recurrent Kalman Networks For Forward and Inverse Dynamics Learning

2 code implementations • 20 Oct 2020 • Vaisakh Shaj, Philipp Becker, Dieter Buchler, Harit Pandya, Niels van Duijkeren, C. James Taylor, Marc Hanheide, Gerhard Neumann

We adopt a recent probabilistic recurrent neural network architecture, called Re-current Kalman Networks (RKNs), to model learning by conditioning its transition dynamics on the control actions.

Friction

Paper
Code

Imitation Learning for Autonomous Trajectory Learning of Robot Arms in Space

no code implementations • 10 Aug 2020 • RB Ashith Shyam, Zhou Hao, Umberto Montanaro, Gerhard Neumann

Since actual hardware implementation of microgravity environment is extremely expensive, the demonstration data for trajectory learning is generated using a model predictive controller (MPC) in a physics based simulator.

Imitation Learning Trajectory Planning

Paper
Add Code

Non-Adversarial Imitation Learning and its Connections to Adversarial Methods

1 code implementation • 8 Aug 2020 • Oleg Arenz, Gerhard Neumann

We also show that our non-adversarial formulation can be used to derive novel algorithms by presenting a method for offline imitation learning that is inspired by the recent ValueDice algorithm, but does not rely on small policy updates for convergence.

Imitation Learning

Paper
Code

Expected Information Maximization: Using the I-Projection for Mixture Density Estimation

1 code implementation • ICLR 2020 • Philipp Becker, Oleg Arenz, Gerhard Neumann

Such behavior is appealing whenever we deal with highly multi-modal data where modelling single modes correctly is more important than covering all the modes.

Density Estimation Traffic Prediction

Paper
Code

Trust-Region Variational Inference with Gaussian Mixture Models

no code implementations • 10 Jul 2019 • Oleg Arenz, Mingjun Zhong, Gerhard Neumann

For efficient improvement of the GMM approximation, we derive a lower bound on the corresponding optimization objective enabling us to update the components independently.

Variational Inference

Paper
Add Code

Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces

3 code implementations • 17 May 2019 • Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, Gerhard Neumann

In order to integrate uncertainty estimates into deep time-series modelling, Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep learning models, however, such approaches typically rely on approximate inference techniques such as variational inference which makes learning more complex and often less scalable due to approximation errors.

Image Imputation Imputation +4

Paper
Code

Compatible Natural Gradient Policy Search

no code implementations • 7 Feb 2019 • Joni Pajarinen, Hong Linh Thai, Riad Akrour, Jan Peters, Gerhard Neumann

Trust-region methods have yielded state-of-the-art results in policy search.

Continuous Control

Paper
Add Code

An Algorithmic Perspective on Imitation Learning

no code implementations • 16 Nov 2018 • Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J. Andrew Bagnell, Pieter Abbeel, Jan Peters

This process of learning from demonstrations, and the study of algorithms to do so, is called imitation learning.

Imitation Learning Learning Theory

Paper
Add Code

Adaptation and Robust Learning of Probabilistic Movement Primitives

1 code implementation • 31 Aug 2018 • Sebastian Gomez-Gonzalez, Gerhard Neumann, Bernhard Schölkopf, Jan Peters

However, to be able to capture variability and correlations between different joints, a probabilistic movement primitive requires the estimation of a larger number of parameters compared to their deterministic counterparts, that focus on modeling only the mean behavior.

Paper
Code

Towards Fine Grained Network Flow Prediction

no code implementations • 20 Aug 2018 • Patrick Jahnke, Emmanuel Stapf, Jonas Mieseler, Gerhard Neumann, Patrick Eugster

In this space, into which we transform the input data via a Short-Time Fourier Transform (STFT), the peak structures of flows can be predicted after gleaning their key characteristics, with a Principal Component Analysis (PCA), from past and ongoing flows that stem from the same socket-to-socket connection.

Traffic Prediction

Paper
Add Code

Deep Reinforcement Learning for Swarm Systems

1 code implementation • 17 Jul 2018 • Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

However, concatenation scales poorly to swarm systems with a large number of homogeneous agents as it does not exploit the fundamental properties inherent to these systems: (i) the agents in the swarm are interchangeable and (ii) the exact number of agents in the swarm is irrelevant.

Decision Making reinforcement-learning +1

Paper
Code

Efficient Gradient-Free Variational Inference using Policy Search

1 code implementation • ICML 2018 • Oleg Arenz, Gerhard Neumann, Mingjun Zhong

Inference from complex distributions is a common problem in machine learning needed for many Bayesian methods.

Efficient Exploration Variational Inference

Paper
Code

Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning

no code implementations • 21 Sep 2017 • Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

Swarm systems constitute a challenging problem for reinforcement learning (RL) as the algorithm needs to learn decentralized control policies that can cope with limited local sensing and communication abilities of the agents.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Guided Deep Reinforcement Learning for Swarm Systems

1 code implementation • 18 Sep 2017 • Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

Here, we follow a guided approach where a critic has central access to the global state during learning, which simplifies the policy evaluation problem from a reinforcement learning point of view.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Local Bayesian Optimization of Motor Skills

no code implementations • ICML 2017 • Riad Akrour, Dmitry Sorokin, Jan Peters, Gerhard Neumann

Bayesian optimization is renowned for its sample efficiency but its application to higher dimensional tasks is impeded by its focus on global optimization.

Bayesian Optimization Imitation Learning

Paper
Add Code

Catching heuristics are optimal control policies

no code implementations • NeurIPS 2016 • Boris Belousov, Gerhard Neumann, Constantin A. Rothkopf, Jan R. Peters

In this paper, we show that interception strategies appearing to be heuristics can be understood as computational solutions to the optimal control problem faced by a ball-catching agent acting under uncertainty.

Paper
Add Code

Policy Search with High-Dimensional Context Variables

no code implementations • 10 Nov 2016 • Voot Tangkaratt, Herke van Hoof, Simone Parisi, Gerhard Neumann, Jan Peters, Masashi Sugiyama

A naive application of unsupervised dimensionality reduction methods to the context variables, such as principal component analysis, is insufficient as task-relevant input may be ignored.

Dimensionality Reduction Vocal Bursts Intensity Prediction

Paper
Add Code

Model-Free Trajectory-based Policy Optimization with Monotonic Improvement

no code implementations • 29 Jun 2016 • Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Jan Peters, Gerhard Neumann

In order to show the monotonic improvement of our algorithm, we additionally conduct a theoretical analysis of our policy update scheme to derive a lower bound of the change in policy return between successive iterations.

Paper
Add Code

Model-Based Relative Entropy Stochastic Search

no code implementations • NeurIPS 2015 • Abbas Abdolmaleki, Rudolf Lioutikov, Jan R. Peters, Nuno Lau, Luis Pualo Reis, Gerhard Neumann

Stochastic search algorithms are general black-box optimizers.

Paper
Add Code

Probabilistic Movement Primitives

no code implementations • NeurIPS 2013 • Alexandros Paraschos, Christian Daniel, Jan R. Peters, Gerhard Neumann

In order to use such a trajectory distribution for robot movement control, we analytically derive a stochastic feedback controller which reproduces the given trajectory distribution.

Paper
Add Code

Fitted Q-iteration by Advantage Weighted Regression

no code implementations • NeurIPS 2008 • Gerhard Neumann, Jan R. Peters

Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the resulting policy.

regression

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.