no code implementations • 14 Oct 2024 • Cornelius V. Braun, Robert T. Lange, Marc Toussaint
Stein Variational Gradient Descent (SVGD) is a highly efficient method to sample from an unnormalized probability distribution.
no code implementations • 25 Jul 2024 • Cen-You Li, Marc Toussaint, Barbara Rakitsch, Christoph Zimmer
Active learning (AL) is a sequential learning scheme aiming to select the most informative data.
no code implementations • 3 Jul 2024 • Marc Toussaint, Cornelius V. Braun, Joaquim Ortiz-Haro
Generating diverse samples under hard constraints is a core challenge in many areas.
1 code implementation • 22 Feb 2024 • Cen-You Li, Olaf Duennbier, Marc Toussaint, Barbara Rakitsch, Christoph Zimmer
As transferable source knowledge is often available in safety critical experiments, we propose to consider transfer safe sequential learning to accelerate the learning of safety.
2 code implementations • 6 Mar 2023 • Danny Driess, Fei Xia, Mehdi S. M. Sajjadi, Corey Lynch, Aakanksha Chowdhery, Brian Ichter, Ayzaan Wahid, Jonathan Tompson, Quan Vuong, Tianhe Yu, Wenlong Huang, Yevgen Chebotar, Pierre Sermanet, Daniel Duckworth, Sergey Levine, Vincent Vanhoucke, Karol Hausman, Marc Toussaint, Klaus Greff, Andy Zeng, Igor Mordatch, Pete Florence
Large language models excel at a wide range of complex tasks.
Ranked #2 on Visual Question Answering (VQA) on OK-VQA
no code implementations • 3 Jun 2022 • Danny Driess, Ingmar Schubert, Pete Florence, Yunzhu Li, Marc Toussaint
This paper demonstrates that learning state representations with supervision from Neural Radiance Fields (NeRFs) can improve the performance of RL compared to other learned representations or even low-dimensional, hand-engineered state information.
no code implementations • 24 Feb 2022 • Danny Driess, Zhiao Huang, Yunzhu Li, Russ Tedrake, Marc Toussaint
We present a method to learn compositional multi-object dynamics models from image observations based on implicit object encoders, Neural Radiance Fields (NeRFs), and graph neural networks.
1 code implementation • NeurIPS 2021 • Ingmar Schubert, Danny Driess, Ozgur S. Oguz, Marc Toussaint
Applications of Reinforcement Learning (RL) in robotics are often limited by high data demand.
no code implementations • 28 Oct 2021 • Nicholas Roy, Ingmar Posner, Tim Barfoot, Philippe Beaudoin, Yoshua Bengio, Jeannette Bohg, Oliver Brock, Isabelle Depatie, Dieter Fox, Dan Koditschek, Tomas Lozano-Perez, Vikash Mansinghka, Christopher Pal, Blake Richards, Dorsa Sadigh, Stefan Schaal, Gaurav Sukhatme, Denis Therien, Marc Toussaint, Michiel Van de Panne
Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains.
no code implementations • 2 Oct 2021 • Danny Driess, Jung-Su Ha, Marc Toussaint, Russ Tedrake
We show that representing objects as signed-distance fields not only enables to learn and represent a variety of models with higher accuracy compared to point-cloud and occupancy measure representations, but also that SDF-based models are suitable for optimization-based planning.
no code implementations • 29 Sep 2021 • Jung-Su Ha, Danny Driess, Marc Toussaint
Robotic manipulation planning is the problem of finding a sequence of robot configurations that involves interactions with objects in the scene, e. g., grasp, placement, tool-use, etc.
no code implementations • 30 Jul 2021 • Hon Sum Alec Yu, Dingling Yao, Christoph Zimmer, Marc Toussaint, Duy Nguyen-Tuong
We investigate active learning in Gaussian Process state-space models (GPSSM).
no code implementations • 14 Jul 2021 • Ingmar Schubert, Ozgur S. Oguz, Marc Toussaint
In high-dimensional state spaces, the usefulness of Reinforcement Learning (RL) is limited by the problem of exploration.
no code implementations • 5 Jul 2021 • Yoojin Oh, Marc Toussaint, Jim Mainprice
After presenting all the components of the system and their empirical evaluation, we present experimental results comparing our pipeline to a direct traded control approach (i. e., one that does not use prediction) which shows that using intent prediction allows to bring down the overall task execution time.
no code implementations • 5 Jul 2021 • Janik Hager, Ruben Bauer, Marc Toussaint, Jim Mainprice
To this extend, we define grasp manifolds via a set of key points and locate them in images using a Mask R-CNN backbone.
no code implementations • 9 Mar 2021 • Marc Tuscher, Julian Hörz, Danny Driess, Marc Toussaint
We propose a robotic manipulation system, which is able to grasp a wide variety of formerly unseen objects and is robust against object perturbations and inferior grasping points.
no code implementations • 28 Jan 2021 • David Hägele, Moataz Abdelaal, Ozgur S. Oguz, Marc Toussaint, Daniel Weiskopf
Nonlinear programming targets nonlinear optimization with constraints, which is a generic yet complex methodology involving humans for problem modeling and algorithms for problem solving.
Motion Planning Robotics Human-Computer Interaction Numerical Analysis Numerical Analysis H.5.2; G.1.6
no code implementations • ICLR 2021 • Ingmar Schubert, Ozgur S Oguz, Marc Toussaint
In high-dimensional state spaces, the usefulness of Reinforcement Learning (RL) is limited by the problem of exploration.
1 code implementation • 23 Nov 2020 • Philipp Kratzer, Simon Bihlmaier, Niteesh Balachandra Midlagajni, Rohit Prakash, Marc Toussaint, Jim Mainprice
Hence, in this paper, we present a novel dataset of full-body motion for everyday manipulation tasks, which includes the above.
Robotics
1 code implementation • 18 Jul 2020 • Andreas Orthey, Sohaib Akbar, Marc Toussaint
Those methods exploit the structure of fiber bundles through the use of bundle primitives.
Robotics
no code implementations • 15 Jul 2020 • Sabrina Hoppe, Marc Toussaint
By selecting a subgraph with a favorable structure, we construct a simplified Markov Decision Process for which exact Q-values can be computed efficiently as more data comes in.
no code implementations • 9 Jun 2020 • Danny Driess, Jung-Su Ha, Marc Toussaint
This is possible by encoding the objects of the scene in images as input to the neural network, instead of a fixed feature vector.
no code implementations • 29 Mar 2020 • Christian Henkel, Marc Toussaint
It is a method to build a directed roadmap graph that allows for collision avoidance in multi-robot navigation.
1 code implementation • 28 Feb 2020 • Marc Toussaint, Jung-Su Ha, Danny Driess
Physical reasoning is a core aspect of intelligence in animals and humans.
Robotics
no code implementations • 4 Oct 2019 • Philipp Kratzer, Marc Toussaint, Jim Mainprice
Human movement prediction is difficult as humans naturally exhibit complex behaviors that can change drastically from one environment to the next.
2 code implementations • 11 Sep 2019 • Andreas Orthey, Benjamin Frész, Marc Toussaint
Those minima are important to visualize to let a user guide, prevent or predict motions.
no code implementations • 8 Sep 2019 • Muhammad Usman Khalid, Janik M. Hager, Werner Kraus, Marco F. Huber, Marc Toussaint
For most industrial bin picking solutions, the pose of a workpiece is localized by matching a CAD model to point cloud obtained from 3D sensor.
no code implementations • 4 Jun 2019 • Andreas Orthey, Marc Toussaint
Motion planning problems can be simplified by admissible projections of the configuration space to sequences of lower-dimensional quotient-spaces, called sequential simplifications.
2 code implementations • 14 May 2019 • Andreas Doerr, Michael Volpp, Marc Toussaint, Sebastian Trimpe, Christian Daniel
Policy gradient methods are powerful reinforcement learning algorithms and have been demonstrated to solve many complex tasks.
no code implementations • 5 Mar 2018 • Peter Englert, Marc Toussaint
The transfer of a robot skill between different geometric environments is non-trivial since a wide variety of environments exists, sensor observations as well as robot motions are high-dimensional, and the environment might only be partially observed.
4 code implementations • ICML 2018 • Andreas Doerr, Christian Daniel, Martin Schiegg, Duy Nguyen-Tuong, Stefan Schaal, Marc Toussaint, Sebastian Trimpe
State-space models (SSMs) are a highly expressive model class for learning patterns in time series data and for system identification.
no code implementations • 25 Jul 2017 • Ilker Yildirim, Tobias Gerstenberg, Basil Saeed, Marc Toussaint, Josh Tenenbaum
In Experiment~2, we asked participants online to judge whether they think the person in the lab used one or two hands.
no code implementations • 23 Jan 2017 • Andrea Baisero, Stefan Otte, Peter Englert, Marc Toussaint
Successful human-robot cooperation hinges on each agent's ability to process and exchange information about the shared environment and the task at hand.
no code implementations • 9 Dec 2016 • Kim Peter Wabersich, Marc Toussaint
Bayesian Optimization (BO) has become a core method for solving expensive black-box optimization problems.
no code implementations • 26 Sep 2014 • Johannes Kulick, Robert Lieck, Marc Toussaint
Gathering the most information by picking the least amount of data is a common task in experimental design or when exploring an unknown environment in reinforcement learning and robotics.
1 code implementation • 1 Jul 2014 • Marc Toussaint
This is a documentation of a framework for robot motion optimization that aims to draw on classical constrained optimization methods.
Robotics
no code implementations • 16 Jan 2014 • Tobias Lang, Marc Toussaint
They are compact and generalize over world instantiations.
no code implementations • NeurIPS 2012 • Manuel Lopes, Tobias Lang, Marc Toussaint, Pierre-Yves Oudeyer
Formal exploration approaches in model-based reinforcement learning estimate the accuracy of the currently learned model without consideration of the empirical prediction error.
Model-based Reinforcement Learning reinforcement-learning +2
no code implementations • NeurIPS 2010 • Konrad Rawlik, Marc Toussaint, Sethu Vijayakumar
Algorithms based on iterative local approximations present a practical approach to optimal control in robotic systems.
no code implementations • NeurIPS 2007 • Ben Williams, Marc Toussaint, Amos J. Storkey
Inference of the shape and the timing of primitives can be done using a factorial HMM based model, allowing the handwriting to be represented in primitive timing space.