Search Results for author: Thomas Lampe

Found 20 papers, 3 papers with code

Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning

no code implementations • 8 Feb 2024 • Mohak Bhardwaj, Thomas Lampe, Michael Neunert, Francesco Romano, Abbas Abdolmaleki, Arunkumar Byravan, Markus Wulfmeier, Martin Riedmiller, Jonas Buchli

Recent advances in real-world applications of reinforcement learning (RL) have relied on the ability to accurately simulate systems at scale.

Offline RL Reinforcement Learning (RL)

Paper
Add Code

Offline Actor-Critic Reinforcement Learning Scales to Large Models

no code implementations • 8 Feb 2024 • Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin Riedmiller

We show that offline actor-critic reinforcement learning can scale to large models - such as transformers - and follows similar scaling laws as supervised learning.

Continuous Control Offline RL +1

Paper
Add Code

Replay across Experiments: A Natural Extension of Off-Policy RL

no code implementations • 27 Nov 2023 • Dhruva Tirumala, Thomas Lampe, Jose Enrique Chen, Tuomas Haarnoja, Sandy Huang, Guy Lever, Ben Moran, Tim Hertweck, Leonard Hasenclever, Martin Riedmiller, Nicolas Heess, Markus Wulfmeier

Replaying data is a principal mechanism underlying the stability and data efficiency of off-policy reinforcement learning (RL).

Reinforcement Learning (RL)

Paper
Add Code

RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation

no code implementations • 20 Jun 2023 • Konstantinos Bousmalis, Giulia Vezzani, Dushyant Rao, Coline Devin, Alex X. Lee, Maria Bauza, Todor Davchev, Yuxiang Zhou, Agrim Gupta, Akhil Raju, Antoine Laurens, Claudio Fantacci, Valentin Dalibard, Martina Zambelli, Murilo Martins, Rugile Pevceviciute, Michiel Blokzijl, Misha Denil, Nathan Batchelor, Thomas Lampe, Emilio Parisotto, Konrad Żołna, Scott Reed, Sergio Gómez Colmenarejo, Jon Scholz, Abbas Abdolmaleki, Oliver Groth, Jean-Baptiste Regli, Oleg Sushkov, Tom Rothörl, José Enrique Chen, Yusuf Aytar, Dave Barker, Joy Ortiz, Martin Riedmiller, Jost Tobias Springenberg, Raia Hadsell, Francesco Nori, Nicolas Heess

With RoboCat, we demonstrate the ability to generalise to new tasks and robots, both zero-shot as well as through adaptation using only 100-1000 examples for the target task.

Paper
Add Code

SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration

no code implementations • 24 Nov 2022 • Giulia Vezzani, Dhruva Tirumala, Markus Wulfmeier, Dushyant Rao, Abbas Abdolmaleki, Ben Moran, Tuomas Haarnoja, Jan Humplik, Roland Hafner, Michael Neunert, Claudio Fantacci, Tim Hertweck, Thomas Lampe, Fereshteh Sadeghi, Nicolas Heess, Martin Riedmiller

The ability to effectively reuse prior knowledge is a key requirement when building general and flexible Reinforcement Learning (RL) agents.

Reinforcement Learning (RL)

Paper
Add Code

How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation

no code implementations • 6 May 2022 • Alex X. Lee, Coline Devin, Jost Tobias Springenberg, Yuxiang Zhou, Thomas Lampe, Abbas Abdolmaleki, Konstantinos Bousmalis

Our analysis, both in simulation and in the real world, shows that our approach is the best across data budgets, while standard offline RL from teacher rollouts is surprisingly effective when enough data is given.

Offline RL Reinforcement Learning (RL)

Paper
Add Code

Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes

1 code implementation • 12 Oct 2021 • Alex X. Lee, Coline Devin, Yuxiang Zhou, Thomas Lampe, Konstantinos Bousmalis, Jost Tobias Springenberg, Arunkumar Byravan, Abbas Abdolmaleki, Nimrod Gileadi, David Khosid, Claudio Fantacci, Jose Enrique Chen, Akhil Raju, Rae Jeong, Michael Neunert, Antoine Laurens, Stefano Saliceti, Federico Casarini, Martin Riedmiller, Raia Hadsell, Francesco Nori

We study the problem of robotic stacking with objects of complex geometry.

Ranked #2 on Skill Generalization on RGB-Stacking

Offline RL Reinforcement Learning (RL) +2

113

Paper
Code

Representation Matters: Improving Perception and Exploration for Robotics

no code implementations • 3 Nov 2020 • Markus Wulfmeier, Arunkumar Byravan, Tim Hertweck, Irina Higgins, Ankush Gupta, tejas kulkarni, Malcolm Reynolds, Denis Teplyashin, Roland Hafner, Thomas Lampe, Martin Riedmiller

Furthermore, the value of each representation is evaluated in terms of three properties: dimensionality, observability and disentanglement.

Disentanglement

Paper
Add Code

Data-efficient Hindsight Off-policy Option Learning

no code implementations • 30 Jul 2020 • Markus Wulfmeier, Dushyant Rao, Roland Hafner, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala, Noah Siegel, Nicolas Heess, Martin Riedmiller

We introduce Hindsight Off-policy Options (HO2), a data-efficient option learning algorithm.

Robot Manipulation

Paper
Add Code

Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning

no code implementations • ICLR 2020 • Noah Siegel, Jost Tobias Springenberg, Felix Berkenkamp, Abbas Abdolmaleki, Michael Neunert, Thomas Lampe, Roland Hafner, Nicolas Heess, Martin Riedmiller

In practice, however, standard off-policy algorithms fail in the batch setting for continuous control.

Continuous Control Multi-Task Learning +2

Paper
Add Code

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning

no code implementations • 19 Feb 2020 • Noah Y. Siegel, Jost Tobias Springenberg, Felix Berkenkamp, Abbas Abdolmaleki, Michael Neunert, Thomas Lampe, Roland Hafner, Nicolas Heess, Martin Riedmiller

In practice, however, standard off-policy algorithms fail in the batch setting for continuous control.

Continuous Control Multi-Task Learning +2

Paper
Add Code

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics

no code implementations • 2 Jan 2020 • Michael Neunert, Abbas Abdolmaleki, Markus Wulfmeier, Thomas Lampe, Jost Tobias Springenberg, Roland Hafner, Francesco Romano, Jonas Buchli, Nicolas Heess, Martin Riedmiller

In contrast, we propose to treat hybrid problems in their 'native' form by solving them with hybrid reinforcement learning, which optimizes for discrete and continuous actions simultaneously.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Self-Supervised Sim-to-Real Adaptation for Visual Robotic Manipulation

no code implementations • 21 Oct 2019 • Rae Jeong, Yusuf Aytar, David Khosid, Yuxiang Zhou, Jackie Kay, Thomas Lampe, Konstantinos Bousmalis, Francesco Nori

In this work, we learn a latent state representation implicitly with deep reinforcement learning in simulation, and then adapt it to the real domain using unlabeled real robot data.

Domain Adaptation reinforcement-learning +1

Paper
Add Code

Modelling Generalized Forces with Reinforcement Learning for Sim-to-Real Transfer

no code implementations • 21 Oct 2019 • Rae Jeong, Jackie Kay, Francesco Romano, Thomas Lampe, Tom Rothorl, Abbas Abdolmaleki, Tom Erez, Yuval Tassa, Francesco Nori

Learning robotic control policies in the real world gives rise to challenges in data efficiency, safety, and controlling the initial condition of the system.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models

no code implementations • 9 Oct 2019 • Arunkumar Byravan, Jost Tobias Springenberg, Abbas Abdolmaleki, Roland Hafner, Michael Neunert, Thomas Lampe, Noah Siegel, Nicolas Heess, Martin Riedmiller

Humans are masters at quickly learning many complex tasks, relying on an approximate understanding of the dynamics of their environments.

Model-based Reinforcement Learning Reinforcement Learning (RL) +2

Paper
Add Code

Compositional Transfer in Hierarchical Reinforcement Learning

no code implementations • 26 Jun 2019 • Markus Wulfmeier, Abbas Abdolmaleki, Roland Hafner, Jost Tobias Springenberg, Michael Neunert, Tim Hertweck, Thomas Lampe, Noah Siegel, Nicolas Heess, Martin Riedmiller

The successful application of general reinforcement learning algorithms to real-world robotics applications is often limited by their high data requirements.

General Reinforcement Learning Hierarchical Reinforcement Learning +4

Paper
Add Code

Simultaneously Learning Vision and Feature-based Control Policies for Real-world Ball-in-a-Cup

no code implementations • 13 Feb 2019 • Devin Schwab, Tobias Springenberg, Murilo F. Martins, Thomas Lampe, Michael Neunert, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin Riedmiller

We present a method for fast training of vision based control policies on real robots.

Imitation Learning

Paper
Add Code

Learning by Playing - Solving Sparse Reward Tasks from Scratch

1 code implementation • ICML 2018 • Martin Riedmiller, Roland Hafner, Thomas Lampe, Michael Neunert, Jonas Degrave, Tom Van de Wiele, Volodymyr Mnih, Nicolas Heess, Jost Tobias Springenberg

We propose Scheduled Auxiliary Control (SAC-X), a new learning paradigm in the context of Reinforcement Learning (RL).

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Code

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

4 code implementations • 27 Jul 2017 • Mel Vecerik, Todd Hester, Jonathan Scholz, Fumin Wang, Olivier Pietquin, Bilal Piot, Nicolas Heess, Thomas Rothörl, Thomas Lampe, Martin Riedmiller

We propose a general and model-free approach for Reinforcement Learning (RL) on real robotics with sparse rewards.

reinforcement-learning Reinforcement Learning (RL)

815

Paper
Code

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

no code implementations • ICLR 2018 • Ivaylo Popov, Nicolas Heess, Timothy Lillicrap, Roland Hafner, Gabriel Barth-Maron, Matej Vecerik, Thomas Lampe, Yuval Tassa, Tom Erez, Martin Riedmiller

Solving this difficult and practically relevant problem in the real world is an important long-term goal for the field of robotics.

Continuous Control Q-Learning +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.