Search Results for author: Jonathan Scholz

Found 7 papers, 2 papers with code

Scaling data-driven robotics with reward sketching and batch reinforcement learning

1 code implementation • 26 Sep 2019 • Serkan Cabi, Sergio Gómez Colmenarejo, Alexander Novikov, Ksenia Konyushkova, Scott Reed, Rae Jeong, Konrad Zolna, Yusuf Aytar, David Budden, Mel Vecerik, Oleg Sushkov, David Barker, Jonathan Scholz, Misha Denil, Nando de Freitas, Ziyu Wang

We present a framework for data-driven robotics that makes use of a large dataset of recorded robot experience and scales to several tasks using learned reward functions.

Reinforcement Learning (RL)

Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards

4 code implementations • 27 Jul 2017 • Mel Vecerik, Todd Hester, Jonathan Scholz, Fumin Wang, Olivier Pietquin, Bilal Piot, Nicolas Heess, Thomas Rothörl, Thomas Lampe, Martin Riedmiller

We propose a general and model-free approach for Reinforcement Learning (RL) on real robotics with sparse rewards.

Reinforcement Learning (RL)

PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations

no code implementations • 27 May 2017 • Rico Jonschkowski, Roland Hafner, Jonathan Scholz, Martin Riedmiller

We propose position-velocity encoders (PVEs), which learn, without supervision, to encode images to positions and velocities of task-relevant objects.

Image Reconstruction, Position

Policy Shaping: Integrating Human Feedback with Reinforcement Learning

no code implementations • NeurIPS 2013 • Shane Griffith, Kaushik Subramanian, Jonathan Scholz, Charles L. Isbell, Andrea L. Thomaz

A long-term goal of Interactive Reinforcement Learning is to incorporate non-expert human feedback to solve complex tasks.

Reinforcement Learning (RL)

Generative predecessor models for sample-efficient imitation learning

no code implementations • ICLR 2019 • Yannick Schroecker, Mel Vecerik, Jonathan Scholz

We propose Generative Predecessor Models for Imitation Learning (GPRIL), a novel imitation learning algorithm that matches the state-action distribution to the distribution observed in expert demonstrations, using generative models to reason probabilistically about alternative histories of demonstrated states.

Imitation Learning, Robot Manipulation

Improved Exploration through Latent Trajectory Optimization in Deep Deterministic Policy Gradient

no code implementations • 15 Nov 2019 • Kevin Sebastian Luck, Mel Vecerik, Simon Stepputtis, Heni Ben Amor, Jonathan Scholz

This work evaluates model-based trajectory optimization methods for exploration in Deep Deterministic Policy Gradient when trained on a latent image embedding.

Continuous Control, Reinforcement Learning (RL)

S3K: Self-Supervised Semantic Keypoints for Robotic Manipulation via Multi-View Consistency

no code implementations • 30 Sep 2020 • Mel Vecerik, Jean-Baptiste Regli, Oleg Sushkov, David Barker, Rugile Pevceviciute, Thomas Rothörl, Christopher Schuster, Raia Hadsell, Lourdes Agapito, Jonathan Scholz

In this work, we advocate semantic 3D keypoints as a visual representation and present a semi-supervised training objective that allows instance- or category-level keypoints to be trained to 1-5 millimeter accuracy with minimal supervision.

Image Reconstruction, Representation Learning
