no code implementations • ICML 2020 • Donald Hejna, Lerrel Pinto, Pieter Abbeel
Learning long-range behaviors on complex high-dimensional agents is a fundamental problem in robot learning.
no code implementations • 1 Jun 2023 • Gaoyue Zhou, Victoria Dean, Mohan Kumar Srirama, Aravind Rajeswaran, Jyothish Pari, Kyle Hatch, Aryan Jain, Tianhe Yu, Pieter Abbeel, Lerrel Pinto, Chelsea Finn, Abhinav Gupta
Three challenges limit the progress of robot learning research: robots are expensive (few labs can participate), everyone uses different robots (findings do not generalize across labs), and we lack internet-scale robotics data.
no code implementations • 30 May 2023 • Ulyana Piterbarg, Lerrel Pinto, Rob Fergus
Neural policy learning methods have achieved remarkable results in various control problems, ranging from Atari games to simulated locomotion.
no code implementations • 21 Mar 2023 • Irmak Guzey, Ben Evans, Soumith Chintala, Lerrel Pinto
In the first phase, we collect 2. 5 hours of play data, which is used to train self-supervised tactile encoders.
1 code implementation • 2 Mar 2023 • Siddhant Haldar, Jyothish Pari, Anant Rai, Lerrel Pinto
Given a weak base-policy trained by offline imitation of demonstrations, FISH computes rewards that correspond to the "match" between the robot's behavior and the demonstrations.
no code implementations • 18 Oct 2022 • Zichen Jeff Cui, Yibin Wang, Nur Muhammad Mahi Shafiullah, Lerrel Pinto
While large-scale sequence modeling from offline data has led to impressive performance gains in natural language and image generation, directly translating such ideas to robotics has been challenging.
no code implementations • 12 Oct 2022 • Sridhar Pandian Arunachalam, Irmak Güzey, Soumith Chintala, Lerrel Pinto
A fundamental challenge in teaching robots is to provide an effective interface for human teachers to demonstrate useful skills to a robot.
2 code implementations • 11 Oct 2022 • Nur Muhammad Mahi Shafiullah, Chris Paxton, Lerrel Pinto, Soumith Chintala, Arthur Szlam
We propose CLIP-Fields, an implicit scene model that can be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization.
1 code implementation • 3 Oct 2022 • Abitha Thankaraj, Lerrel Pinto
Learning to produce contact-rich, dynamic behaviors from raw sensory data has been a longstanding challenge in robotics.
no code implementations • 4 Aug 2022 • Yilei Zeng, Jiali Duan, Yang Li, Emilio Ferrara, Lerrel Pinto, C. -C. Jay Kuo, Stefanos Nikolaidis
In this work, we guide the curriculum reinforcement learning results towards a preferred performance level that is neither too hard nor too easy via learning from the human decision process.
no code implementations • 30 Jun 2022 • Siddhant Haldar, Vaibhav Mathur, Denis Yarats, Lerrel Pinto
Our experiments on 20 visual control tasks across the DeepMind Control Suite, the OpenAI Robotics Suite, and the Meta-World Benchmark demonstrate an average of 7. 8X faster imitation to reach 90% of expert performance compared to prior state-of-the-art methods.
1 code implementation • 22 Jun 2022 • Nur Muhammad Mahi Shafiullah, Zichen Jeff Cui, Ariuntuya Altanzaya, Lerrel Pinto
In this work, we present Behavior Transformer (BeT), a new technique to model unlabeled demonstration data with multiple modes.
no code implementations • 24 Mar 2022 • Sridhar Pandian Arunachalam, Sneha Silwal, Ben Evans, Lerrel Pinto
Optimizing behaviors for dexterous manipulation has been a longstanding challenge in robotics, with a variety of methods from model-based control to model-free reinforcement learning having been previously explored in literature.
1 code implementation • ICLR 2022 • Nur Muhammad Shafiullah, Lerrel Pinto
In this work, we propose a new framework for skill discovery, where skills are learned one after another in an incremental fashion.
no code implementations • 10 Mar 2022 • Ben Evans, Abitha Thankaraj, Lerrel Pinto
Understanding environment dynamics is necessary for robots to act safely and optimally in the world.
1 code implementation • 31 Jan 2022 • Denis Yarats, David Brandfonbrener, Hao liu, Michael Laskin, Pieter Abbeel, Alessandro Lazaric, Lerrel Pinto
In this work, we propose Exploratory data for Offline RL (ExORL), a data-centric approach to offline RL.
1 code implementation • 2 Dec 2021 • Jyothish Pari, Nur Muhammad Shafiullah, Sridhar Pandian Arunachalam, Lerrel Pinto
One reason such complexities arise is because standard visual imitation frameworks try to solve two coupled problems at once: learning a succinct but good representation from the diverse visual data, while simultaneously learning to associate the demonstrated actions with such representations.
1 code implementation • 28 Oct 2021 • Michael Laskin, Denis Yarats, Hao liu, Kimin Lee, Albert Zhan, Kevin Lu, Catherine Cang, Lerrel Pinto, Pieter Abbeel
Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range of complex yet specific control tasks.
no code implementations • 29 Sep 2021 • Donald Joseph Hejna III, Pieter Abbeel, Lerrel Pinto
Complex, long-horizon planning and its combinatorial nature pose steep challenges for learning-based agents.
4 code implementations • ICLR 2022 • Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto
We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual continuous control.
no code implementations • 19 Jul 2021 • Sarah Young, Jyothish Pari, Pieter Abbeel, Lerrel Pinto
In this work, we propose to use playful interactions in a self-supervised manner to learn visual representations for downstream tasks.
no code implementations • 7 Apr 2021 • Philippe Hansen-Estruch, Wenling Shang, Lerrel Pinto, Pieter Abbeel, Stas Tiomkin
In this work, we take advantage of these structures to build effective dynamical models that are amenable to sample-based learning.
1 code implementation • 31 Mar 2021 • Wenyu Han, Chen Feng, Haoran Wu, Alexander Gao, Armand Jordana, Dong Liu, Lerrel Pinto, Ludovic Righetti
We need intelligent robots for mobile construction, the process of navigating in an environment and modifying its structure according to a geometric design.
1 code implementation • ICLR 2021 • Donald J. Hejna III, Pieter Abbeel, Lerrel Pinto
Deep reinforcement learning primarily focuses on learning behavior, usually overlooking the fact that an agent's function is largely determined by form.
1 code implementation • 22 Feb 2021 • Denis Yarats, Rob Fergus, Alessandro Lazaric, Lerrel Pinto
Unfortunately, in RL, representation learning is confounded with the exploratory experience of the agent -- learning a useful representation requires diverse data, while effective exploration is only possible with coherent representations.
no code implementations • 1 Jan 2021 • Wenyu Han, Chen Feng, Haoran Wu, Alexander Gao, Armand Jordana, Dongdong Liu, Lerrel Pinto, Ludovic Righetti
We need intelligent robots to perform mobile construction, the process of moving in an environment and modifying its geometry according to a design plan.
1 code implementation • ICLR 2021 • Qiang Zhang, Tete Xiao, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang
We propose \textit{dynamics cycles} that align dynamic robot behavior across two domains using a cycle-consistency constraint.
no code implementations • 14 Dec 2020 • Albert Zhan, Ruihan Zhao, Lerrel Pinto, Pieter Abbeel, Michael Laskin
We present Contrastive Pre-training and Data Augmentation for Efficient Robotic Learning (CoDER), a method that utilizes data augmentation and unsupervised learning to achieve sample-efficient training of real-robot arm policies from sparse rewards.
no code implementations • 13 Nov 2020 • Bryan Chen, Alexander Sax, Gene Lewis, Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik, Lerrel Pinto
Vision-based robotics often separates the control loop into one module for perception and a separate module for control.
no code implementations • 11 Aug 2020 • Sarah Young, Dhiraj Gandhi, Shubham Tulsiani, Abhinav Gupta, Pieter Abbeel, Lerrel Pinto
We use commercially available reacher-grabber assistive tools both as a data collection device and as the robot's end-effector.
2 code implementations • ICLR 2021 • Nicklas Hansen, Rishabh Jangir, Yu Sun, Guillem Alenyà, Pieter Abbeel, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang
A natural solution would be to keep training after deployment in the new environment, but this cannot be done if the new environment offers no reward signal.
no code implementations • 3 Jul 2020 • Dhiraj Gandhi, Abhinav Gupta, Lerrel Pinto
In this work, we perform the first large-scale study of the interactions between sound and robotic action.
1 code implementation • NeurIPS 2020 • Yunzhi Zhang, Pieter Abbeel, Lerrel Pinto
Our key insight is that if we can sample goals at the frontier of the set of goals that an agent is able to reach, it will provide a significantly stronger learning signal compared to randomly sampled goals.
2 code implementations • NeurIPS 2020 • Michael Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas
To this end, we present Reinforcement Learning with Augmented Data (RAD), a simple plug-and-play module that can enhance most RL algorithms.
no code implementations • 7 Apr 2020 • Ilija Radosavovic, Xiaolong Wang, Lerrel Pinto, Jitendra Malik
To tackle this setting, we train an inverse dynamics model and use it to predict actions for state-only demonstrations.
1 code implementation • 11 Mar 2020 • Wilson Yan, Ashwin Vangipuram, Pieter Abbeel, Lerrel Pinto
Using visual model-based learning for deformable object manipulation is challenging due to difficulties in learning plannable visual representations along with complex dynamic models.
1 code implementation • 3 Mar 2020 • Donald J. Hejna III, Pieter Abbeel, Lerrel Pinto
Learning long-range behaviors on complex high-dimensional agents is a fundamental problem in robot learning.
no code implementations • NeurIPS 2020 • Alexander C. Li, Lerrel Pinto, Pieter Abbeel
Compared to standard relabeling techniques, Generalized Hindsight provides a substantially more efficient reuse of samples, which we empirically demonstrate on a suite of multi-task navigation and manipulation tasks.
no code implementations • ICLR 2020 • Tanmay Shankar, Shubham Tulsiani, Lerrel Pinto, Abhinav Gupta
In this paper, we present an approach to learn recomposable motor primitives across large-scale and diverse manipulation demonstrations.
2 code implementations • 29 Oct 2019 • Yilin Wu, Wilson Yan, Thanard Kurutach, Lerrel Pinto, Pieter Abbeel
Second, instead of jointly learning both the pick and the place locations, we only explicitly learn the placing policy conditioned on random pick points.
no code implementations • 25 Sep 2019 • Dhiraj Gandhi, Abhinav Gupta, Lerrel Pinto
In this work, we perform the first large-scale study of the interactions between sound and robotic action.
1 code implementation • ICLR 2019 • Wenxuan Zhou, Lerrel Pinto, Abhinav Gupta
A key challenge in reinforcement learning (RL) is environment generalization: a policy trained to solve a task in one environment often fails to solve the same task in a slightly different test environment.
2 code implementations • 19 Jun 2019 • Adithyavairavan Murali, Tao Chen, Kalyan Vasudev Alwala, Dhiraj Gandhi, Lerrel Pinto, Saurabh Gupta, Abhinav Gupta
This paper introduces PyRobot, an open-source robotics framework for research and benchmarking.
no code implementations • 24 Oct 2018 • Lerrel Pinto, Aditya Mandalika, Brian Hou, Siddhartha Srinivasa
This paper proposes a sample-efficient yet simple approach to learning closed-loop policies for nonprehensile manipulation.
no code implementations • 16 Oct 2018 • Pratyusha Sharma, Lekha Mohan, Lerrel Pinto, Abhinav Gupta
In order to make progress and capture the space of manipulation, we would need to collect a large-scale dataset of diverse tasks such as pouring, opening bottles, stacking objects etc.
no code implementations • NeurIPS 2018 • Abhinav Gupta, Adithyavairavan Murali, Dhiraj Gandhi, Lerrel Pinto
The models trained with our home dataset showed a marked improvement of 43. 7% over a baseline model trained with data collected in lab.
no code implementations • 18 Oct 2017 • Lerrel Pinto, Marcin Andrychowicz, Peter Welinder, Wojciech Zaremba, Pieter Abbeel
While several recent works have shown promising results in transferring policies trained in simulation to the real world, they often do not fully utilize the advantage of working with a simulator.
no code implementations • NeurIPS 2017 • Arun Venkatraman, Nicholas Rhinehart, Wen Sun, Lerrel Pinto, Martial Hebert, Byron Boots, Kris M. Kitani, J. Andrew Bagnell
We seek to combine the advantages of RNNs and PSRs by augmenting existing state-of-the-art recurrent neural networks with Predictive-State Decoders (PSDs), which add supervision to the network's internal state representation to target predicting future observations.
no code implementations • 4 Aug 2017 • Adithyavairavan Murali, Lerrel Pinto, Dhiraj Gandhi, Abhinav Gupta
Recent self-supervised learning approaches focus on using a few thousand data points to learn policies for high-level, low-dimensional action spaces.
1 code implementation • 19 Apr 2017 • Dhiraj Gandhi, Lerrel Pinto, Abhinav Gupta
An alternative is to use simulation.
6 code implementations • ICML 2017 • Lerrel Pinto, James Davidson, Rahul Sukthankar, Abhinav Gupta
Deep neural networks coupled with fast simulation and improved computation have led to recent successes in the field of reinforcement learning (RL).
1 code implementation • 5 Oct 2016 • Lerrel Pinto, James Davidson, Abhinav Gupta
Due to large number of experiences required for training, most of these approaches use a self-supervised paradigm: using sensors to measure success/failure.
no code implementations • 28 Sep 2016 • Lerrel Pinto, Abhinav Gupta
The argument of the difficulty in scalability to multiple tasks is well founded, since training these tasks often require hundreds or thousands of examples.
no code implementations • 5 Apr 2016 • Lerrel Pinto, Dhiraj Gandhi, Yuanfeng Han, Yong-Lae Park, Abhinav Gupta
We argue that biological agents use physical interactions with the world to learn visual representations unlike current vision systems which just use passive observations (images and videos downloaded from web).
no code implementations • 23 Sep 2015 • Lerrel Pinto, Abhinav Gupta
Our experiments clearly show the benefit of using large-scale datasets (and multi-stage training) for the task of grasping.