Search Results for author: Eric Jang

Found 20 papers, 10 papers with code

Multi-Game Decision Transformers

1 code implementation30 May 2022 Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski, Igor Mordatch

Specifically, we show that a single transformer-based model - with a single set of weights - trained purely offline can play a suite of up to 46 Atari games simultaneously at close-to-human performance.

Atari Games Offline RL

Bayesian Imitation Learning for End-to-End Mobile Manipulation

no code implementations15 Feb 2022 Yuqing Du, Daniel Ho, Alexander A. Alemi, Eric Jang, Mohi Khansari

In this work we investigate and demonstrate benefits of a Bayesian approach to imitation learning from multiple sensor inputs, as applied to the task of opening office doors with a mobile manipulator.

Imitation Learning

BC-Z: Zero-Shot Task Generalization with Robotic Imitation Learning

no code implementations4 Feb 2022 Eric Jang, Alex Irpan, Mohi Khansari, Daniel Kappler, Frederik Ebert, Corey Lynch, Sergey Levine, Chelsea Finn

In this paper, we study the problem of enabling a vision-based robotic manipulation system to generalize to novel tasks, a long-standing challenge in robot learning.

Imitation Learning

Practical Imitation Learning in the Real World via Task Consistency Loss

no code implementations3 Feb 2022 Mohi Khansari, Daniel Ho, Yuqing Du, Armando Fuentes, Matthew Bennice, Nicolas Sievers, Sean Kirmani, Yunfei Bai, Eric Jang

To the best of our knowledge, this is the first work to tackle latched door opening from a purely end-to-end learning approach, where the task of navigation and manipulation are jointly modeled by a single neural network.

Domain Adaptation Imitation Learning

Meta-Learning Requires Meta-Augmentation

1 code implementation NeurIPS 2020 Janarthanan Rajendran, Alex Irpan, Eric Jang

Meta-learning algorithms aim to learn two components: a model that predicts targets for a task, and a base learner that quickly updates that model when given examples from a new task.

Meta-Learning

Thinking While Moving: Deep Reinforcement Learning with Concurrent Control

no code implementations ICLR 2020 Ted Xiao, Eric Jang, Dmitry Kalashnikov, Sergey Levine, Julian Ibarz, Karol Hausman, Alexander Herzog

We study reinforcement learning in settings where sampling an action from the policy must be done concurrently with the time evolution of the controlled system, such as when a robot must decide on the next action while still performing the previous action.

reinforcement-learning Reinforcement Learning (RL) +1

Scalable Multi-Task Imitation Learning with Autonomous Improvement

no code implementations25 Feb 2020 Avi Singh, Eric Jang, Alexander Irpan, Daniel Kappler, Murtaza Dalal, Sergey Levine, Mohi Khansari, Chelsea Finn

In this work, we target this challenge, aiming to build an imitation learning system that can continuously improve through autonomous data collection, while simultaneously avoiding the explicit use of reinforcement learning, to maintain the stability, simplicity, and scalability of supervised imitation.

Imitation Learning reinforcement-learning +1

Grasp2Vec: Learning Object Representations from Self-Supervised Grasping

1 code implementation16 Nov 2018 Eric Jang, Coline Devin, Vincent Vanhoucke, Sergey Levine

We formulate an arithmetic relationship between feature vectors from this observation, and use it to learn a representation of scenes and objects that can then be used to identify object instances, localize them in the scene, and perform goal-directed grasping tasks where the robot must retrieve commanded objects from a bin.

Object Representation Learning

WAIC, but Why? Generative Ensembles for Robust Anomaly Detection

1 code implementation2 Oct 2018 Hyunsun Choi, Eric Jang, Alexander A. Alemi

Machine learning models encounter Out-of-Distribution (OoD) errors when the data seen at test time are generated from a different stochastic generator than the one used to generate the training data.

Anomaly Detection Out of Distribution (OOD) Detection

Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control

no code implementations CVPR 2018 Fereshteh Sadeghi, Alexander Toshev, Eric Jang, Sergey Levine

In robotics, this ability is referred to as visual servoing: moving a tool or end-point to a desired location using primarily visual feedback.

Robot Manipulation

Sim2Real View Invariant Visual Servoing by Recurrent Control

no code implementations20 Dec 2017 Fereshteh Sadeghi, Alexander Toshev, Eric Jang, Sergey Levine

To this end, we train a deep recurrent controller that can automatically determine which actions move the end-point of a robotic arm to a desired object.

End-to-End Learning of Semantic Grasping

no code implementations6 Jul 2017 Eric Jang, Sudheendra Vijayanarasimhan, Peter Pastor, Julian Ibarz, Sergey Levine

We consider the task of semantic robotic grasping, in which a robot picks up an object of a user-specified class using only monocular images.

Object object-detection +3

Time-Contrastive Networks: Self-Supervised Learning from Video

7 code implementations23 Apr 2017 Pierre Sermanet, Corey Lynch, Yevgen Chebotar, Jasmine Hsu, Eric Jang, Stefan Schaal, Sergey Levine

While representations are learned from an unlabeled collection of task-related videos, robot behaviors such as pouring are learned by watching a single 3rd-person demonstration by a human.

Metric Learning reinforcement-learning +3

Categorical Reparameterization with Gumbel-Softmax

19 code implementations3 Nov 2016 Eric Jang, Shixiang Gu, Ben Poole

Categorical variables are a natural choice for representing discrete structure in the world.

General Classification

Categorical Reparametrization with Gumbel-Softmax

1 code implementation ICLR 2017 2016 Eric Jang, Shixiang Gu, Ben Poole

Categorical variables are a natural choice for representing discrete structure in the world.

Cannot find the paper you are looking for? You can Submit a new open access paper.