Search Results for author: Dinesh Jayaraman

Found 57 papers, 23 papers with code

Recasting Generic Pretrained Vision Transformers As Object-Centric Scene Encoders For Manipulation Policies

no code implementations • 24 May 2024 • Jianing Qian, Anastasios Panagopoulos, Dinesh Jayaraman

Generic re-usable pre-trained image representation encoders have become a standard component of methods for many computer vision tasks.

Privileged Sensing Scaffolds Reinforcement Learning

no code implementations • 23 May 2024 • Edward S. Hu, James Springer, Oleh Rybkin, Dinesh Jayaraman

We consider such sensory scaffolding setups for training artificial agents.

reinforcement-learning

Composing Pre-Trained Object-Centric Representations for Robotics From "What" and "Where" Foundation Models

no code implementations • 20 Apr 2024 • Junyao Shi, Jianing Qian, Yecheng Jason Ma, Dinesh Jayaraman

There have recently been large advances both in pre-training visual representations for robotic control and segmenting unknown category objects in general images.

Object Systematic Generalization

Can Transformers Capture Spatial Relations between Objects?

no code implementations • 1 Mar 2024 • Chuan Wen, Dinesh Jayaraman, Yang Gao

Spatial relationships between objects represent key scene information for humans to understand and interact with the world.

Relation

DiffusionPhase: Motion Diffusion in Frequency Domain

no code implementations • 7 Dec 2023 • Weilin Wan, Yiming Huang, Shutong Wu, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu

In this study, we introduce a learning-based method for generating high-quality human motion sequences from text descriptions (e.g., "A person walks forward").

TLControl: Trajectory and Language Control for Human Motion Synthesis

no code implementations • 28 Nov 2023 • Weilin Wan, Zhiyang Dou, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu

Controllable human motion synthesis is essential for applications in AR/VR, gaming, movies, and embodied AI.

Motion Synthesis

Eureka: Human-Level Reward Design via Coding Large Language Models

1 code implementation • 19 Oct 2023 • Yecheng Jason Ma, William Liang, Guanzhi Wang, De-An Huang, Osbert Bastani, Dinesh Jayaraman, Yuke Zhu, Linxi Fan, Anima Anandkumar

The generality of Eureka also enables a new gradient-free in-context learning approach to reinforcement learning from human feedback (RLHF), readily incorporating human inputs to improve the quality and the safety of the generated rewards without model updating.

Decision Making In-Context Learning +1

2,687

Universal Visual Decomposer: Long-Horizon Manipulation Made Easy

no code implementations • 12 Oct 2023 • Zichen Zhang, Yunshuang Li, Osbert Bastani, Abhishek Gupta, Dinesh Jayaraman, Yecheng Jason Ma, Luca Weihs

Learning long-horizon manipulation tasks, however, is a long-standing challenge, and demands decomposing the overarching task into several manageable subtasks to facilitate policy learning and generalization to unseen tasks.

reinforcement-learning

Memory-Consistent Neural Networks for Imitation Learning

no code implementations • 9 Oct 2023 • Kaustubh Sridhar, Souradeep Dutta, Dinesh Jayaraman, James Weimer, Insup Lee

Imitation learning considerably simplifies policy synthesis compared to alternative approaches by exploiting access to expert demonstrations.

Imitation Learning

LIV: Language-Image Representations and Rewards for Robotic Control

1 code implementation • 1 Jun 2023 • Yecheng Jason Ma, William Liang, Vaidehi Som, Vikash Kumar, Amy Zhang, Osbert Bastani, Dinesh Jayaraman

We present Language-Image Value learning (LIV), a unified objective for vision-language representation and reward learning from action-free videos with text annotations.

Contrastive Learning Imitation Learning

TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching

no code implementations • 22 May 2023 • Yecheng Jason Ma, Kausik Sivakumar, Jason Yan, Osbert Bastani, Dinesh Jayaraman

Standard model-based reinforcement learning (MBRL) approaches fit a transition model of the environment to all past experience, but this wastes model capacity on data that is irrelevant for policy improvement.

Model-based Reinforcement Learning reinforcement-learning

ZeroFlow: Scalable Scene Flow via Distillation

1 code implementation • 17 May 2023 • Kyle Vedder, Neehar Peri, Nathaniel Chodosh, Ishan Khatri, Eric Eaton, Dinesh Jayaraman, Yang Liu, Deva Ramanan, James Hays

Scene flow estimation is the task of describing the 3D motion field between temporally successive point clouds.

Ranked #3 on Self-supervised Scene Flow Estimation on Argoverse 2

Self-supervised Scene Flow Estimation

Planning Goals for Exploration

1 code implementation • 23 Mar 2023 • Edward S. Hu, Richard Chang, Oleh Rybkin, Dinesh Jayaraman

We address this question within the goal-conditioned reinforcement learning paradigm, by identifying how the agent should set its goals at training time to maximize exploration.

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

1 code implementation • 17 Dec 2022 • Kun Huang, Edward S. Hu, Dinesh Jayaraman

Physical interactions can often help reveal information that is not readily apparent.

Long-HOT: A Modular Hierarchical Approach for Long-Horizon Object Transport

no code implementations • 28 Oct 2022 • Sriram Narayanan, Dinesh Jayaraman, Manmohan Chandraker

We address key challenges in long-horizon embodied exploration and navigation by proposing a new object transport task and a novel modular framework for temporally extended navigation.

Motion Planning

VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training

1 code implementation • 30 Sep 2022 • Yecheng Jason Ma, Shagun Sodhani, Dinesh Jayaraman, Osbert Bastani, Vikash Kumar, Amy Zhang

Given the inherent cost and scarcity of in-domain, task-specific robot data, learning from large, diverse, offline human videos has emerged as a promising path towards acquiring a generally useful visual representation for control; however, how these human videos can be used for general-purpose reward learning remains an open question.

Offline RL Open-Ended Question Answering +2

123

Vision-based Perimeter Defense via Multiview Pose Estimation

no code implementations • 25 Sep 2022 • Elijah S. Lee, Giuseppe Loianno, Dinesh Jayaraman, Vijay Kumar

Previous studies in the perimeter defense game have largely focused on the fully observable setting where the true player states are known to all players.

Pose Estimation

Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming

no code implementations • 22 Jun 2022 • Chuan Wen, Jianing Qian, Jierui Lin, Jiaye Teng, Dinesh Jayaraman, Yang Gao

Across applications spanning supervised classification and sequential control, deep learning has been reported to find "shortcut" solutions that fail catastrophically under minor changes in the data distribution.

Autonomous Driving Classification +5

How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via $f$-Advantage Regression

1 code implementation • 7 Jun 2022 • Yecheng Jason Ma, Jason Yan, Dinesh Jayaraman, Osbert Bastani

Offline goal-conditioned reinforcement learning (GCRL) promises general-purpose skill learning in the form of reaching diverse goals from purely offline datasets.

regression reinforcement-learning +1

Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching

2 code implementations • 4 Feb 2022 • Yecheng Jason Ma, Andrew Shen, Dinesh Jayaraman, Osbert Bastani

We propose State Matching Offline DIstribution Correction Estimation (SMODICE), a novel and versatile regression-based offline imitation learning (IL) algorithm derived via state-occupancy matching.

Imitation Learning Reinforcement Learning (RL)

Prospective Learning: Principled Extrapolation to the Future

no code implementations • 19 Jan 2022 • Ashwin De Silva, Rahul Ramesh, Lyle Ungar, Marshall Hussain Shuler, Noah J. Cowan, Michael Platt, Chen Li, Leyla Isik, Seung-Eon Roh, Adam Charles, Archana Venkataraman, Brian Caffo, Javier J. How, Justus M Kebschull, John W. Krakauer, Maxim Bichuch, Kaleab Alemayehu Kinfu, Eva Yezerets, Dinesh Jayaraman, Jong M. Shin, Soledad Villar, Ian Phillips, Carey E. Priebe, Thomas Hartung, Michael I. Miller, Jayanta Dey, Ningyuan Huang, Eric Eaton, Ralph Etienne-Cummings, Elizabeth L. Ogburn, Randal Burns, Onyema Osuagwu, Brett Mensh, Alysson R. Muotri, Julia Brown, Chris White, Weiwei Yang, Andrei A. Rusu, Timothy Verstynen, Konrad P. Kording, Pratik Chaudhari, Joshua T. Vogelstein

We conjecture that certain sequences of tasks are not retrospectively learnable (in which the data distribution is fixed), but are prospectively learnable (in which distributions may be dynamic), suggesting that prospective learning is more difficult in kind than retrospective learning.

Continual Learning Decision Making

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

1 code implementation • 14 Dec 2021 • Yecheng Jason Ma, Andrew Shen, Osbert Bastani, Dinesh Jayaraman

Further, CAP adaptively tunes this penalty during training using true cost feedback from the environment.

reinforcement-learning Reinforcement Learning (RL) +1

Transferable Visual Control Policies Through Robot-Awareness

no code implementations • ICLR 2022 • Edward S. Hu, Kun Huang, Oleh Rybkin, Dinesh Jayaraman

Training visual control policies from scratch on a new robot typically requires generating large amounts of robot-specific data.

Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts

no code implementations • 29 Sep 2021 • Chuan Wen, Jianing Qian, Jierui Lin, Dinesh Jayaraman, Yang Gao

When operating in partially observed settings, it is important for a control policy to fuse information from a history of observations.

Autonomous Driving Continuous Control +1

Probabilistic Modeling for Human Mesh Recovery

1 code implementation • ICCV 2021 • Nikos Kolotouros, Georgios Pavlakos, Dinesh Jayaraman, Kostas Daniilidis

This paper focuses on the problem of 3D human reconstruction from 2D evidence.

Ranked #73 on 3D Human Pose Estimation on Human3.6M (PA-MPJPE metric)

3D Human Pose Estimation 3D Human Reconstruction +1

254

Know Thyself: Transferable Visual Control Policies Through Robot-Awareness

1 code implementation • 19 Jul 2021 • Edward S. Hu, Kun Huang, Oleh Rybkin, Dinesh Jayaraman

Training visual control policies from scratch on a new robot typically requires generating large amounts of robot-specific data.

Model-based Reinforcement Learning Transfer Learning +1

Conservative Offline Distributional Reinforcement Learning

1 code implementation • NeurIPS 2021 • Yecheng Jason Ma, Dinesh Jayaraman, Osbert Bastani

We prove that CODAC learns a conservative return distribution -- in particular, for finite MDPs, CODAC converges to a uniform lower bound on the quantiles of the return distribution; our proof relies on a novel analysis of the distributional Bellman operator.

D4RL Distributional Reinforcement Learning +4

Keyframe-Focused Visual Imitation Learning

no code implementations • 11 Jun 2021 • Chuan Wen, Jierui Lin, Jianing Qian, Yang Gao, Dinesh Jayaraman

Imitation learning trains control policies by mimicking pre-recorded expert demonstrations.

Continuous Control Graph Learning +1

How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control?

1 code implementation • 2 Apr 2021 • Jingxi Xu, Bruce Lee, Nikolai Matni, Dinesh Jayaraman

The difficulty of optimal control problems has classically been characterized in terms of system properties such as minimum eigenvalues of controllability/observability gramians.

Reinforcement Learning (RL)

Likelihood-Based Diverse Sampling for Trajectory Forecasting

1 code implementation • ICCV 2021 • Yecheng Jason Ma, Jeevana Priya Inala, Dinesh Jayaraman, Osbert Bastani

We propose Likelihood-Based Diverse Sampling (LDS), a method for improving the quality and the diversity of trajectory samples from a pre-trained flow model.

Trajectory Forecasting

Fighting Copycat Agents in Behavioral Cloning from Observation Histories

no code implementations • NeurIPS 2020 • Chuan Wen, Jierui Lin, Trevor Darrell, Dinesh Jayaraman, Yang Gao

Imitation learning trains policies to map from input observations to the actions that an expert would choose.

Imitation Learning

Model-Based Inverse Reinforcement Learning from Visual Demonstrations

no code implementations • 18 Oct 2020 • Neha Das, Sarah Bechtle, Todor Davchev, Dinesh Jayaraman, Akshara Rai, Franziska Meier

Scaling model-based inverse reinforcement learning (IRL) to real robotic manipulation tasks with unknown dynamics remains an open problem.

Model Predictive Control reinforcement-learning +1

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

1 code implementation • ICML 2020 • Jesse Zhang, Brian Cheung, Chelsea Finn, Sergey Levine, Dinesh Jayaraman

Reinforcement learning (RL) in real-world safety-critical target settings like urban driving is hazardous, imperiling the RL agent, other agents, and the environment.

reinforcement-learning Reinforcement Learning (RL)

Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors

1 code implementation • NeurIPS 2020 • Karl Pertsch, Oleh Rybkin, Frederik Ebert, Chelsea Finn, Dinesh Jayaraman, Sergey Levine

In this work we propose a framework for visual prediction and planning that is able to overcome both of these limitations.

DIGIT: A Novel Design for a Low-Cost Compact High-Resolution Tactile Sensor with Application to In-Hand Manipulation

1 code implementation • 29 May 2020 • Mike Lambeta, Po-Wei Chou, Stephen Tian, Brian Yang, Benjamin Maloon, Victoria Rose Most, Dave Stroud, Raymond Santos, Ahmad Byagowi, Gregg Kammerer, Dinesh Jayaraman, Roberto Calandra

Despite decades of research, general purpose in-hand manipulation remains one of the unsolved challenges of robotics.

144

An Exploration of Embodied Visual Exploration

1 code implementation • 7 Jan 2020 • Santhosh K. Ramakrishnan, Dinesh Jayaraman, Kristen Grauman

Embodied computer vision considers perception for robots in novel, unstructured environments.

Benchmarking

Morphology-Agnostic Visual Robotic Control

no code implementations • 31 Dec 2019 • Brian Yang, Dinesh Jayaraman, Glen Berseth, Alexei Efros, Sergey Levine

Existing approaches for visuomotor robotic control typically require characterizing the robot in advance by calibrating the camera or performing system identification.

SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments

1 code implementation • ICLR 2021 • Glen Berseth, Daniel Geng, Coline Devin, Nicholas Rhinehart, Chelsea Finn, Dinesh Jayaraman, Sergey Levine

Every living organism struggles against disruptive environmental forces to carve out and maintain an orderly niche.

Navigate reinforcement-learning +2

Goal-Conditioned Video Prediction

no code implementations • 25 Sep 2019 • Oleh Rybkin, Karl Pertsch, Frederik Ebert, Dinesh Jayaraman, Chelsea Finn, Sergey Levine

Prior work on video generation largely focuses on prediction models that only observe frames from the beginning of the video.

Imitation Learning Video Generation +1

Hope For The Best But Prepare For The Worst: Cautious Adaptation In RL Agents

no code implementations • 25 Sep 2019 • Jesse Zhang, Brian Cheung, Chelsea Finn, Dinesh Jayaraman, Sergey Levine

We study the problem of safe adaptation: given a model trained on a variety of past experiences for some task, can this model learn to perform that task in a new situation while avoiding catastrophic failure?

Domain Adaptation Meta Reinforcement Learning +2

SMiRL: Surprise Minimizing RL in Entropic Environments

no code implementations • 25 Sep 2019 • Glen Berseth, Daniel Geng, Coline Devin, Dinesh Jayaraman, Chelsea Finn, Sergey Levine

All living organisms struggle against the forces of nature to carve out niches where they can maintain relative stasis.

Unsupervised Pre-training Unsupervised Reinforcement Learning

Emergence of Exploratory Look-Around Behaviors through Active Observation Completion

1 code implementation • Science Robotics 2019 • Santhosh K. Ramakrishnan, Dinesh Jayaraman, Kristen Grauman

Standard computer vision systems assume access to intelligently captured inputs (e.g., photos from a human photographer), yet autonomously capturing good observations is a major challenge in itself.

Active Observation Completion

Causal Confusion in Imitation Learning

2 code implementations • NeurIPS 2019 • Pim de Haan, Dinesh Jayaraman, Sergey Levine

Such discriminative models are non-causal: the training procedure is unaware of the causal structure of the interaction between the expert and the environment.

Imitation Learning

REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning

no code implementations • 17 May 2019 • Brian Yang, Jesse Zhang, Vitchyr Pong, Sergey Levine, Dinesh Jayaraman

We envision REPLAB as a framework for reproducible research across manipulation tasks, and as a step in this direction, we define a template for a grasping benchmark consisting of a task definition, evaluation protocol, performance measures, and a dataset of 92k grasp attempts.

Benchmarking Machine Translation +1

Manipulation by Feel: Touch-Based Control with Deep Predictive Models

no code implementations • 11 Mar 2019 • Stephen Tian, Frederik Ebert, Dinesh Jayaraman, Mayur Mudigonda, Chelsea Finn, Roberto Calandra, Sergey Levine

Touch sensing is widely acknowledged to be important for dexterous robotic manipulation, but exploiting tactile sensing for continuous, non-prehensile manipulation is challenging.

Time-Agnostic Prediction: Predicting Predictable Video Frames

no code implementations • ICLR 2019 • Dinesh Jayaraman, Frederik Ebert, Alexei A. Efros, Sergey Levine

Prediction is arguably one of the most basic functions of an intelligent system.

More Than a Feeling: Learning to Grasp and Regrasp using Vision and Touch

no code implementations • 28 May 2018 • Roberto Calandra, Andrew Owens, Dinesh Jayaraman, Justin Lin, Wenzhen Yuan, Jitendra Malik, Edward H. Adelson, Sergey Levine

This model -- a deep, multimodal convolutional network -- predicts the outcome of a candidate grasp adjustment, and then executes a grasp by iteratively selecting the most promising actions.

Robotic Grasping

ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids

no code implementations • ECCV 2018 • Dinesh Jayaraman, Ruohan Gao, Kristen Grauman

We introduce an unsupervised feature learning approach that embeds 3D shape information into a single-view image representation.

Decoder Object +1

Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks

2 code implementations • CVPR 2018 • Dinesh Jayaraman, Kristen Grauman

It is common to implicitly assume access to intelligently captured inputs (e.g., photos from a human photographer), yet autonomously capturing good observations is itself a major challenge.

Pano2Vid: Automatic Cinematography for Watching 360$^{\circ}$ Videos

no code implementations • 7 Dec 2016 • Yu-Chuan Su, Dinesh Jayaraman, Kristen Grauman

AutoCam leverages NFOV web video to discriminatively identify space-time "glimpses" of interest at each time instant, and then uses dynamic programming to select optimal human-like camera trajectories.

Object-Centric Representation Learning from Unlabeled Videos

no code implementations • 1 Dec 2016 • Ruohan Gao, Dinesh Jayaraman, Kristen Grauman

Compared to existing temporal coherence methods, our idea has the advantage of lightweight preprocessing of the unlabeled video (no tracking required) while still being able to extract object-level regions from which to learn invariances.

Image Classification Object +2

Look-ahead before you leap: end-to-end active recognition by forecasting the effect of motion

no code implementations • 30 Apr 2016 • Dinesh Jayaraman, Kristen Grauman

To verify this hypothesis, we attempt to induce this capacity in our active recognition pipeline, by simultaneously learning to forecast the effects of the agent's motions on its internal representation of the environment conditional on all past views.

Slow and steady feature analysis: higher order temporal coherence in video

no code implementations • CVPR 2016 • Dinesh Jayaraman, Kristen Grauman

While this standard approach captures the fact that high-level visual signals change slowly over time, it fails to capture *how* the visual content changes.

Action Recognition Temporal Action Localization

Learning image representations tied to ego-motion

1 code implementation • ICCV 2015 • Dinesh Jayaraman, Kristen Grauman

Understanding how images of objects and scenes behave in response to specific ego-motions is a crucial aspect of proper visual development, yet existing visual learning methods are conspicuously disconnected from the physical source of their images.

Autonomous Driving Scene Recognition

Zero-shot recognition with unreliable attributes

no code implementations • NeurIPS 2014 • Dinesh Jayaraman, Kristen Grauman

In principle, zero-shot learning makes it possible to train an object recognition model simply by specifying the category's attributes.

Attribute Object Recognition +1

Zero Shot Recognition with Unreliable Attributes

no code implementations • 15 Sep 2014 • Dinesh Jayaraman, Kristen Grauman

In principle, zero-shot learning makes it possible to train a recognition model simply by specifying the category's attributes.

Attribute Zero-Shot Learning

Decorrelating Semantic Visual Attributes by Resisting the Urge to Share

no code implementations • CVPR 2014 • Dinesh Jayaraman, Fei Sha, Kristen Grauman

Existing methods to learn visual attributes are prone to learning the wrong thing, namely properties that are correlated with the attribute of interest among training samples.

Attribute Multi-Task Learning
