no code implementations • EMNLP (NLP4ConvAI) 2021 • Eunah Cho, Ziyan Jiang, Jie Hao, Zheng Chen, Saurabh Gupta, Xing Fan, Chenlei Guo
Query rewrite (QR) is an emerging component in conversational AI systems, reducing user defect.
no code implementations • EMNLP 2021 • Zhuoyi Wang, Saurabh Gupta, Jie Hao, Xing Fan, Dingcheng Li, Alexander Hanbo Li, Chenlei Guo
Rephrase detection is used to identify the rephrases and has long been treated as a task with pairwise input, which does not fully utilize the contextual information (e. g. users’ implicit feedback).
no code implementations • 9 May 2024 • Satyadwyoom Kumar, Saurabh Gupta, Arun Balaji Buduru
Most of these adversarial attack strategies assume that the adversary has access to the training data, the model parameters, and the input during deployment, hence, focus on perturbing the pixel level information present in the input image.
2 code implementations • 25 Mar 2024 • Zicong Fan, Takehiko Ohkawa, Linlin Yang, Nie Lin, Zhishan Zhou, Shihao Zhou, Jiajun Liang, Zhong Gao, Xuanyang Zhang, Xue Zhang, Fei Li, Zheng Liu, Feng Lu, Karim Abou Zeid, Bastian Leibe, Jeongwan On, Seungryul Baek, Aditya Prakash, Saurabh Gupta, Kun He, Yoichi Sato, Otmar Hilliges, Hyung Jin Chang, Angela Yao
A holistic 3Dunderstanding of such interactions from egocentric views is important for tasks in robotics, AR/VR, action recognition and motion generation.
no code implementations • 27 Feb 2024 • Arjun Gupta, Michelle Zhang, Rishik Sathua, Saurabh Gupta
In this work, we build an end-to-end system that enables a commodity mobile manipulator (Stretch RE2) to pull open cabinets and drawers in diverse previously unseen real world environments.
no code implementations • 27 Feb 2024 • XiaoYu Zhang, Matthew Chang, Pranav Kumar, Saurabh Gupta
The Dataset Aggregation, or DAgger approach to this problem simply collects more data to cover these failure states.
no code implementations • 11 Dec 2023 • Aditya Prakash, Arjun Gupta, Saurabh Gupta
Objects undergo varying amounts of perspective distortion as they move across a camera's field of view.
no code implementations • 11 Dec 2023 • Aditya Prakash, Ruisen Tu, Matthew Chang, Saurabh Gupta
We present WildHands, a method for 3D hand pose estimation in egocentric images in the wild.
1 code implementation • CVPR 2024 • Yiduo Hao, Sohrab Madani, Junfeng Guan, Mohammed Alloulah, Saurabh Gupta, Haitham Hassanieh
The perception of autonomous vehicles using radars has attracted increased research interest due its ability to operate in fog and bad weather.
no code implementations • ICCV 2023 • Shaowei Liu, Yang Zhou, Jimei Yang, Saurabh Gupta, Shenlong Wang
This paper presents a novel object-centric contact representation ContactGen for hand-object interaction.
no code implementations • 6 Jul 2023 • XiaoYu Zhang, Saurabh Gupta
We use self-supervision to train SRPNet, a neural network that predicts what space is revealed on execution of a candidate action on a given plant.
no code implementations • CVPR 2023 • Shaowei Liu, Saurabh Gupta, Shenlong Wang
We build rearticulable models for arbitrary everyday man-made objects containing an arbitrary number of parts that are connected together in arbitrary ways via 1 degree-of-freedom joints.
no code implementations • 4 May 2023 • Aditya Prakash, Matthew Chang, Matthew Jin, Saurabh Gupta
Prior works for reconstructing hand-held objects from a single image rely on direct 3D shape supervision which is challenging to gather in real world at scale.
no code implementations • 2 Mar 2023 • Arjun Gupta, Max E. Shepherd, Saurabh Gupta
Our key insight is to cast it as a learning problem to leverage past experience of solving similar planning problems to directly predict motion plans for mobile manipulation tasks in novel situations at test time.
no code implementations • 9 Feb 2023 • Matthew Chang, Saurabh Gupta
In this paper, we analyze the behavior of existing techniques and design new solutions for the problem of one-shot visual imitation.
1 code implementation • 21 Jul 2022 • Gabriel Sarch, Zhaoyuan Fang, Adam W. Harley, Paul Schydlo, Michael J. Tarr, Saurabh Gupta, Katerina Fragkiadaki
We introduce TIDEE, an embodied agent that tidies up a disordered scene based on learned commonsense object placement and room arrangement priors.
no code implementations • ICLR 2022 • Matthew Chang, Arjun Gupta, Saurabh Gupta
We show that LAQ can recover value functions that have high correlation with value functions learned using ground truth actions.
no code implementations • 7 Apr 2022 • Narayana Darapaneni, Jai Arora, MoniShankar Hazra, Naman Vig, Simrandeep Singh Gandhi, Saurabh Gupta, Anwesh Reddy Paduri
With over 50 million car sales annually and over 1. 3 million deaths every year due to motor accidents we have chosen this space.
1 code implementation • CVPR 2022 • Mohit Goyal, Sahil Modi, Rishabh Goyal, Saurabh Gupta
Analyzing the hands shows what we can do to objects and how.
no code implementations • NeurIPS 2021 • Devendra Singh Chaplot, Murtaza Dalal, Saurabh Gupta, Jitendra Malik, Ruslan Salakhutdinov
The observations gathered by this exploration policy are labelled using 3D consistency and used to improve the perception model.
no code implementations • 6 Jul 2021 • Arun Narenthiran Sivakumar, Sahil Modi, Mateus Valverde Gasparino, Che Ellis, Andres Eduardo Baquero Velasquez, Girish Chowdhary, Saurabh Gupta
We describe a system for visually guided autonomous navigation of under-canopy farm robots.
no code implementations • EMNLP (NLP+CSS) 2020 • Saurabh Gupta, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen
Recent advancements in natural language generation has raised serious concerns.
no code implementations • 29 Sep 2020 • Saurabh Gupta, Arun Balaji Buduru, Ponnurangam Kumaraguru
With experiments on MNIST dataset, we show that imdpGAN preserves the privacy of the individual data point, and learns latent codes to control the specificity of the generated samples.
no code implementations • 29 Sep 2020 • Saurabh Gupta, Siddhant Bhambri, Karan Dhingra, Arun Balaji Buduru, Ponnurangam Kumaraguru
We experiment on real-world smart home data, and show that the multi-objective approaches: i) establish trade-off between the two objectives, ii) achieve better combined user satisfaction and power consumption than single-objective approaches.
no code implementations • ECCV 2020 • Senthil Purushwalkam, Tian Ye, Saurabh Gupta, Abhinav Gupta
During training, given a pair of videos, we compute cycles that connect patches in a given frame in the first video by matching through frames in the second video.
1 code implementation • NeurIPS 2020 • Matthew Chang, Arjun Gupta, Saurabh Gupta
Semantic cues and statistical regularities in real-world environment layouts can improve efficiency for navigation in novel environments.
no code implementations • ECCV 2020 • Devendra Singh Chaplot, Helen Jiang, Saurabh Gupta, Abhinav Gupta
Instead, we explore a self-supervised approach for training our exploration policy by introducing a notion of semantic curiosity.
no code implementations • CVPR 2020 • Devendra Singh Chaplot, Ruslan Salakhutdinov, Abhinav Gupta, Saurabh Gupta
This paper studies the problem of image-goal navigation which involves navigating to the location indicated by a goal image in a novel previously unseen environment.
2 code implementations • ICLR 2020 • Devendra Singh Chaplot, Dhiraj Gandhi, Saurabh Gupta, Abhinav Gupta, Ruslan Salakhutdinov
The use of learning provides flexibility with respect to input modalities (in the SLAM module), leverages structural regularities of the world (in global policies), and provides robustness to errors in state estimation (in local policies).
3 code implementations • CVPR 2020 • Kiana Ehsani, Shubham Tulsiani, Saurabh Gupta, Ali Farhadi, Abhinav Gupta
Our quantitative and qualitative results show that (a) we can predict meaningful forces from videos whose effects lead to accurate imitation of the motions observed, (b) by jointly optimizing for contact point and force prediction, we can improve the performance on both tasks in comparison to independent training, and (c) we can learn a representation from this model that generalizes to novel objects using few shot examples.
no code implementations • ICLR 2020 • Rohan Chitnis, Shubham Tulsiani, Saurabh Gupta, Abhinav Gupta
Our key idea is that a good guiding principle for intrinsic motivation in synergistic tasks is to take actions which affect the world in ways that would not be achieved if the agents were acting on their own.
1 code implementation • ICLR 2020 • William Qi, Ravi Teja Mullapudi, Saurabh Gupta, Deva Ramanan
In this paper, we combine the best of both worlds with a modular approach that learns a spatial representation of a scene that is trained to be effective when coupled with traditional geometric planners.
no code implementations • 6 Dec 2019 • Mudit Verma, Siddhant Bhambri, Saurabh Gupta, Arun Balaji Buduru
Rapid advancements in the Internet of Things (IoT) have facilitated more efficient deployment of smart environment solutions for specific user requirement.
no code implementations • 5 Nov 2019 • Bharathan Balaji, Sunil Mallya, Sahika Genc, Saurabh Gupta, Leo Dirac, Vineet Khare, Gourav Roy, Tao Sun, Yunzhe Tao, Brian Townsend, Eddie Calleja, Sunil Muralidhara, Dhanasekar Karuppasamy
DeepRacer is a platform for end-to-end experimentation with RL and can be used to systematically investigate the key challenges in developing intelligent control systems.
no code implementations • 25 Oct 2019 • Yunzhe Tao, Saurabh Gupta, Satyapriya Krishna, Xiong Zhou, Orchid Majumder, Vineet Khare
Training deep neural networks from scratch on natural language processing (NLP) tasks requires significant amount of manually labeled text corpus and substantial time to converge, which usually cannot be satisfied by the customers.
no code implementations • 30 Sep 2019 • Rohan Chitnis, Shubham Tulsiani, Saurabh Gupta, Abhinav Gupta
Our insight is that for many tasks, the learning process can be decomposed into learning a state-independent task schema (a sequence of skills to execute) and a policy to choose the parameterizations of the skills in a state-dependent manner.
no code implementations • 16 Sep 2019 • Saurabh Gupta, Asmit Kumar Singh, Arun Balaji Buduru, Ponnurangam Kumaraguru
In the political context, hashtags on Twitter are used by users to campaign for their parties, spread news, or to get followers and get a general idea by following a discussion built around a hashtag.
2 code implementations • 19 Jun 2019 • Adithyavairavan Murali, Tao Chen, Kalyan Vasudev Alwala, Dhiraj Gandhi, Lerrel Pinto, Saurabh Gupta, Abhinav Gupta
This paper introduces PyRobot, an open-source robotics framework for research and benchmarking.
no code implementations • 29 May 2019 • Ashish Kumar, Saurabh Gupta, Jitendra Malik
We demonstrate our proposed approach in context of navigation, and show that we can successfully learn consistent and diverse visuomotor subroutines from passive egocentric videos.
no code implementations • 6 Mar 2019 • Somil Bansal, Varun Tolani, Saurabh Gupta, Jitendra Malik, Claire Tomlin
Model-based control is a popular paradigm for robot navigation because it can leverage a known dynamics model to efficiently plan robust robot trajectories.
2 code implementations • ICLR 2019 • Tao Chen, Saurabh Gupta, Abhinav Gupta
Numerous past works have tackled the problem of task-driven navigation.
no code implementations • NeurIPS 2018 • Ashish Kumar, Saurabh Gupta, David Fouhey, Sergey Levine, Jitendra Malik
Equipped with this abstraction, a second network observes the world and decides how to act to retrace the path under noisy actuation and a changing environment.
4 code implementations • 16 Sep 2018 • Michael Danielczuk, Matthew Matl, Saurabh Gupta, Andrew Li, Andrew Lee, Jeffrey Mahler, Ken Goldberg
We train a variant of Mask R-CNN with domain randomization on the generated dataset to perform category-agnostic instance segmentation without any hand-labeled data and we evaluate the trained network, which we refer to as Synthetic Depth (SD) Mask R-CNN, on a set of real, high-resolution depth images of challenging, densely-cluttered bins containing objects with highly-varied geometry.
Ranked #1 on Unseen Object Instance Segmentation on WISDOM
10 code implementations • 18 Jul 2018 • Peter Anderson, Angel Chang, Devendra Singh Chaplot, Alexey Dosovitskiy, Saurabh Gupta, Vladlen Koltun, Jana Kosecka, Jitendra Malik, Roozbeh Mottaghi, Manolis Savva, Amir R. Zamir
Skillful mobile operation in three-dimensional environments is a primary topic of study in Artificial Intelligence.
no code implementations • 21 Dec 2017 • Saurabh Gupta, David Fouhey, Sergey Levine, Jitendra Malik
This works presents a formulation for visual navigation that unifies map based spatial reasoning and path planning, with landmark based robust plan execution in noisy environments.
no code implementations • CVPR 2018 • Shubham Tulsiani, Saurabh Gupta, David Fouhey, Alexei A. Efros, Jitendra Malik
The goal of this paper is to take a single 2D image of a scene and recover the 3D structure in terms of a small set of factors: a layout representing the enclosing surfaces as well as a set of objects represented in terms of shape and pose.
6 code implementations • CVPR 2017 • Saurabh Gupta, Varun Tolani, James Davidson, Sergey Levine, Rahul Sukthankar, Jitendra Malik
The accumulated belief of the world enables the agent to track visited regions of the environment.
no code implementations • CVPR 2016 • Judy Hoffman, Saurabh Gupta, Trevor Darrell
Thus, our method transfers information commonly extracted from depth training data to a network which can extract that information from the RGB counterpart.
no code implementations • 25 Nov 2015 • Saurabh Gupta, Bharath Hariharan, Jitendra Malik
In this paper we explore two ways of using context for object detection.
1 code implementation • CVPR 2016 • Saurabh Gupta, Judy Hoffman, Jitendra Malik
In this work we propose a technique that transfers supervision between images from different modalities.
no code implementations • CVPR 2015 • Saurabh Gupta, Pablo Arbelaez, Ross Girshick, Jitendra Malik
The goal of this work is to represent objects in an RGB-D scene with corresponding 3D models from a library.
1 code implementation • 17 May 2015 • Jacob Devlin, Saurabh Gupta, Ross Girshick, Margaret Mitchell, C. Lawrence Zitnick
We explore a variety of nearest neighbor baseline approaches for image captioning.
1 code implementation • 17 May 2015 • Saurabh Gupta, Jitendra Malik
In this paper we introduce the problem of Visual Semantic Role Labeling: given an image we want to detect people doing actions and localize the objects of interaction.
no code implementations • IJCNLP 2015 • Jacob Devlin, Hao Cheng, Hao Fang, Saurabh Gupta, Li Deng, Xiaodong He, Geoffrey Zweig, Margaret Mitchell
Two recent approaches have achieved state-of-the-art results in image captioning.
18 code implementations • 1 Apr 2015 • Xinlei Chen, Hao Fang, Tsung-Yi Lin, Ramakrishna Vedantam, Saurabh Gupta, Piotr Dollar, C. Lawrence Zitnick
In this paper we describe the Microsoft COCO Caption dataset and evaluation server.
no code implementations • 16 Feb 2015 • Saurabh Gupta, Pablo Arbeláez, Ross Girshick, Jitendra Malik
The goal of this work is to replace objects in an RGB-D scene with corresponding 3D models from a library.
1 code implementation • CVPR 2015 • Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollár, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John C. Platt, C. Lawrence Zitnick, Geoffrey Zweig
The language model learns from a set of over 400, 000 image descriptions to capture the statistics of word usage.
Ranked #1 on Image Captioning on COCO Captions test
1 code implementation • 22 Jul 2014 • Saurabh Gupta, Ross Girshick, Pablo Arbeláez, Jitendra Malik
In this paper we study the problem of object detection for RGB-D images using semantically rich image and depth features.
Ranked #6 on Object Detection In Indoor Scenes on SUN RGB-D
no code implementations • CVPR 2013 • Saurabh Gupta, Pablo Arbelaez, Jitendra Malik
We address the problems of contour detection, bottomup grouping and semantic segmentation using RGB-D data.