no code implementations • 15 Apr 2022 • Vitor Guizilini, Rares Ambrus, Dian Chen, Sergey Zakharov, Adrien Gaidon
Experiments on the KITTI and DDAD datasets show that our DepthFormer architecture establishes a new state of the art in self-supervised monocular depth estimation, and is even competitive with highly specialized supervised single-frame architectures.
no code implementations • 28 Mar 2022 • Vitor Guizilini, Kuan-Hui Lee, Rares Ambrus, Adrien Gaidon
However, the simultaneous self-supervised learning of depth and scene flow is ill-posed, as there are infinitely many combinations that result in the same 3D point.
no code implementations • 6 Dec 2021 • Jiading Fang, Igor Vasiljevic, Vitor Guizilini, Rares Ambrus, Greg Shakhnarovich, Adrien Gaidon, Matthew R. Walter
Camera calibration is integral to robotics and computer vision algorithms that seek to infer geometric properties of the scene from visual input streams.
no code implementations • ICCV 2021 • Aditya Ganeshan, Alexis Vallet, Yasunori Kudo, Shin-ichi Maeda, Tommi Kerola, Rares Ambrus, Dennis Park, Adrien Gaidon
Deep learning models for semantic segmentation rely on expensive, large-scale, manually annotated datasets.
Ranked #13 on
Semantic Segmentation
on NYU Depth v2
1 code implementation • ICCV 2021 • Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, Adrien Gaidon
Recent progress in 3D object detection from single images leverages monocular depth estimation as a way to produce 3D pointclouds, turning cameras into pseudo-lidar sensors.
Ranked #1 on
Monocular 3D Object Detection
on KITTI Pedestrian Hard
(using extra training data)
no code implementations • 31 Mar 2021 • Vitor Guizilini, Igor Vasiljevic, Rares Ambrus, Greg Shakhnarovich, Adrien Gaidon
In this work, we extend monocular self-supervised depth and ego-motion estimation to large-baseline multi-camera rigs.
1 code implementation • CVPR 2021 • Vitor Guizilini, Rares Ambrus, Wolfram Burgard, Adrien Gaidon
Estimating scene geometry from data obtained with cost-effective sensors is key for robots and self-driving cars.
no code implementations • ICCV 2021 • Vitor Guizilini, Jie Li, Rares Ambrus, Adrien Gaidon
Simulators can efficiently generate large amounts of labeled synthetic data with perfect supervision for hard-to-label tasks like semantic segmentation.
no code implementations • 5 Jan 2021 • Rares Ambrus, Vitor Guizilini, Naveen Kuppuswamy, Andrew Beaulieu, Adrien Gaidon, Alex Alspach
Fluid-filled soft visuotactile sensors such as the Soft-bubbles alleviate key challenges for robust manipulation, as they enable reliable grasps along with the ability to obtain high-resolution sensory feedback on contact geometry and forces.
no code implementations • 26 Dec 2020 • Hsu-kuang Chiu, Jie Li, Rares Ambrus, Jeannette Bohg
Second, we propose to learn a metric that combines the Mahalanobis and feature distances when comparing a track and a new detection in data association.
no code implementations • 29 Aug 2020 • Andreas Bühler, Adrien Gaidon, Andrei Cramariuc, Rares Ambrus, Guy Rosman, Wolfram Burgard
In this work, we propose a behavioral cloning approach that can safely leverage imperfect perception without being conservative.
1 code implementation • 15 Aug 2020 • Igor Vasiljevic, Vitor Guizilini, Rares Ambrus, Sudeep Pillai, Wolfram Burgard, Greg Shakhnarovich, Adrien Gaidon
Self-supervised learning has emerged as a powerful tool for depth and ego-motion estimation, leading to state-of-the-art results on benchmark datasets.
1 code implementation • ICLR 2020 • Vitor Guizilini, Rui Hou, Jie Li, Rares Ambrus, Adrien Gaidon
Instead of using semantic labels and proxy losses in a multi-task approach, we propose a new architecture leveraging fixed pretrained semantic segmentation networks to guide self-supervised representation learning via pixel-adaptive convolutions.
2 code implementations • ICLR 2020 • Jiexiong Tang, Hanme Kim, Vitor Guizilini, Sudeep Pillai, Rares Ambrus
By making the sampling of inlier-outlier sets from point-pair correspondences fully differentiable within the keypoint learning framework, we show that are able to simultaneously self-supervise keypoint description and improve keypoint matching.
1 code implementation • 7 Dec 2019 • Jiexiong Tang, Rares Ambrus, Vitor Guizilini, Sudeep Pillai, Hanme Kim, Patric Jensfelt, Adrien Gaidon
Detecting and matching robust viewpoint-invariant keypoints is critical for visual SLAM and Structure-from-Motion.
no code implementations • 4 Oct 2019 • Vitor Guizilini, Jie Li, Rares Ambrus, Sudeep Pillai, Adrien Gaidon
Dense depth estimation from a single image is a key problem in computer vision, with exciting applications in a multitude of robotic tasks.
no code implementations • 4 Oct 2019 • Rares Ambrus, Vitor Guizilini, Jie Li, Sudeep Pillai, Adrien Gaidon
Learning depth and camera ego-motion from raw unlabeled RGB video streams is seeing exciting progress through self-supervision from strong geometric cues.
3 code implementations • CVPR 2020 • Vitor Guizilini, Rares Ambrus, Sudeep Pillai, Allan Raventos, Adrien Gaidon
Although cameras are ubiquitous, robotic platforms typically rely on active sensors like LiDAR for direct 3D perception.
no code implementations • 3 Oct 2018 • Sudeep Pillai, Rares Ambrus, Adrien Gaidon
Both contributions provide significant performance gains over the state-of-the-art in self-supervised depth and pose estimation on the public KITTI benchmark.
no code implementations • 18 Oct 2017 • Johan Ekekrantz, Nils Bore, Rares Ambrus, John Folkesson, Patric Jensfelt
In this paper we introduce a system for unsupervised object discovery and segmentation of RGBD-images.