no code implementations • 17 Jun 2024 • Letian Wang, Seung Wook Kim, Jiawei Yang, Cunjun Yu, Boris Ivanovic, Steven L. Waslander, Yue Wang, Sanja Fidler, Marco Pavone, Peter Karkus
We propose DistillNeRF, a self-supervised learning framework addressing the challenge of understanding 3D environments from limited 2D observations in autonomous driving.
1 code implementation • CVPR 2024 • Yang Zhou, Hao Shao, Letian Wang, Steven L. Waslander, Hongsheng Li, Yu Liu
Context information, such as road maps and surrounding agents' states, provides crucial geometric and semantic information for motion behavior prediction.
1 code implementation • 19 Feb 2024 • Chang Won Lee, Steven L. Waslander
Multi-object tracking (MOT) methods have seen a significant boost in performance recently, due to strong interest from the research community and steadily improving object detection methods.
no code implementations • 9 Feb 2024 • Evan D. Cook, Marc-Antoine Lavoie, Steven L. Waslander
This is a post-hoc method which can be applied to any pretrained model, and involves training a lightweight auxiliary normalizing flow model to perform the out-of-distribution detection via density thresholding.
1 code implementation • CVPR 2024 • Hao Shao, Yuxuan Hu, Letian Wang, Steven L. Waslander, Yu Liu, Hongsheng Li
On the other hand, previous autonomous driving methods tend to rely on limited-format inputs (e. g. sensor data and navigation waypoints), restricting the vehicle's ability to understand language information and interact with humans.
1 code implementation • CVPR 2024 • Ziwei Liao, Jialiang Zhu, Chunyu Wang, Han Hu, Steven L. Waslander
In this work, we aim to improve the 3D reasoning ability of Transformers in multi-view 3D human pose estimation.
3D Human Pose Estimation Multi-view 3D Human Pose Estimation
1 code implementation • 17 Sep 2023 • Ziwei Liao, Jun Yang, Jingxing Qian, Angela P. Schoellig, Steven L. Waslander
Unlike current state-of-the-art approaches, we explicitly model the uncertainty of the object shapes and poses during our optimization, resulting in a high-quality object-level mapping system.
no code implementations • 17 Jun 2023 • Ziwei Liao, Steven L. Waslander
We propose a method to model uncertainty as part of the representation and define an uncertainty-aware encoder which generates latent codes with uncertainty directly from individual input images.
no code implementations • CVPR 2023 • Hao Shao, Letian Wang, RuoBing Chen, Steven L. Waslander, Hongsheng Li, Yu Liu
The large-scale deployment of autonomous vehicles is yet to come, and one of the major remaining challenges lies in urban dense traffic scenarios.
Ranked #1 on Autonomous Driving on CARLA Leaderboard
1 code implementation • 8 May 2023 • Letian Wang, Jie Liu, Hao Shao, Wenshuo Wang, RuoBing Chen, Yu Liu, Steven L. Waslander
Inspired by this, we propose ASAP-RL, an efficient reinforcement learning algorithm for autonomous driving that simultaneously leverages motion skills and expert priors.
1 code implementation • 27 Apr 2023 • Jenny Xu, Steven L. Waslander
Current LiDAR-based 3D object detectors for autonomous driving are almost entirely trained on human-annotated data collected in specific geographical domains with specific sensor setups, making it difficult to adapt to a different domain.
1 code implementation • 27 Apr 2023 • Barza Nisar, Hruday Vishal Kanna Anand, Steven L. Waslander
Accurate 3D object detection in all weather conditions remains a key challenge to enable the widespread deployment of autonomous vehicles, as most work to date has been performed on clear weather data.
1 code implementation • CVPR 2023 • Anas Mahmoud, Jordan S. K. Hu, Tianshu Kuai, Ali Harakeh, Liam Paull, Steven L. Waslander
However, image-to point representation learning for autonomous driving datasets faces two main challenges: 1) the abundance of self-similarity, which results in the contrastive losses pushing away semantically similar point and image regions and thus disturbing the local semantic structure of the learned representations, and 2) severe class imbalance as pretraining gets dominated by over-represented classes.
no code implementations • 24 Nov 2022 • Ali Harakeh, Jordan Hu, Naiqing Guan, Steven L. Waslander, Liam Paull
A common approach to model uncertainty is to choose a parametric distribution and fit the data to it using maximum likelihood estimation.
no code implementations • 17 Aug 2022 • John Willes, Cody Reading, Steven L. Waslander
We then perform a learned regression on each track/detection feature pair to estimate affinities, and use a robust two-stage data association and track management approach to produce the final tracks.
1 code implementation • CVPR 2022 • Jordan S. K. Hu, Tianshu Kuai, Steven L. Waslander
LiDAR has become one of the primary 3D object detection sensors in autonomous driving.
no code implementations • 2 Mar 2022 • Anas Mahmoud, Jordan S. K. Hu, Steven L. Waslander
Sequential fusion methods suffer from a limited number of pixel and point correspondences due to point cloud sparsity, or their performance is strictly capped by the detections of one of the modalities.
no code implementations • 27 Feb 2022 • Jun Yang, Steven L. Waslander
Depth acquisition with the active stereo camera is a challenging task for highly reflective objects.
no code implementations • 30 Nov 2021 • Jordan S. K. Hu, Steven L. Waslander
Autonomous driving datasets are often skewed and in particular, lack training data for objects at farther distances from the ego vehicle.
no code implementations • 8 Oct 2021 • Samuel Looper, Steven L. Waslander
Model-based control methods for robotic systems such as quadrotors, autonomous driving vehicles and flexible manipulators require motion models that generate accurate predictions of complex nonlinear system dynamics over long periods of time.
2 code implementations • CVPR 2021 • Cody Reading, Ali Harakeh, Julia Chae, Steven L. Waslander
We validate our approach on the KITTI 3D object detection benchmark, where we rank 1st among published monocular methods.
no code implementations • 8 Feb 2021 • Justin Tomasi, Brandon Wagstaff, Steven L. Waslander, Jonathan Kelly
Successful visual navigation depends upon capturing images that contain sufficient useful information.
3 code implementations • 13 Jan 2021 • Ali Harakeh, Steven L. Waslander
We show that in the context of object detection, training variance networks with negative log likelihood (NLL) can lead to high entropy predictive distributions regardless of the correctness of the output mean.
no code implementations • ICLR 2021 • Ali Harakeh, Steven L. Waslander
We show that in the context of object detection, training variance networks with negative log likelihood (NLL) can lead to high entropy predictive distributions regardless of the correctness of the output mean.
no code implementations • 11 Mar 2020 • Chengyao Li, Jason Ku, Steven L. Waslander
To tackle these two issues, we propose CG-Stereo, a confidence-guided stereo 3D object detection pipeline that uses separate decoders for foreground and background pixels during depth estimation, and leverages the confidence estimation from the depth estimation network as a soft attention mechanism in the 3D object detector.
3D Object Detection From Stereo Images Autonomous Driving +3
no code implementations • 18 Sep 2019 • Chengyao Li, Steven L. Waslander
In this paper, we propose a novel approach inspired by RAIM to monitor the integrity of optimization-based visual localization and calculate the protection level of a state estimate, i. e. the largest possible translational error in each direction.
no code implementations • 17 Sep 2019 • Alex D. Pon, Jason Ku, Chengyao Li, Steven L. Waslander
The issue with existing stereo matching networks is that they are designed for disparity estimation, not 3D object detection; the shape and accuracy of object point clouds are not the focus.
3D Object Detection From Stereo Images Autonomous Driving +5
no code implementations • 15 Jul 2019 • Jason Ku, Alex D. Pon, Sean Walsh, Steven L. Waslander
Accurately estimating the orientation of pedestrians is an important and challenging task for autonomous driving because this information is essential for tracking and predicting pedestrian behavior.
1 code implementation • CVPR 2019 • Jason Ku, Alex D. Pon, Steven L. Waslander
We present MonoPSR, a monocular 3D object detection method that leverages proposals and shape reconstruction.
Ranked #13 on Vehicle Pose Estimation on KITTI Cars Hard
2 code implementations • 9 Mar 2019 • Ali Harakeh, Michael Smart, Steven L. Waslander
When incorporating deep neural networks into robotic systems, a major challenge is the lack of uncertainty measures associated with their output predictions.
1 code implementation • 29 Nov 2018 • Pranav Ganti, Steven L. Waslander
In order to facilitate long-term localization using a visual simultaneous localization and mapping (SLAM) algorithm, careful feature selection can help ensure that reference points persist over long durations and the runtime and storage complexity of the algorithm remain consistent.
no code implementations • 25 Jul 2018 • Qi Chen, Lei Wang, Yifan Wu, Guangming Wu, Zhiling Guo, Steven L. Waslander
In this paper, we present a new large-scale benchmark dataset termed Aerial Imagery for Roof Segmentation (AIRS).
no code implementations • 24 Jul 2018 • Christopher L. Choi, Jason Rebello, Leonid Koppel, Pranav Ganti, Arun Das, Steven L. Waslander
In this paper, we present an encoderless approach for DCC calibration which simultaneously estimates the kinematic parameters of the transformation chain as well as the unknown joint angles.
no code implementations • 16 Jul 2018 • Jungwook Lee, Sean Walsh, Ali Harakeh, Steven L. Waslander
Training 3D object detectors for autonomous driving has been limited to small datasets due to the effort required to generate annotations.
2 code implementations • 20 Jun 2018 • Alex D. Pon, Oles Andrienko, Ali Harakeh, Steven L. Waslander
The root cause of this issue is that no public dataset contains both traffic light and sign labels, which leads to difficulties in developing a joint detection framework.
no code implementations • 20 May 2018 • Nima Mohajerin, Steven L. Waslander
In this work, the state initialization problem is addressed using Neural Networks (NNs) to effectively train a variety of RNNs for modeling two aerial vehicles, a helicopter and a quadrotor, from experimental data.
2 code implementations • 31 Jan 2018 • Jason Ku, Ali Harakeh, Steven L. Waslander
With the rise of data driven deep neural networks as a realization of universal function approximators, most research on computer vision problems has moved away from hand crafted classical image processing algorithms.
no code implementations • 23 Sep 2015 • Siddhant Ahuja, Peter Iles, Steven L. Waslander
Topographic mapping in planetary environments relies on accurate 3D scan registration methods.
no code implementations • 25 Jun 2015 • Michael J. Tribou, David W. L. Wang, Steven L. Waslander
An analysis of the relative motion and point feature model configurations leading to solution degeneracy is presented, for the case of a Simultaneous Localization and Mapping system using multicamera clusters with non-overlapping fields-of-view.