no code implementations • CVPR 2024 • Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai
In this paper, we address this challenge by introducing a world model-based autonomous driving 4D representation learning framework, dubbed \emph{DriveWorld}, which is capable of pre-training from multi-camera driving videos in a spatio-temporal fashion.
no code implementations • 14 Aug 2023 • Chen Min, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai
In this paper, we draw inspiration from Alberto Elfes' pioneering work in 1989, where he introduced the concept of the occupancy grid as World Models for robots.
2 code implementations • 30 May 2023 • Chen Min, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
When compared to monocular pre-training methods on the nuScenes dataset, UniScene shows a significant improvement of about 2. 0% in mAP and 2. 0% in NDS for multi-camera 3D object detection, as well as a 3% increase in mIoU for surrounding semantic scene completion.
2 code implementations • 20 Jun 2022 • Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai
Freespace detection is an essential component of autonomous driving technology and plays an important role in trajectory planning.
Ranked #10 on Semantic Segmentation on SYN-UDTIRI
2 code implementations • 20 Jun 2022 • Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai
This work proposes a solution to reduce the dependence on labelled 3D training data by leveraging pre-training on large-scale unlabeled outdoor LiDAR point clouds using masked autoencoders (MAE).
1 code implementation • 9 May 2021 • Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
The experimental results show that the proposed method outperforms state-of-the-art multimodal methods and is robust to the perturbations of the topometric map.
1 code implementation • 6 Apr 2021 • Chen Min, Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
Deep learning has recently demonstrated its promising performance for vision-based parking-slot detection.