no code implementations • CVPR 2024 • Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai
In this paper, we address this challenge by introducing a world model-based autonomous driving 4D representation learning framework, dubbed \emph{DriveWorld}, which is capable of pre-training from multi-camera driving videos in a spatio-temporal fashion.
1 code implementation • 6 May 2024 • Zheng Zhu, XiaoFeng Wang, Wangbo Zhao, Chen Min, Nianchen Deng, Min Dou, Yuqi Wang, Botian Shi, Kai Wang, Chi Zhang, Yang You, Zhaoxiang Zhang, Dawei Zhao, Liang Xiao, Jian Zhao, Jiwen Lu, Guan Huang
General world models represent a crucial pathway toward achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual environments to decision-making systems.
no code implementations • 14 Aug 2023 • Chen Min, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai
In this paper, we draw inspiration from Alberto Elfes' pioneering work in 1989, where he introduced the concept of the occupancy grid as World Models for robots.
2 code implementations • 30 May 2023 • Chen Min, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
When compared to monocular pre-training methods on the nuScenes dataset, UniScene shows a significant improvement of about 2. 0% in mAP and 2. 0% in NDS for multi-camera 3D object detection, as well as a 3% increase in mIoU for surrounding semantic scene completion.
no code implementations • 13 Nov 2022 • Liang Xiao, Jiaolong Xu, Dawei Zhao, Erke Shang, Qi Zhu, Bin Dai
In this work, we show that by simply applying consistency training with random data augmentation, state-of-the-art results on domain adaptation (DA) and generalization (DG) can be obtained.
2 code implementations • 20 Jun 2022 • Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai
This work proposes a solution to reduce the dependence on labelled 3D training data by leveraging pre-training on large-scale unlabeled outdoor LiDAR point clouds using masked autoencoders (MAE).
2 code implementations • 20 Jun 2022 • Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai
Freespace detection is an essential component of autonomous driving technology and plays an important role in trajectory planning.
Ranked #10 on Semantic Segmentation on SYN-UDTIRI
1 code implementation • 9 May 2021 • Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
The experimental results show that the proposed method outperforms state-of-the-art multimodal methods and is robust to the perturbations of the topometric map.
1 code implementation • 6 Apr 2021 • Chen Min, Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
Deep learning has recently demonstrated its promising performance for vision-based parking-slot detection.
no code implementations • 6 May 2020 • Li Wang, Dawei Zhao, Tao Wu, Hao Fu, Zhiyu Wang, Liang Xiao, Xin Xu, Bin Dai
3D moving object detection is one of the most critical tasks in dynamic scene analysis.