no code implementations • CVPR 2024 • Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai
In this paper, we address this challenge by introducing a world model-based autonomous driving 4D representation learning framework, dubbed \emph{DriveWorld}, which is capable of pre-training from multi-camera driving videos in a spatio-temporal fashion.
1 code implementation • 8 Apr 2024 • Chenxu Wang, Bin Dai, Huaping Liu, Baoyuan Wang
To gauge the significance of agent architecture, we implement a target-driven planning (TDP) module as an adjunct to the existing agent.
no code implementations • 29 Nov 2023 • Duomin Wang, Bin Dai, Yu Deng, Baoyuan Wang
In this study, our goal is to create interactive avatar agents that can autonomously plan and animate nuanced facial movements realistically, from both visual and behavioral perspectives.
no code implementations • 15 Sep 2023 • Chenghao Shi, Xieyuanli Chen, Junhao Xiao, Bin Dai, Huimin Lu
In the end, we integrate our LCR-Net into a SLAM system and achieve robust and accurate online LiDAR SLAM in outdoor driving environments.
1 code implementation • 14 Aug 2023 • Chen Min, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai
In this paper, we draw inspiration from Alberto Elfes' pioneering work in 1989, where he introduced the concept of the occupancy grid as World Models for robots.
no code implementations • 11 Aug 2023 • Weiyuan Li, Bin Dai, Ziyi Zhou, Qi Yao, Baoyuan Wang
A high-level prior model can be easily injected on top to generate unlimited long and diverse sequences.
no code implementations • 11 Jun 2023 • Wensong Bai, Chao Zhang, Yichao Fu, Lingwei Peng, Hui Qian, Bin Dai
In this paper, we propose the first fully push-forward-based Distributional Reinforcement Learning algorithm, called Push-forward-based Actor-Critic EncourageR (PACER).
2 code implementations • 30 May 2023 • Chen Min, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
When compared to monocular pre-training methods on the nuScenes dataset, UniScene shows a significant improvement of about 2. 0% in mAP and 2. 0% in NDS for multi-camera 3D object detection, as well as a 3% increase in mIoU for surrounding semantic scene completion.
no code implementations • 31 Mar 2023 • Chenghao Shi, Xieyuanli Chen, Huimin Lu, Wenbang Deng, Junhao Xiao, Bin Dai
The proposed 3D-RoFormer fuses 3D position information into the transformer network, efficiently exploiting point clouds' contextual and geometric information to generate robust superpoint correspondences.
no code implementations • 13 Nov 2022 • Liang Xiao, Jiaolong Xu, Dawei Zhao, Erke Shang, Qi Zhu, Bin Dai
In this work, we show that by simply applying consistency training with random data augmentation, state-of-the-art results on domain adaptation (DA) and generalization (DG) can be obtained.
2 code implementations • 20 Jun 2022 • Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai
This work proposes a solution to reduce the dependence on labelled 3D training data by leveraging pre-training on large-scale unlabeled outdoor LiDAR point clouds using masked autoencoders (MAE).
2 code implementations • 20 Jun 2022 • Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai
Freespace detection is an essential component of autonomous driving technology and plays an important role in trajectory planning.
Ranked #10 on Semantic Segmentation on SYN-UDTIRI
no code implementations • NeurIPS 2021 • Bin Dai, Li Wenliang, David Wipf
A number of recent studies of continuous variational autoencoder (VAE) models have noted, either directly or indirectly, the tendency of various parameter gradients to drift towards infinity during training.
no code implementations • 26 Oct 2021 • Jiuhai Chen, Chen Zhu, Bin Dai
In this paper, we study how SSL can enhance the performance of the out-of-distribution (OOD) detection task.
Out-of-Distribution Detection Out of Distribution (OOD) Detection +1
1 code implementation • 9 May 2021 • Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
The experimental results show that the proposed method outperforms state-of-the-art multimodal methods and is robust to the perturbations of the topometric map.
1 code implementation • 6 Apr 2021 • Chen Min, Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai
Deep learning has recently demonstrated its promising performance for vision-based parking-slot detection.
1 code implementation • NeurIPS 2020 • Ziyu Wang, Bin Dai, David Wipf, Jun Zhu
The recent, counter-intuitive discovery that deep generative models (DGMs) can frequently assign a higher likelihood to outliers has implications for both outlier detection applications as well as our overall understanding of generative modeling.
no code implementations • 7 Sep 2020 • Krzysztof Łakomy, Rafal Madonski, Bin Dai, Jun Yang, Piotr Kicki, Maral Ansari, Shihua Li
The performance of active disturbance rejection control (ADRC) algorithms can be limited in practice by high-frequency measurement noise.
no code implementations • 6 May 2020 • Li Wang, Dawei Zhao, Tao Wu, Hao Fu, Zhiyu Wang, Liang Xiao, Xin Xu, Bin Dai
3D moving object detection is one of the most critical tasks in dynamic scene analysis.
no code implementations • ICML 2020 • Bin Dai, Ziyu Wang, David Wipf
In narrow asymptotic settings Gaussian VAE models of continuous data have been shown to possess global optima aligned with ground-truth distributions.
no code implementations • 9 Jul 2019 • Xiaoxiang Zhang, Hao Fu, Bin Dai
Object detection and classification based on lidar point cloud is a key technology for UGV.
4 code implementations • ICLR 2019 • Bin Dai, David Wipf
Although variational autoencoders (VAEs) represent a widely influential deep generative model, many aspects of the underlying energy function remain poorly understood.
1 code implementation • ICML 2018 • Bin Dai, Chen Zhu, Baining Guo, David Wipf
Neural networks can be compressed to reduce memory and computational requirements, or to increase accuracy by facilitating the use of a larger base architecture.
1 code implementation • ICML 2018 • Bin Dai, Chen Zhu, David Wipf
Neural networks can be compressed to reduce memory and computational requirements, or to increase accuracy by facilitating the use of a larger base architecture.
no code implementations • 2 Nov 2017 • Bin Dai, Baoyuan Wang, Gang Hua
Selecting attractive photos from a human action shot sequence is quite challenging, because of the subjective nature of the "attractiveness", which is mainly a combined factor of human pose in action and the background.
1 code implementation • 16 Jun 2017 • Bin Dai, Yu Wang, John Aston, Gang Hua, David Wipf
Variational autoencoders (VAE) represent a popular, flexible form of deep generative model that can be stochastically fit to samples from a given random process using an information-theoretic variational bound on the true underlying distribution.
no code implementations • COLING 2016 • Shoushan Li, Bin Dai, ZhengXian Gong, Guodong Zhou
In gender classification, labeled data is often limited while unlabeled data is ample.
no code implementations • 8 Jun 2012 • Bin Dai, Shilin Ding, Grace Wahba
In this paper, we consider the multivariate Bernoulli distribution as a model to estimate the structure of graphs with binary nodes.