Search Results for author: Bin Dai

Found 28 papers, 12 papers with code

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

no code implementations CVPR 2024 Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai

In this paper, we address this challenge by introducing a world model-based autonomous driving 4D representation learning framework, dubbed \emph{DriveWorld}, which is capable of pre-training from multi-camera driving videos in a spatio-temporal fashion.

3D Object Detection Motion Forecasting +4

Towards Objectively Benchmarking Social Intelligence for Language Agents at Action Level

1 code implementation8 Apr 2024 Chenxu Wang, Bin Dai, Huaping Liu, Baoyuan Wang

To gauge the significance of agent architecture, we implement a target-driven planning (TDP) module as an adjunct to the existing agent.

Benchmarking

AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents

no code implementations29 Nov 2023 Duomin Wang, Bin Dai, Yu Deng, Baoyuan Wang

In this study, our goal is to create interactive avatar agents that can autonomously plan and animate nuanced facial movements realistically, from both visual and behavioral perspectives.

Neural Rendering

Fast and Accurate Deep Loop Closing and Relocalization for Reliable LiDAR SLAM

no code implementations15 Sep 2023 Chenghao Shi, Xieyuanli Chen, Junhao Xiao, Bin Dai, Huimin Lu

In the end, we integrate our LCR-Net into a SLAM system and achieve robust and accurate online LiDAR SLAM in outdoor driving environments.

Point Cloud Registration Pose Estimation +1

UniWorld: Autonomous Driving Pre-training via World Models

1 code implementation14 Aug 2023 Chen Min, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

In this paper, we draw inspiration from Alberto Elfes' pioneering work in 1989, where he introduced the concept of the occupancy grid as World Models for robots.

3D Object Detection Autonomous Driving +2

Controlling Character Motions without Observable Driving Source

no code implementations11 Aug 2023 Weiyuan Li, Bin Dai, Ziyi Zhou, Qi Yao, Baoyuan Wang

A high-level prior model can be easily injected on top to generate unlimited long and diverse sequences.

Diversity

PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm

no code implementations11 Jun 2023 Wensong Bai, Chao Zhang, Yichao Fu, Lingwei Peng, Hui Qian, Bin Dai

In this paper, we propose the first fully push-forward-based Distributional Reinforcement Learning algorithm, called Push-forward-based Actor-Critic EncourageR (PACER).

Continuous Control Distributional Reinforcement Learning +2

UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction for Autonomous Driving

2 code implementations30 May 2023 Chen Min, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai

When compared to monocular pre-training methods on the nuScenes dataset, UniScene shows a significant improvement of about 2. 0% in mAP and 2. 0% in NDS for multi-camera 3D object detection, as well as a 3% increase in mIoU for surrounding semantic scene completion.

3D Object Detection 3D Scene Reconstruction +2

RDMNet: Reliable Dense Matching Based Point Cloud Registration for Autonomous Driving

no code implementations31 Mar 2023 Chenghao Shi, Xieyuanli Chen, Huimin Lu, Wenbang Deng, Junhao Xiao, Bin Dai

The proposed 3D-RoFormer fuses 3D position information into the transformer network, efficiently exploiting point clouds' contextual and geometric information to generate robust superpoint correspondences.

Autonomous Driving Point Cloud Registration +1

Adversarial and Random Transformations for Robust Domain Adaptation and Generalization

no code implementations13 Nov 2022 Liang Xiao, Jiaolong Xu, Dawei Zhao, Erke Shang, Qi Zhu, Bin Dai

In this work, we show that by simply applying consistency training with random data augmentation, state-of-the-art results on domain adaptation (DA) and generalization (DG) can be obtained.

Data Augmentation Domain Adaptation

Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders

2 code implementations20 Jun 2022 Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

This work proposes a solution to reduce the dependence on labelled 3D training data by leveraging pre-training on large-scale unlabeled outdoor LiDAR point clouds using masked autoencoders (MAE).

3D Object Detection 3D Semantic Segmentation +6

ORFD: A Dataset and Benchmark for Off-Road Freespace Detection

2 code implementations20 Jun 2022 Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai

Freespace detection is an essential component of autonomous driving technology and plays an important role in trajectory planning.

Autonomous Driving Semantic Segmentation +1

On the Value of Infinite Gradients in Variational Autoencoder Models

no code implementations NeurIPS 2021 Bin Dai, Li Wenliang, David Wipf

A number of recent studies of continuous variational autoencoder (VAE) models have noted, either directly or indirectly, the tendency of various parameter gradients to drift towards infinity during training.

feature selection Open-Ended Question Answering

Trajectory Prediction for Autonomous Driving with Topometric Map

1 code implementation9 May 2021 Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai

The experimental results show that the proposed method outperforms state-of-the-art multimodal methods and is robust to the perturbations of the topometric map.

Autonomous Driving Trajectory Prediction

Attentional Graph Neural Network for Parking-slot Detection

1 code implementation6 Apr 2021 Chen Min, Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai

Deep learning has recently demonstrated its promising performance for vision-based parking-slot detection.

Graph Neural Network

Further Analysis of Outlier Detection with Deep Generative Models

1 code implementation NeurIPS 2020 Ziyu Wang, Bin Dai, David Wipf, Jun Zhu

The recent, counter-intuitive discovery that deep generative models (DGMs) can frequently assign a higher likelihood to outliers has implications for both outlier detection applications as well as our overall understanding of generative modeling.

Outlier Detection

Active Disturbance Rejection Control Design with Suppression of Sensor Noise Effects in Application to DC-DC Buck Power Converter

no code implementations7 Sep 2020 Krzysztof Łakomy, Rafal Madonski, Bin Dai, Jun Yang, Piotr Kicki, Maral Ansari, Shihua Li

The performance of active disturbance rejection control (ADRC) algorithms can be limited in practice by high-frequency measurement noise.

The Usual Suspects? Reassessing Blame for VAE Posterior Collapse

no code implementations ICML 2020 Bin Dai, Ziyu Wang, David Wipf

In narrow asymptotic settings Gaussian VAE models of continuous data have been shown to possess global optima aligned with ground-truth distributions.

Decoder

Diagnosing and Enhancing VAE Models

4 code implementations ICLR 2019 Bin Dai, David Wipf

Although variational autoencoders (VAEs) represent a widely influential deep generative model, many aspects of the underlying energy function remain poorly understood.

Decoder

Compressing Neural Networks using the Variational Information Bottelneck

1 code implementation ICML 2018 Bin Dai, Chen Zhu, Baining Guo, David Wipf

Neural networks can be compressed to reduce memory and computational requirements, or to increase accuracy by facilitating the use of a larger base architecture.

Compressing Neural Networks using the Variational Information Bottleneck

1 code implementation ICML 2018 Bin Dai, Chen Zhu, David Wipf

Neural networks can be compressed to reduce memory and computational requirements, or to increase accuracy by facilitating the use of a larger base architecture.

Understanding and Predicting The Attractiveness of Human Action Shot

no code implementations2 Nov 2017 Bin Dai, Baoyuan Wang, Gang Hua

Selecting attractive photos from a human action shot sequence is quite challenging, because of the subjective nature of the "attractiveness", which is mainly a combined factor of human pose in action and the background.

Hidden Talents of the Variational Autoencoder

1 code implementation16 Jun 2017 Bin Dai, Yu Wang, John Aston, Gang Hua, David Wipf

Variational autoencoders (VAE) represent a popular, flexible form of deep generative model that can be stochastically fit to samples from a given random process using an information-theoretic variational bound on the true underlying distribution.

Dimensionality Reduction

Multivariate Bernoulli distribution

no code implementations8 Jun 2012 Bin Dai, Shilin Ding, Grace Wahba

In this paper, we consider the multivariate Bernoulli distribution as a model to estimate the structure of graphs with binary nodes.

Variable Selection

Cannot find the paper you are looking for? You can Submit a new open access paper.