Search Results for author: Chen Min

Found 10 papers, 6 papers with code

Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders

2 code implementations20 Jun 2022 Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

This work proposes a solution to reduce the dependence on labelled 3D training data by leveraging pre-training on large-scale unlabeled outdoor LiDAR point clouds using masked autoencoders (MAE).

3D Object Detection 3D Semantic Segmentation +6

UniScene: Multi-Camera Unified Pre-training via 3D Scene Reconstruction

2 code implementations30 May 2023 Chen Min, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai

When compared to monocular pre-training methods on the nuScenes dataset, UniScene shows a significant improvement of about 2. 0% in mAP and 2. 0% in NDS for multi-camera 3D object detection, as well as a 3% increase in mIoU for surrounding semantic scene completion.

3D Object Detection 3D Scene Reconstruction +2

Attentional Graph Neural Network for Parking-slot Detection

1 code implementation6 Apr 2021 Chen Min, Jiaolong Xu, Liang Xiao, Dawei Zhao, Yiming Nie, Bin Dai

Deep learning has recently demonstrated its promising performance for vision-based parking-slot detection.

ORFD: A Dataset and Benchmark for Off-Road Freespace Detection

2 code implementations20 Jun 2022 Chen Min, Weizhong Jiang, Dawei Zhao, Jiaolong Xu, Liang Xiao, Yiming Nie, Bin Dai

Freespace detection is an essential component of autonomous driving technology and plays an important role in trajectory planning.

Autonomous Driving Trajectory Planning

AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network

1 code implementation ICCV 2021 Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang

To overcome the difficulty of varying occlusion in complex scenes, we propose an inter-view cost volume aggregation module for adaptive pixel-wise view aggregation, which is able to preserve better-matched pairs among all views.

Ranked #10 on Point Clouds on Tanks and Temples (Mean F1 (Intermediate) metric)

3D Reconstruction Point Clouds

UniWorld: Autonomous Driving Pre-training via World Models

1 code implementation14 Aug 2023 Chen Min, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

In this paper, we draw inspiration from Alberto Elfes' pioneering work in 1989, where he introduced the concept of the occupancy grid as World Models for robots.

3D Object Detection Autonomous Driving +2

Label-less Learning for Traffic Control in an Edge Network

no code implementations29 Aug 2018 Chen Min, Hao Yixue, Lin Kai, Yuan Zhiyong, Hu Long

In order to solve this problem, we design a traffic control algorithm based on label-less learning on the edge cloud, which is dubbed as LLTC.

Emotion Recognition

Deep Learning for Multi-View Stereo via Plane Sweep: A Survey

no code implementations18 Jun 2021 Qingtian Zhu, Chen Min, Zizhuang Wei, Yisong Chen, Guoping Wang

3D reconstruction has lately attracted increasing attention due to its wide application in many areas, such as autonomous driving, robotics and virtual reality.

3D Reconstruction Autonomous Driving

STS: Surround-view Temporal Stereo for Multi-view 3D Detection

no code implementations22 Aug 2022 Zengran Wang, Chen Min, Zheng Ge, Yinhao Li, Zeming Li, Hongyu Yang, Di Huang

Instead of using a sole monocular depth method, in this work, we propose a novel Surround-view Temporal Stereo (STS) technique that leverages the geometry correspondence between frames across time to facilitate accurate depth learning.

3D Object Detection Depth Estimation +4

Adaptive Learning for Multi-view Stereo Reconstruction

no code implementations8 Apr 2024 Qinglu Min, Jie Zhao, Zhihao Zhang, Chen Min

Deep learning has recently demonstrated its excellent performance on the task of multi-view stereo (MVS).

Cannot find the paper you are looking for? You can Submit a new open access paper.