LVIC: Multi-modality segmentation by Lifting Visual Info as Cue

8 Mar 2024  ·  ZiChao Dong, Bowen Pang, Xufeng Huang, Hang Ji, Xin Zhan, Junbo Chen

Multi-modality fusion has proven to be an effective method for 3D perception in autonomous driving. However, most current multi-modality fusion pipelines for LiDAR semantic segmentation rely on complicated fusion mechanisms. Point painting is a straightforward method that directly binds LiDAR points with visual information. Unfortunately, previous point-painting-style methods suffer from projection error between the camera and the LiDAR. In our experiments, we find that this projection error is the devil in point painting. Consequently, we propose a depth-aware point painting mechanism, which significantly boosts multi-modality fusion. Beyond that, we take a deeper look at which visual features are best suited for LiDAR semantic segmentation. By Lifting Visual Information as Cue, LVIC ranks 1st on the nuScenes LiDAR semantic segmentation benchmark. Our experiments demonstrate its robustness and effectiveness. Code will be made publicly available soon.
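The abstract does not spell out the depth-aware painting mechanism, but the basic idea of point painting plus a depth-consistency check can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function name `paint_points`, the use of a per-pixel depth estimate `est_depth`, and the `depth_tol` threshold are all assumptions made for the example.

```python
# Sketch of point painting with a simple depth-consistency gate.
# NOT the exact LVIC mechanism; calibration names and the depth-tolerance
# heuristic are illustrative assumptions.
import numpy as np

def paint_points(points, image_feats, est_depth, lidar_to_cam, cam_intrinsics,
                 depth_tol=1.0):
    """Attach per-pixel visual features (e.g. semantic scores) to LiDAR points.

    points:         (N, 3) LiDAR points in the LiDAR frame.
    image_feats:    (H, W, C) visual cue map from the camera branch.
    est_depth:      (H, W) per-pixel depth estimate for the consistency check.
    lidar_to_cam:   (4, 4) extrinsic transform from LiDAR to camera frame.
    cam_intrinsics: (3, 3) camera intrinsic matrix.
    depth_tol:      tolerance in meters between projected and estimated depth.
    """
    n = points.shape[0]
    h, w, c = image_feats.shape

    # Transform LiDAR points into the camera frame.
    pts_h = np.concatenate([points, np.ones((n, 1))], axis=1)   # (N, 4)
    pts_cam = (lidar_to_cam @ pts_h.T).T[:, :3]                 # (N, 3)

    # Keep only points in front of the camera.
    in_front = pts_cam[:, 2] > 1e-3

    # Project onto the image plane.
    uvz = (cam_intrinsics @ pts_cam.T).T                        # (N, 3)
    uv = uvz[:, :2] / np.clip(uvz[:, 2:3], 1e-3, None)
    u = np.round(uv[:, 0]).astype(int)
    v = np.round(uv[:, 1]).astype(int)
    in_image = (u >= 0) & (u < w) & (v >= 0) & (v < h) & in_front

    painted = np.zeros((n, c), dtype=image_feats.dtype)
    valid = np.where(in_image)[0]

    # Depth-aware gate: drop paintings whose projected depth disagrees with
    # the per-pixel depth estimate, filtering wrong point-pixel associations
    # caused by occlusion and projection error at object boundaries.
    proj_depth = pts_cam[valid, 2]
    pix_depth = est_depth[v[valid], u[valid]]
    keep = valid[np.abs(proj_depth - pix_depth) < depth_tol]

    painted[keep] = image_feats[v[keep], u[keep]]
    return np.concatenate([points, painted], axis=1)            # (N, 3 + C)
```

In this sketch, points that fail the depth check simply keep zero visual features; plain point painting corresponds to skipping the gate and copying features for every in-image point.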


Datasets

nuScenes, ScanNetV2


Results from the Paper


Task                  Dataset     Model       Metric Name   Metric Value   Global Rank
3D Object Detection   ScanNetV2   UDeerLvic   mAP@0.25      78.0           #1
3D Object Detection   ScanNetV2   UDeerLvic   mAP@0.5       65.8           #2
