TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Object Detection	nuScenes	DSVT	NDS	0.73	# 25
3D Object Detection	nuScenes	DSVT	mATE	0.25	# 304
3D Object Detection	nuScenes	DSVT	mASE	0.23	# 321
3D Object Detection	nuScenes	DSVT	mAOE	0.30	# 348
3D Object Detection	nuScenes	DSVT	mAVE	0.25	# 284
3D Object Detection	nuScenes	DSVT	mAAE	0.14	# 70
3D Object Detection	nuScenes LiDAR only	DSVT	NDS	72.7	# 1
3D Object Detection	nuScenes LiDAR only	DSVT	mAP	68.4	# 1
3D Object Detection	nuScenes LiDAR only	DSVT	NDS (val)	71.1	# 1
3D Object Detection	nuScenes LiDAR only	DSVT	mAP (val)	66.4	# 1
3D Object Detection	waymo cyclist	DSVT(val)	APH/L2	78.0	# 1
3D Object Detection	waymo pedestrian	DSVT(val)	APH/L2	76.4	# 1
3D Object Detection	waymo vehicle	DSVT(val)	APH/L2	74.1	# 2
3D Object Detection	waymo vehicle	DSVT(val)	L1 mAP	82.1	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dsvt-dynamic-sparse-voxel-transformer-with/3d-object-detection-on-nuscenes-lidar-only)](https://paperswithcode.com/sota/3d-object-detection-on-nuscenes-lidar-only?p=dsvt-dynamic-sparse-voxel-transformer-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dsvt-dynamic-sparse-voxel-transformer-with/3d-object-detection-on-waymo-cyclist)](https://paperswithcode.com/sota/3d-object-detection-on-waymo-cyclist?p=dsvt-dynamic-sparse-voxel-transformer-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dsvt-dynamic-sparse-voxel-transformer-with/3d-object-detection-on-waymo-pedestrian)](https://paperswithcode.com/sota/3d-object-detection-on-waymo-pedestrian?p=dsvt-dynamic-sparse-voxel-transformer-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dsvt-dynamic-sparse-voxel-transformer-with/3d-object-detection-on-waymo-vehicle)](https://paperswithcode.com/sota/3d-object-detection-on-waymo-vehicle?p=dsvt-dynamic-sparse-voxel-transformer-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/dsvt-dynamic-sparse-voxel-transformer-with/3d-object-detection-on-nuscenes)](https://paperswithcode.com/sota/3d-object-detection-on-nuscenes?p=dsvt-dynamic-sparse-voxel-transformer-with)`

DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets

CVPR 2023 · Haiyang Wang, Chen Shi, Shaoshuai Shi, Meng Lei, Sen Wang, Di He, Bernt Schiele, LiWei Wang ·

Designing an efficient yet deployment-friendly 3D backbone to handle sparse point clouds is a fundamental problem in 3D perception. Compared with the customized sparse convolution, the attention mechanism in Transformers is more appropriate for flexibly modeling long-range relationships and is easier to be deployed in real-world applications. However, due to the sparse characteristics of point clouds, it is non-trivial to apply a standard transformer on sparse points. In this paper, we present Dynamic Sparse Voxel Transformer (DSVT), a single-stride window-based voxel Transformer backbone for outdoor 3D perception. In order to efficiently process sparse points in parallel, we propose Dynamic Sparse Window Attention, which partitions a series of local regions in each window according to its sparsity and then computes the features of all regions in a fully parallel manner. To allow the cross-set connection, we design a rotated set partitioning strategy that alternates between two partitioning configurations in consecutive self-attention layers. To support effective downsampling and better encode geometric information, we also propose an attention-style 3D pooling module on sparse points, which is powerful and deployment-friendly without utilizing any customized CUDA operations. Our model achieves state-of-the-art performance with a broad range of 3D perception tasks. More importantly, DSVT can be easily deployed by TensorRT with real-time inference speed (27Hz). Code will be available at \url{https://github.com/Haiyang-W/DSVT}.

PDF Abstract CVPR 2023 PDF CVPR 2023 Abstract

Code

Add Remove Mark official

open-mmlab/OpenPCDet official

4,320

haiyang-w/dsvt official

327

open-mmlab/mmdetection3d

4,808

Tasks

Add Remove

3D Object Detection

object-detection

Object Detection

Datasets

nuScenes

Waymo Open Dataset

nuScenes LiDAR only

Results from the Paper

Edit

Ranked #1 on 3D Object Detection on nuScenes LiDAR only

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
3D Object Detection	nuScenes	DSVT	NDS	0.73	# 25	Compare
			mATE	0.25	# 304	Compare
			mASE	0.23	# 321	Compare
			mAOE	0.30	# 348	Compare
			mAVE	0.25	# 284	Compare
			mAAE	0.14	# 70	Compare
3D Object Detection	nuScenes LiDAR only	DSVT	NDS	72.7	# 1	Compare
			mAP	68.4	# 1	Compare
			NDS (val)	71.1	# 1	Compare
			mAP (val)	66.4	# 1	Compare
3D Object Detection	waymo cyclist	DSVT(val)	APH/L2	78.0	# 1	Compare
3D Object Detection	waymo pedestrian	DSVT(val)	APH/L2	76.4	# 1	Compare
3D Object Detection	waymo vehicle	DSVT(val)	APH/L2	74.1	# 2	Compare
3D Object Detection	waymo vehicle	DSVT(val)	L1 mAP	82.1	# 1	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • SPEED • Transformer

Edit Social Preview

DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove