TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	S3DIS Area5	StratifiedTransformer	mIoU	72.0	# 17
Semantic Segmentation	S3DIS Area5	StratifiedTransformer	oAcc	91.5	# 9
Semantic Segmentation	S3DIS Area5	StratifiedTransformer	mAcc	78.1	# 11
Semantic Segmentation	S3DIS Area5	StratifiedTransformer	Number of params	8.0M	# 48
Semantic Segmentation	ScanNet	StratifiedFormer	test mIoU	73.7	# 14
Semantic Segmentation	ScanNet	StratifiedFormer	val mIoU	74.3	# 13

Stratified Transformer for 3D Point Cloud Segmentation

CVPR 2022 · Xin Lai, Jianhui Liu, Li Jiang, LiWei Wang, Hengshuang Zhao, Shu Liu, Xiaojuan Qi, Jiaya Jia

3D point cloud segmentation has made tremendous progress in recent years. Most current methods focus on aggregating local features, but fail to directly model long-range dependencies. In this paper, we propose Stratified Transformer that is able to capture long-range contexts and demonstrates strong generalization ability and high performance. Specifically, we first put forward a novel key sampling strategy. For each query point, we sample nearby points densely and distant points sparsely as its keys in a stratified way, which enables the model to enlarge the effective receptive field and enjoy long-range contexts at a low computational cost. Also, to combat the challenges posed by irregular point arrangements, we propose first-layer point embedding to aggregate local information, which facilitates convergence and boosts performance. Besides, we adopt contextual relative position encoding to adaptively capture position information. Finally, a memory-efficient implementation is introduced to overcome the issue of varying point numbers in each window. Extensive experiments demonstrate the effectiveness and superiority of our method on S3DIS, ScanNetv2 and ShapeNetPart datasets. Code is available at https://github.com/dvlab-research/Stratified-Transformer.
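The stratified key sampling idea described in the abstract (dense keys near the query, sparse keys far from it) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the helper name `stratified_keys`, the radius-based neighbourhoods, the values of `r_near` and `r_far`, and the strided subsampling used for the sparse stratum are all assumptions made for illustration. The official repository linked above should be consulted for the actual method.

```python
import numpy as np

def stratified_keys(points, query_idx, r_near=0.2, r_far=0.8, sparse_stride=4):
    """Return (dense, sparse) key indices for a single query point.

    Hypothetical helper: the radii and the stride are illustrative values,
    not settings taken from the paper.
    """
    # Euclidean distance from the query to every point in the cloud.
    dist = np.linalg.norm(points - points[query_idx], axis=1)

    # Dense stratum: every point within r_near of the query.
    dense = np.flatnonzero(dist <= r_near)

    # Sparse stratum: only every `sparse_stride`-th point is a candidate,
    # and it must lie in the ring r_near < d <= r_far.
    candidates = np.arange(0, len(points), sparse_stride)
    in_ring = (dist[candidates] > r_near) & (dist[candidates] <= r_far)
    sparse = candidates[in_ring]
    return dense, sparse

# Toy point cloud: 2048 points drawn uniformly from the unit cube.
rng = np.random.default_rng(0)
points = rng.uniform(0.0, 1.0, size=(2048, 3))

dense, sparse = stratified_keys(points, query_idx=0)
keys = np.concatenate([dense, sparse])
print(len(dense), "dense keys,", len(sparse), "sparse keys")
```

Under these assumptions, the query attends to all of its close neighbours but only to a thinned-out subset of the distant ones, so the key set covers a large region while staying much smaller than the full neighbourhood within `r_far`.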

Code

dvlab-research/stratified-transform… official

336

Pointcept/Pointcept

1,124

gofinge/pointtransformerv2

329

dvlab-research/deepvision3d

118

Tasks

Point Cloud Segmentation

Position

Semantic Segmentation

Datasets

ShapeNet

ScanNet

S3DIS

Results from the Paper

Ranked #14 on Semantic Segmentation on ScanNet

Methods

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer
