Self-supervised Point Cloud Representation Learning via Separating Mixed Shapes

1 Sep 2021 · Chao Sun, Zhedong Zheng, Xiaohan Wang, Mingliang Xu, Yi Yang

Manually annotating large-scale point clouds is time-consuming, and annotations are often unavailable in harsh real-world scenarios. Inspired by the success of the pre-training and fine-tuning paradigm in both vision and language tasks, we argue that pre-training is one potential solution for obtaining a scalable model for 3D point cloud downstream tasks as well. In this paper, we therefore explore a new self-supervised learning method, called Mixing and Disentangling (MD), for 3D point cloud representation learning. As the name implies, we mix two input shapes and require the model to learn to separate the inputs from the mixed shape. We use this reconstruction task as the pretext optimization objective for self-supervised learning. There are two primary advantages: 1) Compared to prevailing image datasets, e.g., ImageNet, point cloud datasets are small, so the mixing process provides a much larger online pool of training samples. 2) The disentangling process motivates the model to mine geometric prior knowledge, e.g., key points. To verify the effectiveness of the proposed pretext task, we build a baseline network composed of one encoder and one decoder. During pre-training, we mix two original shapes and obtain a geometry-aware embedding from the encoder; an instance-adaptive decoder then recovers the original shapes from the embedding. Albeit simple, the pre-trained encoder can capture the key points of an unseen point cloud and surpasses the encoder trained from scratch on downstream tasks. The proposed method improves empirical performance on both the ModelNet-40 and ShapeNet-Part datasets for point cloud classification and segmentation. We further conduct ablation studies to explore the effect of each component and verify the generalization of the proposed strategy across different backbones.
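The mix-then-separate pretext task described in the abstract can be illustrated with a small NumPy sketch. It is not taken from the official repository. The abstract does not state how two shapes are mixed or which reconstruction loss is used, so the sketch assumes that a random half of each shape is kept and that the Chamfer distance serves as the loss. The names `mix_shapes`, `chamfer_distance`, and `disentangling_loss` are hypothetical.

```python
import numpy as np


def mix_shapes(shape_a, shape_b, rng):
    """Mix two (N, 3) point clouds into a single (N, 3) cloud.

    Assumption: keep a random half of each shape and shuffle the union,
    so the mixed cloud carries no ordering hint about its two sources.
    """
    n = shape_a.shape[0]
    half = n // 2
    idx_a = rng.choice(n, size=half, replace=False)
    idx_b = rng.choice(shape_b.shape[0], size=n - half, replace=False)
    mixed = np.concatenate([shape_a[idx_a], shape_b[idx_b]], axis=0)
    return mixed[rng.permutation(n)]


def chamfer_distance(pred, target):
    """Symmetric Chamfer distance between (M, 3) and (N, 3) point sets."""
    diff = pred[:, None, :] - target[None, :, :]  # (M, N, 3)
    sq_dist = np.sum(diff ** 2, axis=-1)          # (M, N) squared distances
    # Nearest-neighbour term in each direction, averaged over points.
    return sq_dist.min(axis=1).mean() + sq_dist.min(axis=0).mean()


def disentangling_loss(pred_a, pred_b, shape_a, shape_b):
    """Pretext loss (assumed form): reconstruct both original shapes."""
    return chamfer_distance(pred_a, shape_a) + chamfer_distance(pred_b, shape_b)


rng = np.random.default_rng(0)
shape_a = rng.normal(size=(256, 3))        # stand-in for one input shape
shape_b = rng.normal(size=(256, 3)) + 5.0  # stand-in for a second shape
mixed = mix_shapes(shape_a, shape_b, rng)  # what the encoder would see
```

In a full pipeline, an encoder would embed `mixed` and a decoder would output `pred_a` and `pred_b`; here only the data side and the loss are shown. Because each pair of training shapes yields a new mixed sample, the pool of pre-training inputs grows roughly quadratically with the dataset size, which is the first advantage the abstract names.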

Code

cyysc1998/3D-Pretraining official

Tasks

3D Part Segmentation

3D Point Cloud Classification

Point Cloud Classification

Point Cloud Pre-training

Representation Learning

Self-Supervised Learning

Semantic Segmentation

Datasets

ShapeNet

ModelNet

S3DIS

Results from the Paper

Ranked #43 on 3D Part Segmentation on ShapeNet-Part

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
3D Point Cloud Classification	ModelNet40	OGNet + MD	Overall Accuracy	93.31	#56
3D Point Cloud Classification	ModelNet40	OGNet + MD	Mean Accuracy	90.71	#23
3D Point Cloud Classification	ModelNet40	DGCNN + MD	Overall Accuracy	93.39	#55
3D Point Cloud Classification	ModelNet40	DGCNN + MD	Mean Accuracy	89.88	#30
Semantic Segmentation	S3DIS	SMS	Mean IoU	51.74	#49
Semantic Segmentation	S3DIS	SMS	Number of params	N/A	#1
3D Part Segmentation	ShapeNet-Part	DGCNN + MD	Instance Average IoU	85.5	#43
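
The table reports overall accuracy, mean accuracy, and IoU-based scores, which can rank models differently when classes are imbalanced. The sketch below shows the generic definitions of these metrics on toy labels. The function names are hypothetical, and the exact evaluation protocol of each benchmark (for example, per-instance averaging on ShapeNet-Part) may differ from this simplified form.

```python
import numpy as np


def overall_accuracy(y_true, y_pred):
    """Fraction of all samples that are classified correctly."""
    return float(np.mean(y_true == y_pred))


def mean_class_accuracy(y_true, y_pred):
    """Average of per-class accuracies; each class counts equally."""
    classes = np.unique(y_true)
    per_class = [np.mean(y_pred[y_true == c] == c) for c in classes]
    return float(np.mean(per_class))


def mean_iou(y_true, y_pred, num_classes):
    """Mean intersection-over-union across classes present in the data."""
    ious = []
    for c in range(num_classes):
        inter = np.sum((y_true == c) & (y_pred == c))
        union = np.sum((y_true == c) | (y_pred == c))
        if union > 0:  # skip classes absent from both labels and predictions
            ious.append(inter / union)
    return float(np.mean(ious))


# Toy example: class 0 is frequent, class 1 is rare.
y_true = np.array([0, 0, 0, 0, 1, 1])
y_pred = np.array([0, 0, 0, 0, 1, 0])
oa = overall_accuracy(y_true, y_pred)      # 5 of 6 correct
ma = mean_class_accuracy(y_true, y_pred)   # (1.0 + 0.5) / 2
miou = mean_iou(y_true, y_pred, 2)         # (4/5 + 1/2) / 2
```

On this toy input the overall accuracy is about 0.833, the mean class accuracy is 0.75, and the mean IoU is 0.65, which illustrates why a model can rank higher on one accuracy column than on the other.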
