Let Images Give You More: Point Cloud Cross-Modal Training for Shape Analysis

9 Oct 2022 · Xu Yan, Heshen Zhan, Chaoda Zheng, Jiantao Gao, Ruimao Zhang, Shuguang Cui, Zhen Li

Although recent point cloud analysis has achieved impressive progress, the paradigm of learning representations from a single modality is gradually reaching its bottleneck. In this work, we take a step towards more discriminative 3D point cloud representations by fully taking advantage of images, which inherently contain richer appearance information, e.g., texture, color, and shading. Specifically, this paper introduces a simple but effective point cloud cross-modality training (PointCMT) strategy, which utilizes view images, i.e., rendered or projected 2D images of the 3D object, to boost point cloud analysis. In practice, to effectively acquire auxiliary knowledge from view images, we develop a teacher-student framework and formulate cross-modal learning as a knowledge distillation problem. PointCMT eliminates the distribution discrepancy between different modalities through novel feature and classifier enhancement criteria, and effectively avoids potential negative transfer. Note that PointCMT improves the point-only representation without any architecture modification. Extensive experiments verify significant gains on various datasets with appealing backbones: equipped with PointCMT, PointNet++ and PointMLP achieve state-of-the-art performance on two benchmarks, i.e., 94.4% and 86.7% accuracy on ModelNet40 and ScanObjectNN, respectively. Code will be made available at https://github.com/ZhanHeshen/PointCMT.
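To make the distillation formulation concrete, below is a minimal PyTorch-style sketch of the kind of objective the abstract describes: a supervised loss on the point cloud student plus a feature-enhancement term (aligning point features with image-teacher features) and a classifier-enhancement term (matching softened class distributions). All function and variable names, the specific loss choices (MSE, KL divergence), and the default weights are illustrative assumptions, not the authors' implementation; refer to the linked repository for the actual code.

```python
# Minimal sketch of a cross-modal teacher-student training step in the spirit
# of PointCMT. Everything here (names, loss choices, default weights) is an
# illustrative assumption, not the authors' implementation; see
# https://github.com/ZhanHeshen/PointCMT for the official code.
import torch
import torch.nn.functional as F

def cross_modal_distillation_loss(point_feat, point_logits,
                                  image_feat, image_logits,
                                  labels, temperature=4.0,
                                  w_feat=1.0, w_cls=1.0):
    """Supervised loss on the point branch plus two distillation terms:
    feature enhancement (align point/image embeddings, assumed to share a
    common dimension) and classifier enhancement (match softened logits)."""
    # Standard cross-entropy supervision for the point cloud student.
    ce = F.cross_entropy(point_logits, labels)

    # Feature enhancement: pull point features toward the image teacher's
    # features; .detach() blocks gradients from flowing into the teacher.
    feat_loss = F.mse_loss(point_feat, image_feat.detach())

    # Classifier enhancement: KL divergence between temperature-softened
    # class distributions, scaled by T^2 as in standard distillation.
    cls_loss = F.kl_div(
        F.log_softmax(point_logits / temperature, dim=1),
        F.softmax(image_logits.detach() / temperature, dim=1),
        reduction="batchmean",
    ) * temperature ** 2

    return ce + w_feat * feat_loss + w_cls * cls_loss


# Toy usage with random tensors (batch of 8, 64-dim features, 40 classes).
if __name__ == "__main__":
    pf, imf = torch.randn(8, 64), torch.randn(8, 64)
    pl, iml = torch.randn(8, 40), torch.randn(8, 40)
    y = torch.randint(0, 40, (8,))
    print(cross_modal_distillation_loss(pf, pl, imf, iml, y))
```

Since the teacher is image-based while the student consumes only points, terms like these can be dropped at inference time, which is consistent with the claim that the point-only representation improves without architecture modification.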

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| 3D Point Cloud Classification | ModelNet40 | PointNet2+PointCMT | Overall Accuracy | 94.4 | #12 |
| | | | Mean Accuracy | 91.2 | #15 |
| | | | Number of params | 1.62M | #91 |
| 3D Point Cloud Classification | ScanObjectNN | PointCMT | Overall Accuracy | 86.7 | #32 |
| | | | Mean Accuracy | 84.8 | #16 |
| | | | Number of params | 12.6M | #60 |
