TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Zero-Shot Transfer 3D Point Cloud Classification	ModelNet10	PointCLIP V2	Accuracy (%)	73.13	# 2
Training-free 3D Point Cloud Classification	ModelNet40	PointCLIP V2	Accuracy (%)	64.2	# 2
Training-free 3D Point Cloud Classification	ModelNet40	PointCLIP V2	Need 3D Data?	No	# 1
Zero-Shot Transfer 3D Point Cloud Classification	ModelNet40	PointCLIP V2	Accuracy (%)	64.22	# 6
Zero-shot 3D Point Cloud Classification	ScanNetV2	PointCLIP V2	Top 1 Accuracy %	11.0	# 7
Training-free 3D Point Cloud Classification	ScanObjectNN	PointCLIP V2	Accuracy (%)	35.4	# 2
Training-free 3D Point Cloud Classification	ScanObjectNN	PointCLIP V2	Need 3D Data?	No	# 1
Zero-Shot Transfer 3D Point Cloud Classification	ScanObjectNN	PointCLIP V2	PB_T50_RS Accuracy (%)	35.36	# 1
Zero-Shot Transfer 3D Point Cloud Classification	ScanObjectNN	PointCLIP V2	OBJ_BG Accuracy(%)	41.22	# 1
Zero-Shot Transfer 3D Point Cloud Classification	ScanObjectNN	PointCLIP V2	OBJ_ONLY Accuracy(%)	50.09	# 5
Training-free 3D Part Segmentation	ShapeNet-Part	PointCLIP V2	mIoU	48.4	# 2
Training-free 3D Part Segmentation	ShapeNet-Part	PointCLIP V2	Need 3D Data?	No	# 1
3D Open-Vocabulary Instance Segmentation	STPLS3D	PointCLIPV2	AP50	3.1	# 2

PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning

ICCV 2023 · Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Ziyao Zeng, Zipeng Qin, Shanghang Zhang, Peng Gao

Large-scale pre-trained models have shown promising open-world performance on both vision and language tasks. However, their transfer to 3D point clouds remains limited and has so far been confined to classification. In this paper, we first combine CLIP and GPT into a unified 3D open-world learner, named PointCLIP V2, which fully unleashes their potential for zero-shot 3D classification, segmentation, and detection. To better align 3D data with the pre-trained language knowledge, PointCLIP V2 contains two key designs. On the visual side, we prompt CLIP via a shape projection module that generates more realistic depth maps, narrowing the domain gap between projected point clouds and natural images. On the textual side, we prompt the GPT model to generate 3D-specific text as the input to CLIP's textual encoder. Without any training in 3D domains, our approach surpasses PointCLIP by +42.90%, +40.44%, and +28.75% accuracy on three datasets for zero-shot 3D classification. In addition, PointCLIP V2 can be extended to few-shot 3D classification, zero-shot 3D part segmentation, and 3D object detection in a simple manner, demonstrating its generalization ability for unified 3D open-world learning.
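
The zero-shot classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each projected depth map has already been encoded into a feature vector by CLIP's image encoder and that each class's GPT-generated description has already been encoded by CLIP's text encoder. The function names, the uniform default view weights, and the temperature of 100 are assumptions made for this illustration, and the toy three-dimensional vectors stand in for real CLIP features.

```python
import math


def l2_normalize(vec):
    # Scale a non-zero vector to unit length so dot products become cosine similarities.
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec]


def cosine(a, b):
    # Cosine similarity between two non-zero feature vectors.
    return sum(x * y for x, y in zip(l2_normalize(a), l2_normalize(b)))


def softmax(scores):
    # Numerically stable softmax over a list of logits.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def zero_shot_classify(view_features, text_features, view_weights=None, temperature=100.0):
    """Pick the class whose text feature best matches the multi-view features.

    view_features: one feature vector per projected depth map (hypothetical input).
    text_features: dict mapping class name to its text feature vector (hypothetical input).
    view_weights:  optional per-view weights; uniform if omitted (an assumption).
    temperature:   logit scale applied before the softmax (an assumption).
    """
    if view_weights is None:
        view_weights = [1.0] * len(view_features)
    names = list(text_features)
    weight_sum = sum(view_weights)
    logits = []
    for name in names:
        # Weighted average of the per-view cosine similarities for this class.
        score = sum(
            w * cosine(feat, text_features[name])
            for w, feat in zip(view_weights, view_features)
        ) / weight_sum
        logits.append(temperature * score)
    probs = softmax(logits)
    best = max(range(len(names)), key=lambda i: probs[i])
    return names[best], dict(zip(names, probs))


# Toy example: two views of one object and two candidate classes.
views = [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]]
texts = {"chair": [1.0, 0.0, 0.0], "lamp": [0.0, 1.0, 0.0]}
label, probs = zero_shot_classify(views, texts)
print(label, probs)
```

No 3D training data enters this step: the prediction depends only on how close the view features are to each class's text feature, which is why better depth maps and better text prompts can both improve accuracy.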

Code

yangyangyang127/pointclip_v2 (official) · 197 stars

zrrskywalker/pointclip · 291 stars

Tasks

3D Classification

3D Object Detection

3D Open-Vocabulary Instance Segmentation

3D Part Segmentation

Classification

Descriptive

object-detection

Object Detection

Open Vocabulary Object Detection

Training-free 3D Part Segmentation

Training-free 3D Point Cloud Classification

Zero-shot 3D classification

Zero-shot 3D Point Cloud Classification

Zero-Shot Transfer 3D Point Cloud Classification

Datasets

ShapeNet

ScanNet

ModelNet

ScanObjectNN

STPLS3D

Results from the Paper

Ranked #2 on 3D Open-Vocabulary Instance Segmentation on STPLS3D

Methods

Adam • ALIGN • Attention Dropout • BPE • CLIP • Cosine Annealing • Dense Connections • Discriminative Fine-Tuning • Dropout • GELU • GPT • Layer Normalization • Linear Layer • Linear Warmup With Cosine Annealing • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Weight Decay
