ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding

The recognition capabilities of current state-of-the-art 3D models are limited by datasets that offer only a small amount of annotated data and a pre-defined set of categories. In the 2D domain, recent advances have shown that similar problems can be significantly alleviated by employing knowledge from other modalities, such as language. Inspired by this, leveraging multimodal information for the 3D modality is a promising way to improve 3D understanding under restricted data regimes, but this line of research is not well studied. Therefore, we introduce ULIP to learn a unified representation of images, texts, and 3D point clouds by pre-training with object triplets from the three modalities. To overcome the shortage of training triplets, ULIP leverages a pre-trained vision-language model that has already learned a common visual and textual space by training with massive image-text pairs. Then, ULIP learns a 3D representation space aligned with the common image-text space, using a small number of automatically synthesized triplets. ULIP is agnostic to 3D backbone networks and can easily be integrated into any 3D architecture. Experiments show that ULIP effectively improves the performance of multiple recent 3D backbones by simply pre-training them on ShapeNet55 using our framework, achieving state-of-the-art performance in both standard 3D classification and zero-shot 3D classification on ModelNet40 and ScanObjectNN. ULIP also improves the performance of PointMLP by around 3% in 3D classification on ScanObjectNN, and outperforms PointCLIP by 28.8% on top-1 accuracy for zero-shot 3D classification on ModelNet40. Our code and pre-trained models are released at https://github.com/salesforce/ULIP.
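The alignment objective described above can be sketched as a symmetric InfoNCE-style contrastive loss that pulls each object's 3D embedding toward its paired image and text embeddings in the shared space. Below is a minimal NumPy sketch; the batch size, feature dimension, temperature, and the stand-in random features are illustrative assumptions, not the paper's exact configuration.

```python
import numpy as np

def normalize(x):
    # L2-normalize each row so dot products become cosine similarities
    return x / np.linalg.norm(x, axis=1, keepdims=True)

def cross_modal_nce(a, b, temperature=0.07):
    """Symmetric InfoNCE-style loss aligning two batches of embeddings.

    a, b: (N, D) arrays; row i of `a` and row i of `b` form a positive pair
    (e.g. the 3D and image embeddings of the same object).
    """
    a, b = normalize(a), normalize(b)
    logits = a @ b.T / temperature      # (N, N) pairwise similarity matrix
    targets = np.arange(len(a))         # positives lie on the diagonal

    def nll(l):
        # numerically stable log-softmax over rows, then pick the diagonal
        l = l - l.max(axis=1, keepdims=True)
        logp = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -logp[targets, targets].mean()

    # average the a->b and b->a directions
    return 0.5 * (nll(logits) + nll(logits.T))

# ULIP-style objective: align a trainable 3D encoder's output with features
# from the pre-trained image and text encoders (random stand-ins here).
rng = np.random.default_rng(0)
f_3d   = rng.normal(size=(4, 8))   # hypothetical 3D encoder output
f_img  = rng.normal(size=(4, 8))   # hypothetical frozen image encoder output
f_text = rng.normal(size=(4, 8))   # hypothetical frozen text encoder output
loss = cross_modal_nce(f_3d, f_img) + cross_modal_nce(f_3d, f_text)
```

In ULIP the image-text space comes from a pre-trained vision-language model, so only the 3D encoder needs to be optimized against this loss, which is why a small number of synthesized triplets suffices.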

CVPR 2023

Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Training-free 3D Point Cloud Classification | ModelNet40 | ULIP | Accuracy (%) | 60.4 | #3 |
| Training-free 3D Point Cloud Classification | ModelNet40 | ULIP | Need 3D Data? | Yes | #1 |
| 3D Point Cloud Classification | ModelNet40 | ULIP + PointMLP | Overall Accuracy | 94.7 | #7 |
| 3D Point Cloud Classification | ModelNet40 | ULIP + PointMLP | Mean Accuracy | 92.4 | #2 |
| Zero-Shot Transfer 3D Point Cloud Classification | ModelNet40 | ULIP + PointMLP | Accuracy (%) | 61.5 | #8 |
| Zero-Shot Transfer 3D Point Cloud Classification | ModelNet40 | ULIP + PointBERT | Accuracy (%) | 60.4 | #9 |
| 3D Point Cloud Classification | ModelNet40 | ULIP + PointNet++(ssg) | Overall Accuracy | 93.4 | #53 |
| 3D Point Cloud Classification | ModelNet40 | ULIP + PointNet++(ssg) | Mean Accuracy | 91.2 | #15 |
| 3D Point Cloud Classification | ModelNet40 | ULIP + PointBERT | Overall Accuracy | 94.1 | #19 |
| 3D Point Cloud Classification | ScanObjectNN | ULIP + PointMLP | Overall Accuracy | 89.4 | #17 |
| 3D Point Cloud Classification | ScanObjectNN | ULIP + PointMLP | Mean Accuracy | 88.5 | #6 |
| 3D Point Cloud Classification | ScanObjectNN | ULIP + PointNeXt | Overall Accuracy | 89.7 | #15 |
| 3D Point Cloud Classification | ScanObjectNN | ULIP + PointNeXt | Mean Accuracy | 88.6 | #5 |
| 3D Point Cloud Classification | ScanObjectNN | ULIP + PointNeXt | Number of params | 1.4M | #51 |
| 3D Point Cloud Classification | ScanObjectNN | ULIP + PointBERT | Overall Accuracy | 86.4 | #36 |
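The zero-shot results above rely on the aligned representation space: category names are encoded as text (e.g. via prompts such as "a point cloud of a {name}"), and a point cloud is assigned to the class whose text embedding has the highest cosine similarity with its 3D embedding. A minimal NumPy sketch, where the class names, prompt wording, and stand-in features are illustrative assumptions:

```python
import numpy as np

def zero_shot_classify(point_feat, text_feats, class_names):
    """Pick the class whose text embedding is closest (cosine) to the 3D feature."""
    p = point_feat / np.linalg.norm(point_feat)
    t = text_feats / np.linalg.norm(text_feats, axis=1, keepdims=True)
    sims = t @ p                              # one cosine similarity per class
    return class_names[int(np.argmax(sims))]

# Hypothetical stand-ins: in practice these would come from the 3D encoder
# and the pre-trained text encoder applied to per-class prompts.
class_names = ["airplane", "chair", "lamp"]
text_feats = np.eye(3, 8)                     # pretend text embeddings, one per class
point_feat = text_feats[1] + 0.1 * np.ones(8) # a 3D feature near the "chair" embedding
pred = zero_shot_classify(point_feat, text_feats, class_names)
# pred == "chair": the perturbed feature stays nearest its source embedding
```

Because no 3D labels enter this procedure, any category with a name can be scored, which is what the zero-shot transfer rows in the table measure.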

Methods


No methods listed for this paper.