TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Open-Vocabulary Instance Segmentation	Replica	OpenIns3D	mAP	13.6	# 2
3D Open-Vocabulary Instance Segmentation	S3DIS	OpenIns3D	AP50 Novel B8/N4	37.0	# 1
3D Open-Vocabulary Instance Segmentation	S3DIS	OpenIns3D	AP50 Novel B6/N6	33.0	# 1
3D Open-Vocabulary Object Detection	ScanNet on unseen classes	OpenIns3D	AP25	43.7	# 1
Zero-shot 3D Point Cloud Classification	ScanNetV2	OpenIns3D	Top 1 Accuracy %	60.8	# 1
3D Open-Vocabulary Instance Segmentation	STPLS3D	OpenIns3D	AP50	13.3	# 1

OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

1 Sep 2023 · Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby

Current 3D open-vocabulary scene understanding methods mostly utilize well-aligned 2D images as the bridge to learn 3D features with language. However, applying these approaches becomes challenging in scenarios where 2D images are absent. In this work, we introduce a new pipeline, namely, OpenIns3D, which requires no 2D image inputs, for 3D open-vocabulary scene understanding at the instance level. The OpenIns3D framework employs a "Mask-Snap-Lookup" scheme. The "Mask" module learns class-agnostic mask proposals in 3D point clouds. The "Snap" module generates synthetic scene-level images at multiple scales and leverages 2D vision language models to detect objects of interest. The "Lookup" module searches through the outcomes of "Snap" with the help of Mask2Pixel maps, which contain the precise correspondence between 3D masks and synthetic images, to assign category names to the proposed masks. This 2D input-free and flexible approach achieves state-of-the-art results on a wide range of indoor and outdoor datasets by a large margin. Moreover, OpenIns3D allows for effortless switching of 2D detectors without re-training. When integrated with powerful 2D open-world models such as ODISE and GroundingDINO, it achieves excellent results on open-vocabulary instance segmentation. When integrated with LLM-powered 2D models like LISA, it demonstrates a remarkable capacity to process highly complex text queries which require intricate reasoning and world knowledge. Project page: https://zheninghuang.github.io/OpenIns3D/
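
To make the "Lookup" step described in the abstract more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a Mask2Pixel map can be represented as a 2D grid of 3D mask IDs (one grid per synthetic image, -1 for background) and that the 2D detector returns labelled, scored boxes. The names `mask_box_iou`, `lookup`, and `min_iou`, the box format, and the IoU-weighted voting rule are all hypothetical choices made for illustration.

```python
from collections import defaultdict

# Hypothetical types: a Mask2Pixel map is a 2D grid (list of rows) whose cells
# hold the ID of the 3D mask proposal visible at that pixel, or -1 for none.
# A detection is (label, score, (x0, y0, x1, y1)) with half-open pixel bounds.


def mask_box_iou(pixels, box):
    """IoU between a list of (x, y) pixels and an axis-aligned box."""
    x0, y0, x1, y1 = box
    box_area = max(0, x1 - x0) * max(0, y1 - y0)
    inside = sum(1 for x, y in pixels if x0 <= x < x1 and y0 <= y < y1)
    union = len(pixels) + box_area - inside
    return inside / union if union else 0.0


def lookup(mask2pixel_maps, detections_per_image, min_iou=0.3):
    """Assign a label to each 3D mask by voting over all synthetic images."""
    votes = defaultdict(lambda: defaultdict(float))
    for grid, detections in zip(mask2pixel_maps, detections_per_image):
        # Group pixel coordinates by the 3D mask they belong to.
        pixels_by_mask = defaultdict(list)
        for y, row in enumerate(grid):
            for x, mask_id in enumerate(row):
                if mask_id >= 0:
                    pixels_by_mask[mask_id].append((x, y))
        # Let every sufficiently overlapping detection vote for its label.
        for mask_id, pixels in pixels_by_mask.items():
            for label, score, box in detections:
                iou = mask_box_iou(pixels, box)
                if iou >= min_iou:
                    votes[mask_id][label] += iou * score
    # Masks that never matched a detection are left unlabelled (absent).
    return {m: max(v, key=v.get) for m, v in votes.items()}


# Toy example: one 4x2 synthetic image showing two 3D masks (IDs 0 and 1).
grid = [
    [0, 0, -1, 1],
    [0, 0, -1, 1],
]
dets = [("chair", 0.9, (0, 0, 2, 2)), ("lamp", 0.8, (3, 0, 4, 2))]
labels = lookup([grid], [dets])
print(labels)  # {0: 'chair', 1: 'lamp'}
```

Because the labels come only from the 2D detections, swapping the 2D detector in this sketch changes nothing but the contents of `detections_per_image`, which mirrors the abstract's point that detectors can be exchanged without re-training.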

Code

Pointcept/OpenIns3D official

Tasks

3D Open-Vocabulary Instance Segmentation

3D Open-Vocabulary Object Detection

Instance Segmentation

Open Vocabulary Object Detection

Scene Understanding

Semantic Segmentation

Zero-shot 3D Point Cloud Classification

Datasets

ScanNet

S3DIS

Replica

STPLS3D

Results from the Paper

Ranked #1 on 3D Open-Vocabulary Object Detection on ScanNet on unseen classes
