TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Open Vocabulary Semantic Segmentation	ADE20K-150	CAT-Seg	mIoU	37.9	# 1
Open Vocabulary Semantic Segmentation	ADE20K-847	CAT-Seg	mIoU	16.0	# 1
Open Vocabulary Semantic Segmentation	PASCAL Context-459	CAT-Seg	mIoU	23.8	# 2
Open Vocabulary Semantic Segmentation	PASCAL Context-59	CAT-Seg	mIoU	63.3	# 2
Open Vocabulary Semantic Segmentation	PascalVOC-20	CAT-Seg	mIoU	97.0	# 3
Open Vocabulary Semantic Segmentation	PascalVOC-20b	CAT-Seg	mIoU	82.5	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cat-seg-cost-aggregation-for-open-vocabulary/open-vocabulary-semantic-segmentation-on-2)](https://paperswithcode.com/sota/open-vocabulary-semantic-segmentation-on-2?p=cat-seg-cost-aggregation-for-open-vocabulary)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cat-seg-cost-aggregation-for-open-vocabulary/open-vocabulary-semantic-segmentation-on-3)](https://paperswithcode.com/sota/open-vocabulary-semantic-segmentation-on-3?p=cat-seg-cost-aggregation-for-open-vocabulary)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cat-seg-cost-aggregation-for-open-vocabulary/open-vocabulary-semantic-segmentation-on-9)](https://paperswithcode.com/sota/open-vocabulary-semantic-segmentation-on-9?p=cat-seg-cost-aggregation-for-open-vocabulary)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cat-seg-cost-aggregation-for-open-vocabulary/open-vocabulary-semantic-segmentation-on-7)](https://paperswithcode.com/sota/open-vocabulary-semantic-segmentation-on-7?p=cat-seg-cost-aggregation-for-open-vocabulary)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cat-seg-cost-aggregation-for-open-vocabulary/open-vocabulary-semantic-segmentation-on-1)](https://paperswithcode.com/sota/open-vocabulary-semantic-segmentation-on-1?p=cat-seg-cost-aggregation-for-open-vocabulary)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cat-seg-cost-aggregation-for-open-vocabulary/open-vocabulary-semantic-segmentation-on-5)](https://paperswithcode.com/sota/open-vocabulary-semantic-segmentation-on-5?p=cat-seg-cost-aggregation-for-open-vocabulary)`

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation

21 Mar 2023 · Seokju Cho, Heeseong Shin, Sunghwan Hong, Anurag Arnab, Paul Hongsuck Seo, Seungryong Kim ·

Open-vocabulary semantic segmentation presents the challenge of labeling each pixel within an image based on a wide range of text descriptions. In this work, we introduce a novel cost-based approach to adapt vision-language foundation models, notably CLIP, for the intricate task of semantic segmentation. Through aggregating the cosine similarity score, i.e., the cost volume between image and text embeddings, our method potently adapts CLIP for segmenting seen and unseen classes by fine-tuning its encoders, addressing the challenges faced by existing methods in handling unseen classes. Building upon this, we explore methods to effectively aggregate the cost volume considering its multi-modal nature of being established between image and text embeddings. Furthermore, we examine various methods for efficiently fine-tuning CLIP.

PDF Abstract

Code

Add Remove Mark official

KU-CVLAB/CAT-Seg official

↳ Quickstart in

Spaces

194

openrobotlab/ov_parts

blumenstiel/CAT-Seg-MESS

↳ Quickstart in

Spaces

Tasks

Add Remove

Image Segmentation

Open Vocabulary Semantic Segmentation

Segmentation

Semantic Segmentation

text similarity

Datasets

ADE20K

PASCAL Context

PASCAL VOC

Results from the Paper

Edit

Ranked #1 on Open Vocabulary Semantic Segmentation on ADE20K-150

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Open Vocabulary Semantic Segmentation	ADE20K-150	CAT-Seg	mIoU	37.9	# 1	Compare
Open Vocabulary Semantic Segmentation	ADE20K-847	CAT-Seg	mIoU	16.0	# 1	Compare
Open Vocabulary Semantic Segmentation	PASCAL Context-459	CAT-Seg	mIoU	23.8	# 2	Compare
Open Vocabulary Semantic Segmentation	PASCAL Context-59	CAT-Seg	mIoU	63.3	# 2	Compare
Open Vocabulary Semantic Segmentation	PascalVOC-20	CAT-Seg	mIoU	97.0	# 3	Compare
Open Vocabulary Semantic Segmentation	PascalVOC-20b	CAT-Seg	mIoU	82.5	# 1	Compare

Methods

Add Remove

CLIP

Edit Social Preview

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove