Locate then Segment: A Strong Pipeline for Referring Image Segmentation

CVPR 2021  ·  Ya Jing, Tao Kong, Wei Wang, Liang Wang, Lei Li, Tieniu Tan

Referring image segmentation aims to segment the objects referred to by a natural language expression. Previous methods usually focus on designing implicit and recurrent feature interaction mechanisms that fuse visual and linguistic features to directly generate the final segmentation mask, without explicitly modeling the localization of the referent instances. To tackle this problem, we view the task from another perspective by decoupling it into a "Locate-Then-Segment" (LTS) scheme. Given a language expression, people generally first attend to the corresponding target image regions and then generate a fine segmentation mask of the object based on its context. LTS first extracts and fuses visual and textual features to obtain a cross-modal representation, then applies cross-modal interaction on the visual-textual features to locate the referred object with a position prior, and finally generates the segmentation result with a lightweight segmentation network. Our LTS is simple but surprisingly effective. On three popular benchmark datasets, LTS outperforms all previous state-of-the-art methods by a large margin (e.g., +3.2% on RefCOCO+ and +3.4% on RefCOCOg). In addition, our model is more interpretable because it explicitly locates the object, as confirmed by visualization experiments. We believe this framework is promising as a strong baseline for referring image segmentation.

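To make the three-stage pipeline described in the abstract concrete, here is a minimal PyTorch sketch of the Locate-Then-Segment idea: fuse visual and textual features into a cross-modal representation, predict a coarse position prior for the referent, and refine it into a mask with a lightweight head. All module names (LocateThenSegment, txt_proj, fuse, locate, segment), layer choices, and tensor shapes are hypothetical simplifications for illustration, not the paper's actual architecture.

```python
# Illustrative sketch only; the real LTS uses more elaborate encoders and heads.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LocateThenSegment(nn.Module):
    def __init__(self, vis_dim=256, txt_dim=256):
        super().__init__()
        # Cross-modal fusion: project the sentence embedding, tile it over the
        # spatial grid, and mix with the visual features via a 1x1 convolution.
        self.txt_proj = nn.Linear(txt_dim, vis_dim)
        self.fuse = nn.Conv2d(vis_dim * 2, vis_dim, kernel_size=1)
        # "Locate" step: predict a one-channel heatmap as the position prior.
        self.locate = nn.Conv2d(vis_dim, 1, kernel_size=1)
        # "Segment" step: a lightweight head conditioned on the prior produces
        # the final mask logits.
        self.segment = nn.Sequential(
            nn.Conv2d(vis_dim + 1, vis_dim, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(vis_dim, 1, kernel_size=1),
        )

    def forward(self, vis_feat, txt_feat):
        # vis_feat: (B, C, H, W) image features; txt_feat: (B, D) sentence embedding.
        B, C, H, W = vis_feat.shape
        txt = self.txt_proj(txt_feat).view(B, C, 1, 1).expand(B, C, H, W)
        fused = F.relu(self.fuse(torch.cat([vis_feat, txt], dim=1)))
        prior = torch.sigmoid(self.locate(fused))           # coarse localization map
        mask = self.segment(torch.cat([fused, prior], dim=1))  # fine mask logits
        return mask, prior

# Quick shape check with dummy features.
model = LocateThenSegment()
vis = torch.randn(2, 256, 30, 30)   # dummy visual features
txt = torch.randn(2, 256)           # dummy language embedding
mask, prior = model(vis, txt)
print(mask.shape, prior.shape)      # torch.Size([2, 1, 30, 30]) for both
```

Returning the position prior alongside the mask mirrors the interpretability claim: the intermediate localization map can be visualized directly.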
Results

Task: Generalized Referring Expression Segmentation  ·  Dataset: gRefCOCO  ·  Model: LTS

Metric   Value   Global Rank
gIoU     52.70   #5
cIoU     52.30   #5
