TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Referring Expression Segmentation	RefCOCO testA	LANG2SEG	Overall IoU	61.77	# 21
Referring Expression Segmentation	RefCOCO testB	LANG2SEG	Overall IoU	53.81	# 19
Referring Expression Segmentation	RefCoCo val	LANG2SEG	Overall IoU	58.90	# 21

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/referring-expression-object-segmentation-with/referring-expression-segmentation-on-refcoco-2)](https://paperswithcode.com/sota/referring-expression-segmentation-on-refcoco-2?p=referring-expression-object-segmentation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/referring-expression-object-segmentation-with/referring-expression-segmentation-on-refcoco-1)](https://paperswithcode.com/sota/referring-expression-segmentation-on-refcoco-1?p=referring-expression-object-segmentation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/referring-expression-object-segmentation-with/referring-expression-segmentation-on-refcoco)](https://paperswithcode.com/sota/referring-expression-segmentation-on-refcoco?p=referring-expression-object-segmentation-with)`

Referring Expression Object Segmentation with Caption-Aware Consistency

10 Oct 2019 · Yi-Wen Chen, Yi-Hsuan Tsai, Tiantian Wang, Yen-Yu Lin, Ming-Hsuan Yang ·

Referring expressions are natural language descriptions that identify a particular object within a scene and are widely used in our daily conversations. In this work, we focus on segmenting the object in an image specified by a referring expression. To this end, we propose an end-to-end trainable comprehension network that consists of the language and visual encoders to extract feature representations from both domains. We introduce the spatial-aware dynamic filters to transfer knowledge from text to image, and effectively capture the spatial information of the specified object. To better communicate between the language and visual modules, we employ a caption generation network that takes features shared across both domains as input, and improves both representations via a consistency that enforces the generated sentence to be similar to the given referring expression. We evaluate the proposed framework on two referring expression datasets and show that our method performs favorably against the state-of-the-art algorithms.

PDF Abstract

Code

Add Remove Mark official

wenz116/lang2seg official

Tasks

Add Remove

Caption Generation

Object

Referring Expression

Referring Expression Segmentation

Semantic Segmentation

Sentence

Datasets

MS COCO

RefCOCO

Results from the Paper

Edit

Ranked #19 on Referring Expression Segmentation on RefCOCO testB

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Referring Expression Segmentation	RefCOCO testA	LANG2SEG	Overall IoU	61.77	# 21	Compare
Referring Expression Segmentation	RefCOCO testB	LANG2SEG	Overall IoU	53.81	# 19	Compare
Referring Expression Segmentation	RefCoCo val	LANG2SEG	Overall IoU	58.90	# 21	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Referring Expression Object Segmentation with Caption-Aware Consistency

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove