TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Referring Expression Segmentation	A2D Sentences	PRPE	Precision@0.5	0.634	# 15
Referring Expression Segmentation	A2D Sentences	PRPE	Precision@0.9	0.083	# 16
Referring Expression Segmentation	A2D Sentences	PRPE	IoU overall	0.661	# 14
Referring Expression Segmentation	A2D Sentences	PRPE	IoU mean	0.529	# 17
Referring Expression Segmentation	A2D Sentences	PRPE	Precision@0.6	0.579	# 15
Referring Expression Segmentation	A2D Sentences	PRPE	Precision@0.7	0.483	# 16
Referring Expression Segmentation	A2D Sentences	PRPE	Precision@0.8	0.322	# 15
Referring Expression Segmentation	A2D Sentences	PRPE	AP	0.388	# 14
Referring Expression Segmentation	J-HMDB	PRPE	Precision@0.5	0.572	# 21
Referring Expression Segmentation	J-HMDB	PRPE	Precision@0.6	0.690	# 9
Referring Expression Segmentation	J-HMDB	PRPE	Precision@0.7	0.319	# 14
Referring Expression Segmentation	J-HMDB	PRPE	Precision@0.8	0.06	# 13
Referring Expression Segmentation	J-HMDB	PRPE	Precision@0.9	0.001	# 5
Referring Expression Segmentation	J-HMDB	PRPE	AP	0.294	# 11

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/polar-relative-positional-encoding-for-video/referring-expression-segmentation-on-j-hmdb)](https://paperswithcode.com/sota/referring-expression-segmentation-on-j-hmdb?p=polar-relative-positional-encoding-for-video)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/polar-relative-positional-encoding-for-video/referring-expression-segmentation-on-a2d)](https://paperswithcode.com/sota/referring-expression-segmentation-on-a2d?p=polar-relative-positional-encoding-for-video)`

Polar Relative Positional Encoding for Video-Language Segmentation

20 Jul 2020 · Ke Ning, Lingxi Xie, Fei Wu, Qi Tian ·

In this paper, we tackle a challenging task named video-language segmentation. Given a video and a sentence in natural language, the goal is to segment the object or actor described by the sentence in video frames. To accurately denote a target object, the given sentence usually refers to multiple attributes, such as nearby objects with spatial relations, etc. In this paper, we propose a novel Polar Relative Positional Encoding (PRPE) mechanism that represents spatial relations in a ``linguistic'' way, i.e., in terms of direction and range. Sentence feature can interact with positional embeddings in a more direct way to extract the implied relative positional relations. We also propose parameterized functions for these positional embeddings to adapt real-value directions and ranges. With PRPE, we design a Polar Attention Module (PAM) as the basic module for vision-language fusion. Our method outperforms previous best method by a large margin of 11.4% absolute improvement in terms of mAP on the challenging A2D Sentences dataset. Our method also achieves competitive performances on the J-HMDB Sentences dataset.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Referring Expression Segmentation

Sentence

Datasets

Visual Question Answering

JHMDB

A2D

A2D Sentences

Results from the Paper

Add Remove

Ranked #11 on Referring Expression Segmentation on J-HMDB

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Referring Expression Segmentation	A2D Sentences	PRPE	Precision@0.5	0.634	# 15	Compare
			Precision@0.9	0.083	# 16	Compare
			IoU overall	0.661	# 14	Compare
			IoU mean	0.529	# 17	Compare
			Precision@0.6	0.579	# 15	Compare
			Precision@0.7	0.483	# 16	Compare
			Precision@0.8	0.322	# 15	Compare
			AP	0.388	# 14	Compare
Referring Expression Segmentation	J-HMDB	PRPE	Precision@0.5	0.572	# 21	Compare
			Precision@0.6	0.690	# 9	Compare
			Precision@0.7	0.319	# 14	Compare
			Precision@0.8	0.06	# 13	Compare
			Precision@0.9	0.001	# 5	Compare
			AP	0.294	# 11	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Polar Relative Positional Encoding for Video-Language Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove