TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Visual Dialog	VisDial v0.9 val	CAG	MRR	0.6756	# 12
Visual Dialog	VisDial v0.9 val	CAG	Mean Rank	3.75	# 3
Visual Dialog	VisDial v0.9 val	CAG	R@1	54.64	# 4
Visual Dialog	VisDial v0.9 val	CAG	R@10	91.48	# 2
Visual Dialog	VisDial v0.9 val	CAG	R@5	83.72	# 3
Visual Dialog	Visual Dialog v1.0 test-std	CAG	NDCG (x 100)	56.64	# 60
Visual Dialog	Visual Dialog v1.0 test-std	CAG	MRR (x 100)	63.49	# 28
Visual Dialog	Visual Dialog v1.0 test-std	CAG	R@1	49.85	# 28
Visual Dialog	Visual Dialog v1.0 test-std	CAG	R@5	80.63	# 26
Visual Dialog	Visual Dialog v1.0 test-std	CAG	R@10	90.15	# 20
Visual Dialog	Visual Dialog v1.0 test-std	CAG	Mean	4.11	# 56

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/iterative-context-aware-graph-inference-for/visual-dialog-on-visdial-v09-val)](https://paperswithcode.com/sota/visual-dialog-on-visdial-v09-val?p=iterative-context-aware-graph-inference-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/iterative-context-aware-graph-inference-for/visual-dialog-on-visual-dialog-v1-0-test-std)](https://paperswithcode.com/sota/visual-dialog-on-visual-dialog-v1-0-test-std?p=iterative-context-aware-graph-inference-for)`

Iterative Context-Aware Graph Inference for Visual Dialog

CVPR 2020 · Dan Guo, Hui Wang, Hanwang Zhang, Zheng-Jun Zha, Meng Wang ·

Visual dialog is a challenging task that requires the comprehension of the semantic dependencies among implicit visual and textual contexts. This task can refer to the relation inference in a graphical model with sparse contexts and unknown graph structure (relation descriptor), and how to model the underlying context-aware relation inference is critical. To this end, we propose a novel Context-Aware Graph (CAG) neural network. Each node in the graph corresponds to a joint semantic feature, including both object-based (visual) and history-related (textual) context representations. The graph structure (relations in dialog) is iteratively updated using an adaptive top-$K$ message passing mechanism. Specifically, in every message passing step, each node selects the most $K$ relevant nodes, and only receives messages from them. Then, after the update, we impose graph attention on all the nodes to get the final graph embedding and infer the answer. In CAG, each node has dynamic relations in the graph (different related $K$ neighbor nodes), and only the most relevant nodes are attributive to the context-aware relational graph inference. Experimental results on VisDial v0.9 and v1.0 datasets show that CAG outperforms comparative methods. Visualization results further validate the interpretability of our method.

PDF Abstract CVPR 2020 PDF CVPR 2020 Abstract

Code

Add Remove Mark official

wh0330/CAG_VisDial

Tasks

Add Remove

Graph Attention

Graph Embedding

Relation

Visual Dialog

Datasets

VisDial

Results from the Paper

Edit

Ranked #12 on Visual Dialog on VisDial v0.9 val

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Visual Dialog	VisDial v0.9 val	CAG	MRR	0.6756	# 12	Compare
			Mean Rank	3.75	# 3	Compare
			R@1	54.64	# 4	Compare
			R@10	91.48	# 2	Compare
			R@5	83.72	# 3	Compare
Visual Dialog	Visual Dialog v1.0 test-std	CAG	NDCG (x 100)	56.64	# 60	Compare
			MRR (x 100)	63.49	# 28	Compare
			R@1	49.85	# 28	Compare
			R@5	80.63	# 26	Compare
			R@10	90.15	# 20	Compare
			Mean	4.11	# 56	Compare

Methods

Add Remove

Interpretability

Edit Social Preview

Iterative Context-Aware Graph Inference for Visual Dialog

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove