TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Situation Recognition	imSitu	Kernel GraphNet	Top-1 Verb	43.27	# 4
Situation Recognition	imSitu	Kernel GraphNet	Top-1 Verb & Value	35.41	# 2
Situation Recognition	imSitu	Kernel GraphNet	Top-5 Verbs	68.72	# 5
Situation Recognition	imSitu	Kernel GraphNet	Top-5 Verbs & Value	55.62	# 4
Grounded Situation Recognition	SWiG	Kernel GraphNet	Top-1 Verb	43.27	# 4
Grounded Situation Recognition	SWiG	Kernel GraphNet	Top-1 Verb & Value	35.41	# 3
Grounded Situation Recognition	SWiG	Kernel GraphNet	Top-5 Verbs	68.72	# 5
Grounded Situation Recognition	SWiG	Kernel GraphNet	Top-5 Verbs & Value	55.62	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mixture-kernel-graph-attention-network-for/situation-recognition-on-imsitu)](https://paperswithcode.com/sota/situation-recognition-on-imsitu?p=mixture-kernel-graph-attention-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mixture-kernel-graph-attention-network-for/grounded-situation-recognition-on-swig)](https://paperswithcode.com/sota/grounded-situation-recognition-on-swig?p=mixture-kernel-graph-attention-network-for)`

Mixture-Kernel Graph Attention Network for Situation Recognition

ICCV 2019 · Mohammed Suhail, Leonid Sigal

Understanding images beyond salient actions involves reasoning about scene context, objects, and the roles they play in the captured event. Situation recognition has recently been introduced as the task of jointly reasoning about the verbs (actions) and a set of semantic-role and entity (noun) pairs in the form of action frames. Labeling an image with an action frame requires an assignment of values (nouns) to the roles based on the observed image content. Among the inherent challenges are the rich conditional structured dependencies between the output role assignments and the overall semantic sparsity. In this paper, we propose a novel mixture-kernel attention graph neural network (GNN) architecture designed to address these challenges. Our GNN enables a dynamic graph structure during training and inference, through the use of a graph attention mechanism, and context-aware interactions between role pairs. We illustrate the efficacy of our model and design choices by conducting experiments on the imSitu benchmark dataset, with accuracy improvements of up to 10% over the state of the art.
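
To make the abstract's description more concrete, here is a minimal NumPy sketch of one message-passing step in which role nodes attend to one another and the attention matrix is a context-weighted mixture of several kernels. This is an illustration under stated assumptions, not the authors' implementation: the function name `mixture_attention_step`, the weight shapes, the dot-product scoring, and the residual update are all hypothetical choices made for the example.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax; -inf entries become exactly 0.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def mixture_attention_step(h, context, Wq, Wk, Wv, Wc):
    """One attention-based message-passing step over role nodes (illustrative).

    h       : (R, D) hidden state per role node, R >= 2
    context : (C,)   global context vector (e.g. image/verb features; assumed)
    Wq, Wk  : (K, D, D) query/key projections, one pair per kernel (assumed)
    Wv      : (D, D) value projection (assumed)
    Wc      : (C, K) maps context to mixture logits (assumed)
    """
    R, D = h.shape
    q = np.einsum("rd,kde->kre", h, Wq)                     # (K, R, D)
    k = np.einsum("rd,kde->kre", h, Wk)                     # (K, R, D)
    scores = np.einsum("kre,kse->krs", q, k) / np.sqrt(D)   # (K, R, R)

    # Exclude self-edges so each role only receives messages from other roles.
    self_edges = np.eye(R, dtype=bool)
    scores = np.where(self_edges[None], -np.inf, scores)
    attn = softmax(scores, axis=-1)                         # each row sums to 1

    # Context-dependent mixture weights over the K kernels.
    pi = softmax(context @ Wc)                              # (K,)
    adjacency = np.einsum("k,krs->rs", pi, attn)            # (R, R)

    # Aggregate neighbour messages and apply a residual update.
    messages = adjacency @ (h @ Wv)
    return h + messages, adjacency

rng = np.random.default_rng(0)
R, D, C, K = 3, 8, 5, 4
h = rng.normal(size=(R, D))
context = rng.normal(size=C)
Wq = rng.normal(size=(K, D, D))
Wk = rng.normal(size=(K, D, D))
Wv = rng.normal(size=(D, D))
Wc = rng.normal(size=(C, K))
new_h, adjacency = mixture_attention_step(h, context, Wq, Wk, Wv, Wc)
```

Because the edge weights in `adjacency` are recomputed from the node states and the context on every call, the graph structure changes per input rather than being fixed in advance, which is the sense of "dynamic graph structure" this sketch tries to convey.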

Code

No code implementations yet.

Tasks

Graph Attention

Grounded Situation Recognition

Situation Recognition

Datasets

FrameNet

Results from the Paper

Ranked #4 on Grounded Situation Recognition on SWiG
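
The verb metrics reported for this paper (Top-1 Verb, Top-5 Verbs) are top-k accuracies: a prediction counts as correct when the gold verb appears among the k highest-scoring verbs. A minimal sketch of that computation follows, assuming a score matrix with one row per image and one column per verb. The function name `topk_verb_accuracy` and the toy data are hypothetical, and the "Verb & Value" metrics, which additionally score role-noun assignments, are not covered here.

```python
import numpy as np

def topk_verb_accuracy(scores, gold, k):
    """Percentage of images whose gold verb is among the k top-scoring verbs.

    scores : (N, V) array of verb scores per image
    gold   : (N,)   array of gold verb indices
    """
    # Indices of the k highest-scoring verbs for each image.
    topk = np.argsort(-scores, axis=1)[:, :k]
    hits = (topk == gold[:, None]).any(axis=1)
    return 100.0 * hits.mean()

# Toy example: 3 images, 4 candidate verbs.
scores = np.array([
    [0.1, 0.7, 0.2, 0.0],
    [0.5, 0.1, 0.3, 0.1],
    [0.2, 0.2, 0.5, 0.1],
])
gold = np.array([1, 2, 3])

top1 = topk_verb_accuracy(scores, gold, k=1)
top2 = topk_verb_accuracy(scores, gold, k=2)
```

In the toy data only the first image has its gold verb ranked first, while the second image's gold verb is ranked second, so `top2` is higher than `top1`.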

Methods

Graph Neural Network
