TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Weakly Supervised Object Detection	Charades	R*CNN	MAP	0.99	# 5
Human-Object Interaction Detection	HICO	R*CNN	mAP	28.5	# 8
Weakly Supervised Object Detection	HICO-DET	R*CNN	MAP	2.15	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/contextual-action-recognition-with-rcnn/weakly-supervised-object-detection-on-hico)](https://paperswithcode.com/sota/weakly-supervised-object-detection-on-hico?p=contextual-action-recognition-with-rcnn)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/contextual-action-recognition-with-rcnn/weakly-supervised-object-detection-on-4)](https://paperswithcode.com/sota/weakly-supervised-object-detection-on-4?p=contextual-action-recognition-with-rcnn)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/contextual-action-recognition-with-rcnn/human-object-interaction-detection-on-hico-1)](https://paperswithcode.com/sota/human-object-interaction-detection-on-hico-1?p=contextual-action-recognition-with-rcnn)`

Contextual Action Recognition with R*CNN

ICCV 2015 · Georgia Gkioxari, Ross Girshick, Jitendra Malik ·

There are multiple cues in an image which reveal what action a person is performing. For example, a jogger has a pose that is characteristic for jogging, but the scene (e.g. road, trail) and the presence of other joggers can be an additional source of information. In this work, we exploit the simple observation that actions are accompanied by contextual cues to build a strong action recognition system. We adapt RCNN to use more than one region for classification while still maintaining the ability to localize the action. We call our system R*CNN. The action-specific models and the feature maps are trained jointly, allowing for action specific representations to emerge. R*CNN achieves 90.2% mean AP on the PASAL VOC Action dataset, outperforming all other approaches in the field by a significant margin. Last, we show that R*CNN is not limited to action recognition. In particular, R*CNN can also be used to tackle fine-grained tasks such as attribute classification. We validate this claim by reporting state-of-the-art performance on the Berkeley Attributes of People dataset.

PDF Abstract ICCV 2015 PDF ICCV 2015 Abstract

Code

Add Remove Mark official

gkioxari/RstarCNN official

132

VivekUdupa/CPSC8810-DeepLearning_vk…

Tasks

Add Remove

Action Recognition

Attribute

General Classification

Human-Object Interaction Detection

Temporal Action Localization

Datasets

MPII

Charades

HICO-DET

MPII Human Pose

HICO

Results from the Paper

Edit

Ranked #4 on Weakly Supervised Object Detection on HICO-DET

Get a GitHub badge

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank	Compare
Weakly Supervised Object Detection	Charades	R*CNN	MAP	0.99	# 5	See all
Weakly Supervised Object Detection	HICO-DET	R*CNN	MAP	2.15	# 4	See all
Human-Object Interaction Detection	HICO	R*CNN	mAP	28.5	# 8	See all

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Contextual Action Recognition with R*CNN

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit