VideoGraph: Recognizing Minutes-Long Human Activities in Videos

13 May 2019 · Noureldien Hussein, Efstratios Gavves, Arnold W. M. Smeulders

Many human activities take minutes to unfold. To represent them, related works opt for statistical pooling, which neglects the temporal structure, or for convolutional methods such as CNNs and Non-Local blocks. While successful in learning temporal concepts, the latter fall short of modeling minutes-long temporal dependencies. We propose VideoGraph, a method that achieves the best of both worlds: it represents minutes-long human activities and learns their underlying temporal structure. VideoGraph learns a graph-based representation for human activities. The graph, its nodes, and its edges are learned entirely from video datasets, making VideoGraph applicable to problems without node-level annotation. The result is improvements over related works on the EPIC-Kitchens and Breakfast benchmarks. In addition, we demonstrate that VideoGraph is able to learn the temporal structure of human activities in minutes-long videos.
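To make the idea concrete, below is a minimal PyTorch sketch of a graph-based video model in the spirit of the abstract: segment features attend over a set of learned latent graph nodes, node activations are mixed across nodes and over time, then pooled for classification. The class name, layer sizes, and the choice of a linear layer as a stand-in for edge learning are illustrative assumptions, not the authors' exact implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VideoGraphSketch(nn.Module):
    """Hedged sketch of a graph-based model for minutes-long videos.

    Segment features (e.g. from a pretrained I3D backbone) attend over
    learned latent "node" embeddings; node activations are mixed across
    nodes and over timesteps, then pooled for classification. All names
    and sizes here are illustrative, not the paper's exact architecture.
    """

    def __init__(self, feat_dim=1024, num_nodes=128, num_classes=10):
        super().__init__()
        # Latent graph nodes, learned end-to-end from the dataset
        # (no node-level annotation required).
        self.nodes = nn.Parameter(torch.randn(num_nodes, feat_dim) * 0.01)
        # Mix information across nodes: a cheap stand-in for edge learning.
        self.node_mix = nn.Linear(num_nodes, num_nodes)
        # 1-D convolution over segment timesteps models temporal structure.
        self.temporal = nn.Conv1d(num_nodes, num_nodes, kernel_size=3, padding=1)
        self.classifier = nn.Linear(num_nodes, num_classes)

    def forward(self, x):
        # x: (B, T, C) features for T video segments.
        # Dot-product similarity between each segment and each latent node.
        sim = torch.einsum('btc,nc->btn', x, self.nodes)   # (B, T, N)
        attn = F.softmax(sim, dim=-1)                      # attention over nodes
        z = self.node_mix(attn)                            # mix node activations
        z = F.relu(self.temporal(z.transpose(1, 2)))       # (B, N, T) temporal conv
        z = z.mean(dim=-1)                                 # pool over time
        return self.classifier(z)

# Usage: two videos, 64 one-second segments each, 1024-D segment features.
feats = torch.randn(2, 64, 1024)
logits = VideoGraphSketch(num_classes=10)(feats)
print(logits.shape)  # torch.Size([2, 10])
```

In contrast to statistical pooling, the temporal convolution over node activations retains the order of segments, which is what lets a model of this shape capture structure in minutes-long activities.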

Task | Dataset | Model | Metric | Value | Global Rank
Video Classification | Breakfast | VideoGraph | Accuracy (%) | 69.5 | #7
Long-video Activity Recognition | Breakfast | VideoGraph (I3D-K400-Pretrain-feature) | mAP | 63.14 | #6
