TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Action Recognition	HMDB-51	Hidden Two-Stream	Average accuracy of 3 splits	78.7	# 26
Action Recognition	UCF101	Hidden Two-Stream	3-fold Accuracy	97.1	# 20

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hidden-two-stream-convolutional-networks-for/action-recognition-in-videos-on-ucf101)](https://paperswithcode.com/sota/action-recognition-in-videos-on-ucf101?p=hidden-two-stream-convolutional-networks-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/hidden-two-stream-convolutional-networks-for/action-recognition-in-videos-on-hmdb-51)](https://paperswithcode.com/sota/action-recognition-in-videos-on-hmdb-51?p=hidden-two-stream-convolutional-networks-for)`

Hidden Two-Stream Convolutional Networks for Action Recognition

2 Apr 2017 · Yi Zhu, Zhenzhong Lan, Shawn Newsam, Alexander G. Hauptmann ·

Analyzing videos of human actions involves understanding the temporal relationships among video frames. State-of-the-art action recognition approaches rely on traditional optical flow estimation methods to pre-compute motion information for CNNs. Such a two-stage approach is computationally expensive, storage demanding, and not end-to-end trainable. In this paper, we present a novel CNN architecture that implicitly captures motion information between adjacent frames. We name our approach hidden two-stream CNNs because it only takes raw video frames as input and directly predicts action classes without explicitly computing optical flow. Our end-to-end approach is 10x faster than its two-stage baseline. Experimental results on four challenging action recognition datasets: UCF101, HMDB51, THUMOS14 and ActivityNet v1.2 show that our approach significantly outperforms the previous best real-time approaches.

PDF Abstract

Code

Add Remove Mark official

bryanyzhu/Hidden-Two-Stream official

194

bryanyzhu/two-stream-pytorch

553

AbdalaDiasse/Video-classification-f…

Tasks

Add Remove

Action Recognition

Optical Flow Estimation

Temporal Action Localization

Vocal Bursts Valence Prediction

Datasets

UCF101

HMDB51

ActivityNet

THUMOS14

Results from the Paper

Edit

Ranked #20 on Action Recognition on UCF101

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Action Recognition	HMDB-51	Hidden Two-Stream	Average accuracy of 3 splits	78.7	# 26		Compare
Action Recognition	UCF101	Hidden Two-Stream	3-fold Accuracy	97.1	# 20		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Hidden Two-Stream Convolutional Networks for Action Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove