Video Understanding

19 papers with code · Computer Vision
Subtask of Video

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Greatest papers with code

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

CVPR 2018 tensorflow/models

The AVA dataset densely annotates 80 atomic visual actions in 430 15-minute video clips, where actions are localized in space and time, resulting in 1.58M action labels with multiple labels per person occurring frequently.

ACTION LOCALIZATION · VIDEO UNDERSTANDING · ZERO-SHOT ACTION RECOGNITION

TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition

30 Mar 2017 chihyaoma/Activity-Recognition-with-CNN-and-RNN

We demonstrate that both RNNs (using LSTMs) and Temporal-ConvNets applied to spatiotemporal feature matrices are able to exploit spatiotemporal dynamics to improve the overall performance.

ACTION CLASSIFICATION · ACTION RECOGNITION IN VIDEOS · ACTIVITY RECOGNITION · VIDEO CLASSIFICATION · VIDEO UNDERSTANDING

Learnable pooling with Context Gating for video classification

21 Jun 2017 antoine77340/Youtube-8M-WILLOW

In particular, we evaluate our method on the large-scale multi-modal Youtube-8M v2 dataset and outperform all other methods in the Youtube 8M Large-Scale Video Understanding challenge.

VIDEO CLASSIFICATION · VIDEO UNDERSTANDING

ECO: Efficient Convolutional Network for Online Video Understanding

ECCV 2018 mzolfaghari/ECO-efficient-video-understanding

In this paper, we introduce a network architecture that takes long-term content into account and enables fast per-video processing at the same time.

ACTION CLASSIFICATION · VIDEO CAPTIONING · VIDEO UNDERSTANDING

End-to-End Learning of Motion Representation for Video Understanding

CVPR 2018 LijieFan/tvnet

Despite the recent success of end-to-end learned representations, hand-crafted optical flow features are still widely used in video analysis tasks.

ACTION RECOGNITION IN VIDEOS · OPTICAL FLOW ESTIMATION · VIDEO UNDERSTANDING

TSM: Temporal Shift Module for Efficient Video Understanding

20 Nov 2018 MIT-HAN-LAB/temporal-shift-module

The explosive growth in video streaming gives rise to challenges on efficiently extracting the spatial-temporal information to perform video understanding at low computation cost.

VIDEO RECOGNITION · VIDEO UNDERSTANDING

The Monkeytyping Solution to the YouTube-8M Video Understanding Challenge

16 Jun 2017 wangheda/youtube-8m

This article describes the final solution of team monkeytyping, who finished in second place in the YouTube-8M video understanding challenge.

VIDEO CLASSIFICATION · VIDEO UNDERSTANDING

Temporal Tessellation: A Unified Approach for Video Analysis

ICCV 2017 dot27/temporal-tessellation

A test video is processed by forming correspondences between its clips and the clips of reference videos with known semantics, following which, reference semantics can be transferred to the test video.

ACTION DETECTION · VIDEO CAPTIONING · VIDEO SUMMARIZATION · VIDEO UNDERSTANDING
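
The correspondence-and-transfer idea in the Temporal Tessellation summary can be illustrated with a small sketch. This is not the authors' implementation: it assumes each clip is already represented by a feature vector, matches every test clip independently to its most similar reference clip by cosine similarity, and copies that clip's label. The function names, the toy features, and the labels are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def transfer_semantics(test_clips, reference_clips):
    """Copy to each test clip the label of its most similar reference clip.

    test_clips: list of feature vectors, one per clip of the test video.
    reference_clips: list of (feature_vector, label) pairs with known semantics.
    """
    transferred = []
    for clip in test_clips:
        # Pick the reference clip whose features are closest to this test clip.
        _, best_label = max(reference_clips, key=lambda ref: cosine(clip, ref[0]))
        transferred.append(best_label)
    return transferred


# Toy, made-up clip features and labels (assumptions for illustration only).
reference = [([1.0, 0.0], "a person runs"), ([0.0, 1.0], "a person sits")]
test_video = [[0.9, 0.1], [0.2, 0.8]]
labels = transfer_semantics(test_video, reference)
print(labels)
```

Matching each clip in isolation keeps the sketch short; it ignores any ordering or consistency between consecutive clips, which a full system would likely need to account for.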

What does a Car-ssette tape tell?

31 May 2019 richermans/AudioCaption

This paper contributes a manually annotated dataset of car scenes, extending a previously published hospital audio captioning dataset.

VIDEO UNDERSTANDING · WORD EMBEDDINGS

Learnable Pooling Methods for Video Classification

1 Oct 2018 pomonam/LearnablePoolingMethods

We demonstrate our solutions in "The 2nd YouTube-8M Video Understanding Challenge", using frame-level video and audio descriptors.

VIDEO CLASSIFICATION · VIDEO UNDERSTANDING