Video Understanding

13 papers with code · Computer Vision
Subtask of Video

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Greatest papers with code

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

CVPR 2018 tensorflow/models

The AVA dataset densely annotates 80 atomic visual actions in 430 15-minute video clips, where actions are localized in space and time, resulting in 1.58M action labels with multiple labels per person occurring frequently.

ACTION LOCALIZATION ACTION RECOGNITION VIDEO UNDERSTANDING

TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition

30 Mar 2017 chihyaoma/Activity-Recognition-with-CNN-and-RNN

We demonstrate that both RNNs (using LSTMs) and Temporal-ConvNets applied to spatiotemporal feature matrices are able to exploit spatiotemporal dynamics to improve the overall performance.

ACTION CLASSIFICATION ACTION RECOGNITION IN VIDEOS ACTIVITY RECOGNITION VIDEO CLASSIFICATION VIDEO UNDERSTANDING

Learnable pooling with Context Gating for video classification

21 Jun 2017 antoine77340/Youtube-8M-WILLOW

In particular, we evaluate our method on the large-scale multi-modal YouTube-8M v2 dataset and outperform all other methods in the YouTube-8M Large-Scale Video Understanding challenge.

VIDEO CLASSIFICATION VIDEO UNDERSTANDING

ECO: Efficient Convolutional Network for Online Video Understanding

ECCV 2018 mzolfaghari/ECO-efficient-video-understanding

In this paper, we introduce a network architecture that takes long-term content into account and enables fast per-video processing at the same time.

ACTION CLASSIFICATION VIDEO CAPTIONING VIDEO UNDERSTANDING

The Monkeytyping Solution to the YouTube-8M Video Understanding Challenge

16 Jun 2017 wangheda/youtube-8m

This article describes the final solution of team monkeytyping, who finished in second place in the YouTube-8M video understanding challenge.

VIDEO CLASSIFICATION VIDEO UNDERSTANDING

TSM: Temporal Shift Module for Efficient Video Understanding

20 Nov 2018 MIT-HAN-LAB/temporal-shift-module

The explosive growth in video streaming gives rise to challenges on efficiently extracting the spatial-temporal information to perform video understanding at low computation cost.

VIDEO RECOGNITION VIDEO UNDERSTANDING

Temporal Tessellation: A Unified Approach for Video Analysis

ICCV 2017 dot27/temporal-tessellation

A test video is processed by forming correspondences between its clips and the clips of reference videos with known semantics, following which, reference semantics can be transferred to the test video.

ACTION DETECTION VIDEO CAPTIONING VIDEO SUMMARIZATION VIDEO UNDERSTANDING
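
The Temporal Tessellation summary above describes matching clips of a test video to reference clips with known semantics and then transferring those semantics. A minimal sketch of that idea follows, assuming each clip is already represented by a feature vector and that each test clip is matched independently to its most similar reference clip by cosine similarity. The function names, the toy features, and the labels are hypothetical, and the independent nearest-neighbour match is an illustrative simplification, not the paper's actual procedure.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def transfer_semantics(test_clips, reference_clips):
    """Assign each test clip the label of its most similar reference clip.

    test_clips: list of feature vectors, one per clip of the test video.
    reference_clips: list of (feature_vector, label) pairs with known semantics.
    """
    transferred = []
    for feat in test_clips:
        # Correspondence step: find the best-matching reference clip.
        _, best_label = max(reference_clips, key=lambda ref: cosine(feat, ref[0]))
        # Transfer step: copy the reference semantics to the test clip.
        transferred.append(best_label)
    return transferred


# Toy data (hypothetical): two reference clips and a three-clip test video.
reference = [
    ([1.0, 0.0], "person walking"),
    ([0.0, 1.0], "dog running"),
]
test_video = [[0.9, 0.1], [0.2, 0.8], [0.1, 0.9]]

labels = transfer_semantics(test_video, reference)
print(labels)
```

In this sketch the transferred semantics are plain string labels; the same structure would apply if the reference semantics were captions or action annotations.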

Learnable Pooling Methods for Video Classification

1 Oct 2018 pomonam/LearnablePoolingMethods

We demonstrate our solutions in the 2nd YouTube-8M Video Understanding Challenge, using frame-level video and audio descriptors.

VIDEO CLASSIFICATION VIDEO UNDERSTANDING

Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning

HLT 2018 chitwansaharia/HACAModel

Furthermore, for the first time, we validate the superior performance of the deep audio features on the video captioning task.

VIDEO CAPTIONING VIDEO UNDERSTANDING

Joint Event Detection and Description in Continuous Video Streams

28 Feb 2018 VisionLearningGroup/JEDDi-Net

In order to explicitly model temporal relationships between visual events and their captions in a single video, we also propose a two-level hierarchical captioning module that keeps track of context.

DENSE VIDEO CAPTIONING VIDEO UNDERSTANDING