Greatest papers with code

MoViNets: Mobile Video Networks for Efficient Video Recognition

21 Mar 2021 tensorflow/models

We present Mobile Video Networks (MoViNets), a family of computation and memory efficient video networks that can operate on streaming video for online inference.

ACTION CLASSIFICATION ACTION RECOGNITION NEURAL ARCHITECTURE SEARCH VIDEO RECOGNITION

Non-local Neural Networks

CVPR 2018 facebookresearch/detectron

Both convolutional and recurrent operations are building blocks that process one local neighborhood at a time.

Ranked #8 on Keypoint Detection on COCO (Validation AP metric)

ACTION CLASSIFICATION ACTION RECOGNITION INSTANCE SEGMENTATION KEYPOINT DETECTION OBJECT DETECTION

AssembleNet++: Assembling Modality Representations via Attention Connections

18 Aug 2020 google-research/google-research

We create a family of powerful video models which are able to: (i) learn interactions between semantic object information and raw appearance and motion features, and (ii) deploy attention in order to better learn the importance of features at each convolutional block of the network.

ACTION CLASSIFICATION ACTIVITY RECOGNITION

Large-scale weakly-supervised pre-training for video action recognition

CVPR 2019 microsoft/computervision-recipes

Second, frame-based models perform quite well on action recognition; is pre-training for good image features sufficient or is pre-training for spatio-temporal features valuable for optimal transfer learning?

Ranked #1 on Egocentric Activity Recognition on EPIC-KITCHENS-55 (Actions Top-1 (S2) metric)

ACTION CLASSIFICATION ACTION RECOGNITION ACTIVITY RECOGNITION IN VIDEOS EGOCENTRIC ACTIVITY RECOGNITION TRANSFER LEARNING

A Closer Look at Spatiotemporal Convolutions for Action Recognition

CVPR 2018 microsoft/computervision-recipes

In this paper we discuss several forms of spatiotemporal convolutions for video analysis and study their effects on action recognition.

ACTION CLASSIFICATION ACTION RECOGNITION

Deep Concept-wise Temporal Convolutional Networks for Action Localization

26 Aug 2019 PaddlePaddle/models

In this paper, we empirically find that stacking more conventional temporal convolution layers actually degrades action classification performance, possibly because all channels of the 1D feature map, which are generally highly abstract and can be regarded as latent concepts, are excessively recombined in temporal convolution.

ACTION CLASSIFICATION ACTION LOCALIZATION

Revisiting ResNets: Improved Training and Scaling Strategies

13 Mar 2021 tensorflow/tpu

Using improved training and scaling strategies, we design a family of ResNet architectures, ResNet-RS, which are 1.7x - 2.7x faster than EfficientNets on TPUs, while achieving similar accuracies on ImageNet.

ACTION CLASSIFICATION IMAGE CLASSIFICATION VIDEO CLASSIFICATION

X3D: Expanding Architectures for Efficient Video Recognition

CVPR 2020 facebookresearch/SlowFast

This paper presents X3D, a family of efficient video networks that progressively expand a tiny 2D image classification architecture along multiple network axes, in space, time, width and depth.

ACTION CLASSIFICATION FEATURE SELECTION IMAGE CLASSIFICATION VIDEO CLASSIFICATION VIDEO RECOGNITION

Audiovisual SlowFast Networks for Video Recognition

23 Jan 2020 facebookresearch/SlowFast

We present Audiovisual SlowFast Networks, an architecture for integrated audiovisual perception.

ACTION CLASSIFICATION VIDEO RECOGNITION