Multimodal Activity Recognition

11 papers with code • 9 benchmarks • 6 datasets

Greatest papers with code

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

MIT-HAN-LAB/temporal-shift-module 2 Aug 2016

The other contribution is our study on a series of good practices in learning ConvNets on video data with the help of temporal segment network.

Action Classification Action Recognition +3

Moments in Time Dataset: one million videos for event understanding

zhoubolei/moments_models 9 Jan 2018

We present the Moments in Time Dataset, a large-scale human-annotated collection of one million short videos corresponding to dynamic events unfolding within three seconds.

Action Recognition Multimodal Activity Recognition

Interpretable 3D Human Action Analysis with Temporal Convolutional Networks

TaeSoo-Kim/TCNActionRecognition 14 Apr 2017

In this work, we propose to use a new class of models known as Temporal Convolutional Neural Networks (TCN) for 3D human action recognition.

Action Analysis Multimodal Activity Recognition +1

Distilling Audio-Visual Knowledge by Compositional Contrastive Learning

yanbeic/CCL CVPR 2021

Having access to multi-modal cues (e.g., vision and audio) empowers some cognitive tasks to be done faster compared to learning from a single modality.

Audio Tagging audio-visual learning +5

Cross-modal Learning by Hallucinating Missing Modalities in RGB-D Vision

ncgarcia/modality-distillation Multimodal Scene Understanding Algorithms, Applications and Deep Learning 2019

We report state-of-the-art or comparable results on video action recognition on the largest multimodal dataset available for this task, the NTU RGB+D, as well as on the UWA3DII and Northwestern-UCLA.

Action Recognition Multimodal Activity Recognition +1

EV-Action: Electromyography-Vision Multi-Modal Action Dataset

wanglichenxj/EV-Action-Electromyography-Vision-Multi-Modal-Action-Dataset 20 Apr 2019

To make up for this, we introduce a new, large-scale EV-Action dataset in this work, which consists of RGB, depth, electromyography (EMG), and two skeleton modalities.

Action Analysis Action Recognition +2

Bayesian Hierarchical Dynamic Model for Human Action Recognition

rort1989/HDM CVPR 2019

Human action recognition remains a challenging task, partially due to the presence of large variations in the execution of actions.

Action Recognition Bayesian Inference +2