Browse > Computer Vision > Action Detection

Action Detection

48 papers with code ยท Computer Vision

Leaderboards

Latest papers without code

Spatio-Temporal Action Detection with Multi-Object Interaction

1 Apr 2020

Spatio-temporal action detection in videos requires localizing the action both spatially and temporally in the form of an "action tube".

ACTION DETECTION HUMAN DETECTION

Long Short-Term Relation Networks for Video Action Detection

31 Mar 2020

It has been well recognized that modeling human-object or object-object relations would be helpful for detection task.

ACTION DETECTION

Revisiting Few-shot Activity Detection with Class Similarity Control

31 Mar 2020

In this paper, we present a conceptually simple and general yet novel framework for few-shot temporal activity detection based on proposal regression which detects the start and end time of the activities in untrimmed videos.

ACTION DETECTION ACTIVITY DETECTION VIDEO CLASSIFICATION

Comprehensive Instructional Video Analysis: The COIN Dataset and Performance Evaluation

20 Mar 2020

We believe the introduction of the COIN dataset will promote the future in-depth research on instructional video analysis for the community.

ACTION DETECTION

A Novel Online Action Detection Framework from Untrimmed Video Streams

17 Mar 2020

Online temporal action localization from an untrimmed video stream is a challenging problem in computer vision.

ACTION DETECTION TEMPORAL ACTION LOCALIZATION

ZSTAD: Zero-Shot Temporal Activity Detection

12 Mar 2020

An integral part of video analysis and surveillance is temporal activity detection, which means to simultaneously recognize and localize activities in long untrimmed videos.

ACTION DETECTION ACTIVITY DETECTION

Crossmodal learning for audio-visual speech event localization

9 Mar 2020

We present a state-of-the-art audio-visual voice activity detection system and demonstrate that the learned embeddings can effectively localize to active speakers in the visual frames.

ACTION DETECTION ACTIVITY DETECTION

DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team

23 Feb 2020

Our diarization system includes multiple modules, namely voice activity detection (VAD), segmentation, speaker embedding extraction, similarity scoring, clustering, resegmentation and overlap detection.

ACTION DETECTION ACTIVITY DETECTION

3D ResNet with Ranking Loss Function for Abnormal Activity Detection in Videos

4 Feb 2020

Afterwards, using these features and deep multiple instance learning along with the proposed ranking loss, our model learns to predict the abnormality score at the video segment level.

ACTION DETECTION ACTIVITY DETECTION MULTIPLE INSTANCE LEARNING TEMPORAL ACTION LOCALIZATION