Action Detection

233 papers with code • 11 benchmarks • 33 datasets

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Libraries

Use these libraries to find Action Detection models and implementations
6 papers
3,884
2 papers
2,986
See all 6 libraries.

Latest papers with no code

High-speed Low-consumption sEMG-based Transient-state micro-Gesture Recognition

no code yet • 4 Mar 2024

The accuracy of the proposed SNN is 83. 85% and 93. 52% on the two datasets respectively.

Fast Low-parameter Video Activity Localization in Collaborative Learning Environments

no code yet • 2 Mar 2024

Research on video activity detection has primarily focused on identifying well-defined human activities in short video segments.

Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access: A Free Probability Theory Approach

no code yet • 28 Feb 2024

Grant-free random access (RA) has been recognized as a promising solution to support massive connectivity due to the removal of the uplink grant request procedures.

Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection

no code yet • 13 Feb 2024

A channel-number invariant loss is proposed to learn a unique feature representation regardless of the number of available microphones.

Device Activity Detection and Channel Estimation for Millimeter-Wave Massive MIMO

no code yet • 7 Feb 2024

Different from traditional compressed sensing (CS) methods that only use the sparsity of user activities, we develop several Approximate Message Passing (AMP) based CS algorithms by exploiting the sparsity of user activities and mmWave channels.

A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model

no code yet • 5 Feb 2024

Criminal and suspicious activity detection has become a popular research topic in recent years.

Joint User Detection and Localization in Near-Field Using Reconfigurable Intelligent Surfaces

no code yet • 4 Feb 2024

This letter studies the problem of jointly detecting active user equipments (UEs) and estimating their location in the near field, wherein the base station (BS) is unaware of the number of active (or inactive) UEs and their positions.

Activity Detection for Massive Connectivity in Cell-free Networks with Unknown Large-scale Fading, Channel Statistics, Noise Variance, and Activity Probability: A Bayesian Approach

no code yet • 30 Jan 2024

This problem is even more severe in cell-free networks as there are many of these parameters to be acquired.

Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model

no code yet • 29 Jan 2024

The results are promising for real-time application, and the falls are detected on video level with a state-of-the-art 0. 96 F1 score on the HQFSD dataset under the given experimental settings.

Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

no code yet • 16 Jan 2024

The proposed method can take audio-visual input and leverage the speaker's acoustic footprint or lip track to flexibly conduct audio-based, video-based, and audio-visual speaker diarization in a unified sequence-to-sequence framework.