Activity Detection

63 papers with code • 1 benchmarks • 12 datasets

Detecting activities in extended videos.

Benchmarks

Add a Result

These leaderboards are used to track progress in Activity Detection

Trend	Dataset	Best Model	Paper	Code	Compare
	AVA-Speech	CNN-BiLSTM_best			See all

Libraries

Use these libraries to find Activity Detection models and implementations

alibaba-damo-academy/FunASR

3 papers

3,189

Datasets

Most implemented papers

Most implemented Social Latest No code

Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks

imatge-upc/activitynet-2016-cvprw • • 29 Aug 2016

This thesis explore different approaches using Convolutional and Recurrent Neural Networks to classify and temporally localize activities on videos, furthermore an implementation to achieve it has been proposed.

Paper
Code

An End-to-End Architecture for Keyword Spotting and Voice Activity Detection

mindorii/kws • • 28 Nov 2016

We propose a single neural network architecture for two tasks: on-line keyword spotting and voice activity detection.

Paper
Code

R-C3D: Region Convolutional 3D Network for Temporal Activity Detection

VisionLearningGroup/R-C3D • ICCV 2017

We address the problem of activity detection in continuous, untrimmed video streams.

Paper
Code

Fine-grained Activity Recognition in Baseball Videos

piergiaj/mlb-youtube • • 9 Apr 2018

In this paper, we introduce a challenging new dataset, MLB-YouTube, designed for fine-grained activity detection.

Paper
Code

rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method

zhenghuatan/rVAD • 9 Jun 2019

In the end, a posteriori SNR weighted energy difference is applied to the extended pitch segments of the denoised speech signal for detecting voice activity.

Paper
Code

pyannote.audio: neural building blocks for speaker diarization

pyannote/pyannote-audio • • 4 Nov 2019

We introduce pyannote. audio, an open-source toolkit written in Python for speaker diarization.

Paper
Code

Multi-Speaker and Wide-Band Simulated Conversations as Training Data for End-to-End Neural Diarization

butspeechfit/eend • • 12 Nov 2022

End-to-end diarization presents an attractive alternative to standard cascaded diarization systems because a single system can handle all aspects of the task at once.

Paper
Code

Learning Latent Super-Events to Detect Multiple Activities in Videos

piergiaj/super-events-cvpr18 • • CVPR 2018

In this paper, we introduce the concept of learning latent super-events from activity videos, and present how it benefits activity detection in continuous videos.

Paper
Code

Personal VAD: Speaker-Conditioned Voice Activity Detection

pirxus/personalVAD • • 12 Aug 2019

In this paper, we propose "personal VAD", a system to detect the voice activity of a target speaker at the frame level.

Paper
Code

Harvesting Ambient RF for Presence Detection Through Deep Learning

bigtreeyanger/presence_detection_cnn • • 13 Feb 2020

With presence detection, how to collect training data with human presence can have a significant impact on the performance.

Paper
Code

Activity Detection

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result