Activity Detection

30 papers with code • 0 benchmarks • 9 datasets

Detecting activities in extended videos.

Latest papers with code

Fine-Grained Classroom Activity Detection from Audio with Neural Networks

hutchresearch/fine-grained-cad 29 Jul 2021

We obtain strong results on the new fine-grained task and state-of-the-art on the 4-way task: our best model obtains frame-level error rates of 6.2%, 7.7% and 28.0% when generalizing to unseen instructors for the 4-way, 5-way, and 9-way classification tasks, respectively (relative reductions of 35.4%, 48.3% and 21.6% over a strong baseline).
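The relative reductions quoted above can be related to absolute error rates with a little arithmetic. The sketch below assumes the usual definition, relative reduction = (baseline - model) / baseline, and back-computes the baseline error each figure would imply. The baseline values are not stated in this snippet, so the results are approximations derived from the rounded percentages, and the function name `implied_baseline` is purely illustrative.

```python
def implied_baseline(error_rate: float, relative_reduction: float) -> float:
    """Return the baseline error (in percent) implied by a model error rate
    and a relative reduction, both given in percent.

    Assumes relative_reduction = (baseline - model) / baseline.
    """
    return error_rate / (1.0 - relative_reduction / 100.0)


# (model error %, relative reduction %) as quoted in the snippet above.
reported = {
    "4-way": (6.2, 35.4),
    "5-way": (7.7, 48.3),
    "9-way": (28.0, 21.6),
}

# Derived values only; the source does not report the baselines directly.
baselines = {task: implied_baseline(err, red) for task, (err, red) in reported.items()}

for task, baseline in baselines.items():
    print(f"{task}: implied baseline of about {baseline:.1f}%")
```

Under that assumption the implied baselines come out at roughly 9.6%, 14.9% and 35.7%, which gives a sense of the absolute gap behind each relative figure.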

Action Detection Activity Detection

WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments

aispeech-lab/wase 13 Jun 2021

In the speaker extraction problem, additional information about the target speaker, such as voiceprint, lip movement, facial expression, and spatial information, has been found to help track and extract that speaker.

Action Detection Activity Detection

End-to-end speaker segmentation for overlap-aware resegmentation

pyannote/segmentation 8 Apr 2021

Experiments on multiple speaker diarization datasets conclude that our model can be used with great success on both voice activity detection and overlapped speech detection.

Action Detection Activity Detection +2

A Hybrid CNN-BiLSTM Voice Activity Detector

NickWilkinson37/voxseg 5 Mar 2021

We find that significantly smaller models with near-optimal parameters perform on par with larger models trained with optimal parameters.

Action Detection Activity Detection

Coarse-Fine Networks for Temporal Activity Detection in Videos

kkahatapitiya/Coarse-Fine-Networks CVPR 2021

In this paper, we introduce Coarse-Fine Networks, a two-stream architecture which benefits from different abstractions of temporal resolution to learn better video representations for long-term motion.

Action Detection Activity Detection

01 Mar 2021

ROAD: The ROad event Awareness Dataset for Autonomous Driving

gurkirt/road-dataset 23 Feb 2021

Humans approach driving in a holistic fashion which entails, in particular, understanding road events and their evolution.

Action Detection Activity Detection +3

AV Taris: Online Audio-Visual Speech Recognition

georgesterpu/Taris 14 Dec 2020

In recent years, Automatic Speech Recognition (ASR) technology has approached human-level performance on conversational speech under relatively clean listening conditions.

Action Detection Activity Detection +2

VoxLingua107: a Dataset for Spoken Language Recognition

alumae/torch-xvectors-wav 25 Nov 2020

Speech activity detection and speaker diarization are used to extract segments from the videos that contain speech.

Action Detection Activity Detection +3

Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection

dairui01/TSU_evaluation 28 Oct 2020

This work aims at building a large scale dataset with daily-living activities performed in a natural manner.

Action Detection Activity Detection

RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns

arnabkmondal/RespVAD 21 Aug 2020

The respiration pattern is first extracted from the video, focusing on the abdominal-thoracic region of a speaker, using an optical-flow-based method.

Action Detection Activity Detection +1
