…The dataset consists of two components: segmented videos for activity recognition and continuous videos for activity classification.
5 PAPERS • NO BENCHMARKS YET
…AVA Speech densely annotates audio-based speech activity in AVA v1.0 videos, and explicitly labels 3 background noise conditions, resulting in ~46K labeled segments spanning 45 hours of data.
98 PAPERS • 7 BENCHMARKS
…Squats Bird Dogs Supermans Bicycle Crunches Leg Raises Front Raises (with dumbbells) Overhead Press (with dumbbells) Annotations The dataset includes the following annotations: Bounding boxes Segmentation
0 PAPER • NO BENCHMARKS YET