The ShanghaiTech Campus dataset has 13 scenes with complex light conditions and camera angles. It contains 130 abnormal events and over 270, 000 training frames. Moreover, both the frame-level and pixel-level ground truth of abnormal events are annotated in this dataset.
165 PAPERS • 4 BENCHMARKS
XD-Violence is a large-scale audio-visual dataset for violence detection in videos.
37 PAPERS • 1 BENCHMARK
This dataset focuses only on the robbery category, presenting a new weakly labelled dataset that contains 486 new real–world robbery surveillance videos acquired from public sources.
0 PAPER • NO BENCHMARKS YET