Mining Inter-Video Proposal Relations for Video Object Detection

Recent studies have shown that, context aggregating information from proposals in different frames can clearly enhance the performance of video object detection. However, these approaches mainly exploit the intra-proposal relation within single video, while ignoring the intra-proposal relation among different videos, which can provide important discriminative cues for recognizing confusing objects... (read more)

PDF Abstract

Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK BENCHMARK
Video Object Detection ImageNet VID HVRNet + ResNeXt101-32x4d MAP 85.5 # 1
Video Object Detection ImageNet VID HVRNet + ResNest101 MAP 83.8 # 5

Methods used in the Paper