Integrated Object Detection and Tracking with Tracklet-Conditioned Detection

27 Nov 2018  ·  Zheng Zhang, Dazhi Cheng, Xizhou Zhu, Stephen Lin, Jifeng Dai ·

Accurate detection and tracking of objects is vital for effective video understanding. In previous work, the two tasks have been combined in a way that tracking is based heavily on detection, but the detection benefits marginally from the tracking. To increase synergy, we propose to more tightly integrate the tasks by conditioning the object detection in the current frame on tracklets computed in prior frames. With this approach, the object detection results not only have high detection responses, but also improved coherence with the existing tracklets. This greater coherence leads to estimated object trajectories that are smoother and more stable than the jittered paths obtained without tracklet-conditioned detection. Over extensive experiments, this approach is shown to achieve state-of-the-art performance in terms of both detection and tracking accuracy, as well as noticeable improvements in tracking stability.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Video Object Detection ImageNet VID Tracklet-Conditioned Detection+DCNv2+FGFA MAP 83.5 # 19

Methods


No methods listed for this paper. Add relevant methods here