Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection

ICCV 2017  ·  Pierre Baqué, François Fleuret, Pascal Fua ·

People detection in single 2D images has improved greatly in recent years. However, comparatively little of this progress has percolated into multi-camera multi-people tracking algorithms, whose performance still degrades severely when scenes become very crowded. In this work, we introduce a new architecture that combines Convolutional Neural Nets and Conditional Random Fields to explicitly model those ambiguities. One of its key ingredients are high-order CRF terms that model potential occlusions and give our approach its robustness even when many people are present. Our model is trained end-to-end and we show that it outperforms several state-of-art algorithms on challenging scenes.

PDF Abstract ICCV 2017 PDF ICCV 2017 Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Multiview Detection MultiviewX Deep-Occulsion MODA 75.2 # 8
MODP 54.7 # 8
Multiview Detection Wildtrack Deep-Occlusion MODA 74.1 # 9
MODP 53.8 # 8

Methods


No methods listed for this paper. Add relevant methods here