Multiple Object Forecasting: Predicting Future Object Locations in Diverse Environments

26 Sep 2019  ·  Olly Styles, Tanaya Guha, Victor Sanchez ·

This paper introduces the problem of multiple object forecasting (MOF), in which the goal is to predict future bounding boxes of tracked objects. In contrast to existing works on object trajectory forecasting which primarily consider the problem from a birds-eye perspective, we formulate the problem from an object-level perspective and call for the prediction of full object bounding boxes, rather than trajectories alone. Towards solving this task, we introduce the Citywalks dataset, which consists of over 200k high-resolution video frames. Citywalks comprises of footage recorded in 21 cities from 10 European countries in a variety of weather conditions and over 3.5k unique pedestrian trajectories. For evaluation, we adapt existing trajectory forecasting methods for MOF and confirm cross-dataset generalizability on the MOT-17 dataset without fine-tuning. Finally, we present STED, a novel encoder-decoder architecture for MOF. STED combines visual and temporal features to model both object-motion and ego-motion, and outperforms existing approaches for MOF. Code & dataset link: https://github.com/olly-styles/Multiple-Object-Forecasting

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Multiple Object Forecasting Citywalks STED ADE 26.7 # 1
AIOU 54.3 # 1

Methods


No methods listed for this paper. Add relevant methods here