2 code implementations • ICCV 2019 • Gines Hidalgo, Yaadhav Raaj, Haroon Idrees, Donglai Xiang, Hanbyul Joo, Tomas Simon, Yaser Sheikh
We present the first single-network approach for 2D~whole-body pose estimation, which entails simultaneous localization of body, face, hands, and feet keypoints.
no code implementations • CVPR 2019 • Yaadhav Raaj, Haroon Idrees, Gines Hidalgo, Yaser Sheikh
We present an online approach to efficiently and simultaneously detect and track the 2D pose of multiple people in a video sequence.
Ranked #7 on Pose Tracking on PoseTrack2017 (using extra training data)
no code implementations • ECCV 2018 • Haroon Idrees, Muhmmad Tayyab, Kishan Athrey, Dong Zhang, Somaya Al-Maadeed, Nasir Rajpoot, Mubarak Shah
With multiple crowd gatherings of millions of people every year in events ranging from pilgrimages to protests, concerts to marathons, and festivals to funerals; visual crowd analysis is emerging as a new frontier in computer vision.
Ranked #15 on Crowd Counting on UCF-QNRF
no code implementations • 4 Feb 2017 • Eyasu Zemene, Yonatan Tariku, Haroon Idrees, Andrea Prati, Marcello Pelillo, Mubarak Shah
We cast the geo-localization as a clustering problem on local image features.
no code implementations • 7 Dec 2016 • Shayan Modiri Assari, Haroon Idrees, Mubarak Shah
This paper addresses the problem of human re-identification across non-overlapping cameras in crowds. Re-identification in crowded scenes is a challenging problem due to large number of people and frequent occlusions, coupled with changes in their appearance due to different properties and exposure of cameras.
no code implementations • 4 Dec 2016 • Khurram Soomro, Haroon Idrees, Mubarak Shah
For online prediction of action (interaction) confidences, we propose an approach based on Structural SVM that operates on short video segments, and is trained with the objective that confidence of an action or interaction increases as time progresses.
no code implementations • CVPR 2016 • Khurram Soomro, Haroon Idrees, Mubarak Shah
This paper proposes a novel approach to tackle the challenging problem of 'online action localization' which entails predicting actions and their locations as they happen in a video.
no code implementations • 21 Apr 2016 • Haroon Idrees, Amir R. Zamir, Yu-Gang Jiang, Alex Gorban, Ivan Laptev, Rahul Sukthankar, Mubarak Shah
Additionally, we include a comprehensive empirical study evaluating the differences in action recognition between trimmed and untrimmed videos, and how well methods trained on trimmed videos generalize to untrimmed videos.
no code implementations • ICCV 2015 • Khurram Soomro, Haroon Idrees, Mubarak Shah
Context relations are learned during training which capture displacements from all the supervoxels in a video to those belonging to foreground actions.
no code implementations • CVPR 2014 • Afshin Dehghan, Haroon Idrees, Mubarak Shah
A video captures a sequence and interactions of concepts that can be static, for instance, objects or scenes, or dynamic, such as actions.
no code implementations • CVPR 2014 • Mahdi M. Kalayeh, Haroon Idrees, Mubarak Shah
Such models become obsolete and require relearning when new images and tags are added to database.
no code implementations • CVPR 2013 • Haroon Idrees, Imran Saleemi, Cody Seibert, Mubarak Shah
Instead, our approach relies on multiple sources such as low confidence head detections, repetition of texture elements (using SIFT), and frequency-domain analysis to estimate counts, along with confidence associated with observing individuals, in an image region.
Ranked #21 on Crowd Counting on UCF CC 50