1 code implementation • ICCV 2023 • Anshul Shah, Benjamin Lundell, Harpreet Sawhney, Rama Chellappa
We address the problem of extracting key steps from unlabeled procedural videos, motivated by the potential of Augmented Reality (AR) headsets to revolutionize job training and performance.
no code implementations • 10 Jul 2022 • Ashraful Islam, Ben Lundell, Harpreet Sawhney, Sudipta Sinha, Peter Morales, Richard J. Radke
We evaluate our SSL approach on two downstream tasks -- object detection and semantic segmentation, using COCO, PASCAL VOC, and CityScapes datasets.
no code implementations • 22 Jul 2019 • Huseyin Coskun, Zeeshan Zia, Bugra Tekin, Federica Bogo, Nassir Navab, Federico Tombari, Harpreet Sawhney
The lack of large-scale real datasets with annotations makes transfer learning a necessity for video activity understanding.
no code implementations • 2 Dec 2015 • Mohamed Elhoseiny, Jingen Liu, Hui Cheng, Harpreet Sawhney, Ahmed Elgammal
To our knowledge, this is the first Zero-Shot event detection model that is built on top of distributional semantics and extends it in the following directions: (a) semantic embedding of multimodal information in videos (with focus on the visual modalities), (b) automatically determining relevance of concepts/attributes to a free text query, which could be useful for other applications, and (c) retrieving videos by free text event query (e. g., "changing a vehicle tire") based on their content.
no code implementations • 25 Oct 2015 • S. Hussain Raza, Omar Javed, Aveek Das, Harpreet Sawhney, Hui Cheng, Irfan Essa
We propose to learn and infer depth in videos from appearance, motion, occlusion boundaries, and geometric context of the scene.
no code implementations • CVPR 2014 • Jiejie Zhu, Omar Javed, Jingen Liu, Qian Yu, Hui Cheng, Harpreet Sawhney
In this paper, we give a comparative evaluation of the proposed method and demonstrate that MIMS outperforms the state of the art approaches in identifying pedestrians from low resolution airborne videos.
no code implementations • CVPR 2013 • Weixin Li, Qian Yu, Harpreet Sawhney, Nuno Vasconcelos
A video sequence is decomposed into short-term segments, which are characterized by the dynamics of their attributes.