no code implementations • 12 Sep 2023 • Syed Waleed Hyder, Muhammad Usama, Anas Zafar, Muhammad Naufil, Fawad Javed Fateh, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran
This paper presents a 2D skeleton-based action segmentation method with applications in fine-grained human activity recognition.
no code implementations • 31 May 2023 • Quoc-Huy Tran, Muhammad Ahmed, Murad Popattia, M. Hassan Ahmed, Andrey Konin, M. Zeeshan Zia
This paper presents a self-supervised temporal video alignment framework which is useful for several fine-grained human activity understanding applications.
no code implementations • 31 May 2023 • Quoc-Huy Tran, Ahmed Mehmood, Muhammad Ahmed, Muhammad Naufil, Anas Zafar, Andrey Konin, M. Zeeshan Zia
The frame-level prediction module is trained in an unsupervised manner via temporal optimal transport.
1 code implementation • 21 Jul 2022 • Khoi D. Nguyen, Quoc-Huy Tran, Khoi Nguyen, Binh-Son Hua, Rang Nguyen
To the best of our knowledge, our work is the first to explore transductive few-shot video classification.
no code implementations • 30 Jun 2022 • Hamza Khan, Sanjay Haresh, Awais Ahmed, Shakeeb Siddiqui, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran
We introduce a novel approach for temporal activity segmentation with timestamp supervision.
1 code implementation • NeurIPS 2021 • Duong H. Le, Khoi D. Nguyen, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, Binh-Son Hua
In this work, we propose to use out-of-distribution samples, i. e., unlabeled samples coming from outside the target classes, to improve few-shot learning.
1 code implementation • Advances in Neural Information Processing Systems 2021 • Duong H. Le*, Khoi D. Nguyen*, Khoi Nguyen, Quoc-Huy Tran, Rang Nguyen, Binh-Son Hua
In this work, we propose to use out-of-distribution samples, i. e., unlabeled samples coming from outside the target classes, to improve few-shot learning.
no code implementations • CVPR 2022 • Sateesh Kumar, Sanjay Haresh, Awais Ahmed, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran
The temporal optimal transport module enables our approach to learn effective representations for unsupervised activity segmentation.
no code implementations • CVPR 2021 • Sanjay Haresh, Sateesh Kumar, Huseyin Coskun, Shahram Najam Syed, Andrey Konin, Muhammad Zeeshan Zia, Quoc-Huy Tran
To overcome this problem, we propose a temporal regularization term (i. e., Contrastive-IDM) which encourages different frames to be mapped to different points in the embedding space.
no code implementations • ECCV 2020 • Bingbing Zhuang, Quoc-Huy Tran
In this paper, we derive a new differential homography that can account for the scanline-varying camera poses in Rolling Shutter (RS) cameras, and demonstrate its application to carry out RS-aware image stitching and rectification at one stroke.
no code implementations • ECCV 2020 • Yuliang Zou, Pan Ji, Quoc-Huy Tran, Jia-Bin Huang, Manmohan Chandraker
Monocular visual odometry (VO) suffers severely from error accumulation during frame-to-frame pose estimation.
1 code implementation • ECCV 2020 • Lokender Tiwari, Pan Ji, Quoc-Huy Tran, Bingbing Zhuang, Saket Anand, Manmohan Chandraker
Classical monocular Simultaneous Localization And Mapping (SLAM) and the recently emerging convolutional neural networks (CNNs) for monocular depth prediction represent two largely disjoint approaches towards building a 3D map of the surrounding environment.
no code implementations • 11 Apr 2020 • Sanjay Haresh, Sateesh Kumar, M. Zeeshan Zia, Quoc-Huy Tran
We apply: (i) one-class classification loss and (ii) reconstruction-based loss, for anomaly detection on RetroTrucks as well as on existing static-camera datasets.
no code implementations • 30 Jul 2019 • Bingbing Zhuang, Quoc-Huy Tran, Pan Ji, Gim Hee Lee, Loong Fah Cheong, Manmohan Chandraker
Self-calibration of camera intrinsics and radial distortion has a long history of research in the computer vision community.
no code implementations • ECCV 2018 • Mohammed E. Fathy, Quoc-Huy Tran, M. Zeeshan Zia, Paul Vernaza, Manmohan Chandraker
Further, we propose to use activation maps at different layers of a CNN, as an effective and principled replacement for the multi-resolution image pyramids often used for matching tasks.
no code implementations • 8 Jan 2018 • Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker
In this work, we explore an approach for injecting prior domain structure into neural network training by supervising hidden layers of a CNN with intermediate concepts that normally are not observed in practice.
no code implementations • CVPR 2017 • Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker
Monocular 3D object parsing is highly desirable in various scenarios including occlusion reasoning and holistic scene interpretation.
no code implementations • CVPR 2016 • Vikas Dhiman, Quoc-Huy Tran, Jason J. Corso, Manmohan Chandraker
We present a physically interpretable, continuous 3D model for handling occlusions with applications to road scene understanding.