no code implementations • 18 Nov 2024 • Seonghyeon Moon, Haein Kong, Muhammad Haris Khan, Yuewei Lin
Therefore, having strong prior information for the target object using the support set is essential for guiding the initial training of FSS, which leads to the success of few-shot segmentation in challenging cases, such as when the target object shows considerable variation in appearance, texture, or scale across the support and query images.
1 code implementation • 4 Nov 2024 • Sharon Chokuwa, Muhammad Haris Khan
This approach ensures that our DG model remains robust against early susceptibility to label noise, even when only a limited dataset of non-DR fundus images is available for pretraining.
1 code implementation • 27 Oct 2024 • Yu Liu, Arif Mahmood, Muhammad Haris Khan
To this end, this paper presents NT-VOT211, a new benchmark tailored for evaluating visual object tracking algorithms in the challenging night-time conditions.
1 code implementation • 27 Oct 2024 • Yu Liu, Arif Mahmood, Muhammad Haris Khan
RGB video object tracking is a fundamental task in computer vision.
1 code implementation • 21 Oct 2024 • Xilin He, Jingyu Hu, Qinliang Lin, Cheng Luo, Weicheng Xie, Siyang Song, Muhammad Haris Khan, Linlin Shen
Given the theoretical justification of models' biased learning behavior on different spatial frequency components, which is based on the dataset frequency properties, we argue that the learning behavior on various frequency components could be manipulated by changing the dataset statistical structure in the Fourier domain.
no code implementations • 13 Oct 2024 • Xilin He, Cheng Luo, Xiaole Xian, Bing Li, Siyang Song, Muhammad Haris Khan, Weicheng Xie, Linlin Shen, ZongYuan Ge
Facial expression datasets remain limited in scale due to privacy concerns, the subjectivity of annotations, and the labor-intensive nature of data collection.
no code implementations • 27 Sep 2024 • Salma Hassan, Hamad Al Hammadi, Ibrahim Mohammed, Muhammad Haris Khan
The early detection and nuanced subtype classification of non-small cell lung cancer (NSCLC), a predominant cause of cancer mortality worldwide, is a critical and complex issue.
no code implementations • 16 Sep 2024 • Artem Lykov, Miguel Altamirano Cabrera, Mikhail Konenkov, Valerii Serpiva, Koffivi Fid`ele Gbagbe, Ali Alabbas, Aleksey Fedoseev, Luis Moreno, Muhammad Haris Khan, Ziang Guo, Dzmitry Tsetserukou
This paper presents the concept of Industry 6. 0, introducing the world's first fully automated production system that autonomously handles the entire product design and manufacturing process based on user-provided natural language descriptions.
1 code implementation • 11 Sep 2024 • Sanoojan Baliah, Qinliang Lin, Shengcai Liao, Xiaodan Liang, Muhammad Haris Khan
Unlike prior works reliant on multiple off-the-shelf models, ours is a relatively unified approach and so it is resilient to errors in other off-the-shelf models.
1 code implementation • 4 Sep 2024 • Chamuditha Jayanaga Galappaththige, Zachary Izzo, Xilin He, Honglu Zhou, Muhammad Haris Khan
In search of this endeavor, we study the challenging problem of semi-supervised domain generalization (SSDG), where the goal is to learn a domain-generalizable model while using only a small fraction of labeled data and a relatively large fraction of unlabeled data.
no code implementations • 16 Aug 2024 • Eman Ali, Sathira Silva, Muhammad Haris Khan
Finally, it addresses visual-textual misalignment by aligning textual prototypes with image prototypes to further improve the adaptation performance.
no code implementations • 14 Aug 2024 • Muhammad Saad Saeed, Shah Nawaz, Muhammad Zaigham Zaheer, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf, Hassan Sajjad, Tom De Schepper, Markus Schedl
Multimodal networks have demonstrated remarkable performance improvements over their unimodal counterparts.
no code implementations • 1 Aug 2024 • Malitha Gunawardhana, Limalka Sadith, Liel David, Daniel Harari, Muhammad Haris Khan
The exploration of video content via Self-Supervised Learning (SSL) models has unveiled a dynamic field of study, emphasizing both the complex challenges and unique opportunities inherent in this area.
1 code implementation • 18 Jul 2024 • Ans Munir, Faisal Z. Qureshi, Muhammad Haris Khan, Mohsen Ali
We are exploring Open World Compositional Zero-Shot Learning (OW-CZSL) in this study, where our test space encompasses all potential combinations of attributes and objects.
no code implementations • 5 Jul 2024 • Seonghyeon Moon, Qingze, Liu, Haein Kong, Muhammad Haris Khan
However, to our knowledge, no method has been developed that can determine the success of segmentation refinement.
1 code implementation • CVPR 2024 • Muhammad Sohail Danish, Muhammad Haris Khan, Muhammad Akhtar Munir, M. Saquib Sarfraz, Mohsen Ali
The results consistently demonstrate the superiority of our approach compared to existing methods.
1 code implementation • 22 May 2024 • Muhammad Ibraheem Siddiqui, Muhammad Umer Sheikh, Hassan Abid, Muhammad Haris Khan
To address this, we propose PerSense, an end-to-end, training-free, and model-agnostic one-shot framework for Personalized instance Segmentation in dense images.
1 code implementation • 14 Apr 2024 • Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf
The Face-voice Association in Multilingual Environments (FAME) Challenge 2024 focuses on exploring face-voice association under a unique condition of multilingual scenario.
1 code implementation • CVPR 2024 • Siddharth Tourani, Ahmed Alwheibi, Arif Mahmood, Muhammad Haris Khan
Second, motivated by the ZeroShot performance, we develop a ULD algorithm based on diffusion features using self-training and clustering which also outperforms prior methods by notable margins.
2 code implementations • CVPR 2024 • Chamuditha Jayanga Galappaththige, Sanoojan Baliah, Malitha Gunawardhana, Muhammad Haris Khan
Existing domain generalization (DG) methods which are unable to exploit unlabeled data perform poorly compared to semi-supervised learning (SSL) methods under SSDG setting.
4 code implementations • CVPR 2024 • Kumaranage Ravindu Yasas Nagasinghe, Honglu Zhou, Malitha Gunawardhana, Martin Renqiang Min, Daniel Harari, Muhammad Haris Khan
This knowledge, sourced from training procedure plans and structured as a directed weighted graph, equips the agent to better navigate the complexities of step sequencing and its potential variations.
1 code implementation • 25 Jan 2024 • Adnan Khan, Mai A. Shaaban, Muhammad Haris Khan
A key challenge, faced by the best-performing SSL-based SSDG methods, is selecting accurate pseudo-labels under multiple domain shifts and reducing overfitting to source domains under limited labels.
no code implementations • 24 Jan 2024 • Sathira Silva, Savindu Bhashitha Wannigama, Gihan Jayatilaka, Muhammad Haris Khan, Roshan Ragel
Holistic understanding and reasoning in 3D scenes play a vital role in the success of autonomous driving systems.
no code implementations • 8 Nov 2023 • Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali
Deep learning based object detectors struggle generalizing to a new target domain bearing significant variations in object and background.
1 code implementation • NeurIPS 2023 • Muhammad Akhtar Munir, Salman Khan, Muhammad Haris Khan, Mohsen Ali, Fahad Shahbaz Khan
Third, we develop a logit mixing approach that acts as a regularizer with detection-specific losses and is also complementary to the uncertainty-guided logit modulation technique to further improve the calibration performance.
1 code implementation • 26 Oct 2023 • Chamuditha Jayanga Galappaththige, Gayal Kuruppu, Muhammad Haris Khan
Therefore, automated diabetic retinopathy classification using deep learning techniques has gained interest in the medical imaging community.
no code implementations • 26 Sep 2023 • Eman Ali, Muhammad Haris Khan
NtUA works as a key-value cache that formulates visual features and predicted pseudo-labels of the few unlabelled target samples as key-value pairs.
1 code implementation • 19 Sep 2023 • Mamona Awan, Muhammad Haris Khan, Sanoojan Baliah, Muhammad Ahmad Waseem, Salman Khan, Fahad Shahbaz Khan, Arif Mahmood
In the current work, we introduce a consistency-guided bottleneck in an image reconstruction-based pipeline that leverages landmark consistency, a measure of compatibility score with the pseudo-ground truth to generate adaptive heatmaps.
no code implementations • 6 Sep 2023 • Vinith Kugathasan, Muhammad Haris Khan
It is based on the observation that a model miscalibration is directly related to its predictive certainty, so a higher gap between the mean confidence and certainty amounts to a poor calibration both for in-distribution and out-of-distribution predictions.
1 code implementation • 27 Aug 2023 • Sanoojan Baliah, Fadillah A. Maani, Santosh Sanjeev, Muhammad Haris Khan
In this study, we investigate CLIP's transfer learning capabilities and its potential for cross-domain generalization in diabetic retinopathy (DR) classification.
1 code implementation • 18 Jul 2023 • Siddharth Tourani, Carsten Rother, Muhammad Haris Khan, Bogdan Savchynskyy
We contribute to the sparsely populated area of unsupervised deep graph matching with application to keypoint matching in images.
2 code implementations • CVPR 2023 • Bimsara Pathiraja, Malitha Gunawardhana, Muhammad Haris Khan
Surprisingly, very little to no attempts have been made in studying the calibration of object detection methods, which occupy a pivotal space in vision-based security-sensitive, and safety-critical applications.
1 code implementation • CVPR 2023 • Muhammad Akhtar Munir, Muhammad Haris Khan, Salman Khan, Fahad Shahbaz Khan
Since the original formulation of our loss depends on the counts of true positives and false positives in a minibatch, we develop a differentiable proxy of our loss that can be used during training with other application-specific loss functions.
1 code implementation • 10 Mar 2023 • Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Muhammad Zaigham Zaheer, Karthik Nandakumar, Muhammad Haroon Yousaf, Arif Mahmood
With the rapid growth of social media platforms, users are sharing billions of multimedia posts containing audio, images, and text.
1 code implementation • ICCV 2023 • Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia
To extract information relevant to the target class, a dominant approach in best-performing FSS methods removes background features using a support mask.
Ranked #3 on Few-Shot Semantic Segmentation on FSS-1000 (1-shot)
no code implementations • 15 Sep 2022 • Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali
To this end, we first propose a new, plug-and-play, train-time calibration loss for object detection (coined as TCD).
1 code implementation • 22 Aug 2022 • Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue
In addition, we leverage cross-modal verification and matching tasks to analyze the impact of multiple languages on face-voice association.
2 code implementations • 25 Jul 2022 • Maryam Sultana, Muzammal Naseer, Muhammad Haris Khan, Salman Khan, Fahad Shahbaz Khan
Similar to CNNs, ViTs also struggle in out-of-distribution scenarios and the main culprit is overfitting to source domains.
1 code implementation • 24 Mar 2022 • Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, Fahad Shahbaz Khan
When using the ResNet50 backbone, our MS-STS achieves a mask AP of 50. 1 %, outperforming the best reported results in literature by 2. 7 % and by 4. 8 % at higher overlap threshold of AP_75, while being comparable in model size and speed on Youtube-VIS 2019 val.
1 code implementation • 24 Mar 2022 • Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia
A fundamental limitation of FM is the inability to preserve the fine-grained spatial details that affect the accuracy of segmentation mask, especially for small target objects.
Ranked #5 on Few-Shot Semantic Segmentation on FSS-1000 (1-shot)
no code implementations • CVPR 2022 • Muhammad Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segu, Fisher Yu, Seung-Ik Lee
Video anomaly detection is well investigated in weakly-supervised and one-class classification (OCC) settings.
1 code implementation • 24 Jan 2022 • Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, Huazhu Fu
Following unprecedented success on the natural language tasks, Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results and prompting researchers to reconsider the supremacy of convolutional neural networks (CNNs) as {de facto} operators.
2 code implementations • 20 Dec 2021 • Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, Alessio Del Bue
Prior works adopt pairwise or triplet loss formulations to learn an embedding space amenable for associated matching and verification tasks.
no code implementations • 6 Dec 2021 • Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, Jiri Matas
Accurate and robust visual object tracking is one of the most challenging and fundamental computer vision problems.
no code implementations • NeurIPS 2021 • Muhammad Akhtar Munir, Muhammad Haris Khan, M. Sarfraz, Mohsen Ali
In this paper, we propose to leverage model’s predictive uncertainty to strike the right balance between adversarial feature alignment and class-level alignment.
no code implementations • 1 Oct 2021 • Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali
In this paper, we propose to leverage model predictive uncertainty to strike the right balance between adversarial feature alignment and class-level alignment.
no code implementations • 26 Apr 2021 • Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, Fahad Shahbaz Khan
Human learning benefits from multi-modal inputs that often appear as rich semantics (e. g., description of an object's attributes while learning about it).
Ranked #1 on Few-Shot Image Classification on Oxford 102 Flower (using extra training data)
no code implementations • ICCV 2019 • Tiancai Wang, Rao Muhammad Anwer, Muhammad Haris Khan, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao, Jorma Laaksonen
Our approach outperforms the state-of-the-art on all datasets.
1 code implementation • ICCV 2019 • Yanwei Pang, Jin Xie, Muhammad Haris Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Ling Shao
Our approach obtains an absolute gain of 9. 5% in log-average miss rate, compared to the best reported results on the heavily occluded (HO) pedestrian set of CityPersons test set.
1 code implementation • CVPR 2020 • Muhammad Haris Khan, John McDonagh, Salman Khan, Muhammad Shahabuddin, Aditya Arora, Fahad Shahbaz Khan, Ling Shao, Georgios Tzimiropoulos
Several studies show that animal needs are often expressed through their faces.
no code implementations • 3 Nov 2018 • Themos Stafylakis, Muhammad Haris Khan, Georgios Tzimiropoulos
A further analysis on the utility of target word boundaries is provided, as well as on the capacity of the network in modeling the linguistic context of the target word.
no code implementations • ICCV 2017 • Muhammad Haris Khan, John McDonagh, Georgios Tzimiropoulos
Tracking-by-detection is drift-free but results in low accuracy fittings.
no code implementations • ICCV 2015 • Xiaomeng Wang, Michel Valstar, Brais Martinez, Muhammad Haris Khan, Tony Pridmore
This paper proposes a novel approach to part-based tracking by replacing local matching of an appearance model by direct prediction of the displacement between local image patches and part locations.