no code implementations • 3 Dec 2024 • Khawar Islam, Muhammad Zaigham Zaheer, Arif Mahmood, Karthik Nandakumar, Naveed Akhtar
Data augmentation is widely used to enhance generalization in visual classification tasks.
no code implementations • 21 Nov 2024 • Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian
We identify trade-offs between safety and censorship, which presents a necessary perspective in the development of ethical AI models.
no code implementations • 21 Nov 2024 • Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian
The widespread availability of multimodal generative models has sparked critical discussions on their fairness, reliability, and potential for misuse.
no code implementations • 1 Oct 2024 • Zaid Ilyas, Naveed Akhtar, David Suter, Syed Zulqarnain Gilani
Unique to the proposed GLMHA is its ability to provide computational gain for both short and long input sequences.
1 code implementation • 5 Jul 2024 • Peiyu Yang, Naveed Akhtar, Mubarak Shah, Ajmal Mian
Trustworthy machine learning necessitates meticulous regulation of model reliance on non-robust features.
no code implementations • 13 Jun 2024 • Basim Azam, Naveed Akhtar
Kolmogorov-Arnold Networks (KANs) introduce a paradigm of neural modeling that implements learnable functions on the edges of the networks, diverging from the traditional node-centric activations in neural networks.
no code implementations • 22 May 2024 • Muhammad Bilal Shaikh, Syed Mohammed Shamsul Islam, Douglas Chai, Naveed Akhtar
We analyze the classic and emerging techniques in this regard, while also highlighting the popular trends in the adaption of CNN and Transformer building blocks for the overall problem.
no code implementations • 2 May 2024 • Peiyu Yang, Naveed Akhtar, Jiantong Jiang, Ajmal Mian
This setup is ultimately employed for a comprehensive comparison of existing methods using our BackX benchmark.
no code implementations • 3 Apr 2024 • Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian
As a by-product, this control serves as a form of precise prompt engineering to generate images which are generally implausible using regular text prompts.
1 code implementation • 28 Jan 2024 • Shuai Yuan, Hanlin Qin, Xiang Yan, Shiqi Yang, Shuowen Yang, Naveed Akhtar
In a real-world infrared imaging system, effectively learning a consistent stripe noise removal model is essential.
1 code implementation • 28 Jan 2024 • Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed Akhtar, Ajmal Mian
In the proposed SCTBs, the outputs of all encoders are interacted with cross transformer to generate mixed features, which are redistributed to all decoders to effectively reinforce semantic differences between the target and clutter at full scales.
no code implementations • 18 Jan 2024 • Juwita juwita, Ghulam Mubashar Hassan, Naveed Akhtar, Amitava Datta
Segmenting organs in CT scan images is a necessary process for multiple downstream medical image analysis tasks.
no code implementations • 20 Dec 2023 • Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian
Bias in text-to-image (T2I) models can propagate unfair social representations and may be used to aggressively market ideas or push controversial agendas.
no code implementations • 26 Sep 2023 • Yunzhuo Chen, Naveed Akhtar, Nur Al Hasan Haldar, Ajmal Mian
Recent advances in diffusion models have led to a quantum leap in the quality of generative visual content.
no code implementations • 26 Sep 2023 • Yunzhuo Chen, Nur Al Hasan Haldar, Naveed Akhtar, Ajmal Mian
To curb their exploitation for Deepfakes, it is imperative to first explore the extent to which diffusion models can be used to generate realistic content that is controllable with convenient prompts.
no code implementations • 20 Sep 2023 • Rahul Ambati, Naveed Akhtar, Ajmal Mian, Yogesh Singh Rawat
Inspired by this, we introduce a novel problem of PRofiling Adversarial aTtacks (PRAT).
no code implementations • 1 Aug 2023 • Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar
This model employs an intuitive approach for the combination of audio-image and video modalities, with a primary aim to escalate the effectiveness of multimodal human action recognition (MHAR).
1 code implementation • 31 Jul 2023 • Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian
Based on the penetration level, BAGM takes the form of a suite of attacks that are referred to as surface, shallow and deep attacks in this article.
1 code implementation • 12 Jul 2023 • Humza Naveed, Asad Ullah Khan, Shi Qiu, Muhammad Saqib, Saeed Anwar, Muhammad Usman, Naveed Akhtar, Nick Barnes, Ajmal Mian
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond.
no code implementations • 3 Jul 2023 • Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Ajmal Mian
The results ascertain the efficacy of our technique.
no code implementations • 23 May 2023 • Naveed Akhtar, Muhammad A. A. K. Jalwana
We also devise a scheme to preclude the conditions in which visual model interpretation can invalidate the axiomatic properties of path attribution.
no code implementations • 31 Mar 2023 • Naveed Akhtar, Mohammad A. A. K. Jalwana
Addressing the gap, we introduce a new perspective of input-agnostic saliency mapping that computationally estimates the high-level features attributed by the model to its outputs.
no code implementations • 31 Jan 2023 • Naveed Akhtar
Deep visual models have widespread applications in high-stake domains.
no code implementations • 21 Jan 2023 • Fouzia Altaf, Syed M. S. Islam, Naeem K. Janjua, Naveed Akhtar
We fuse the output of this layer with the predictions of a model induced with the traditional transfer learning performed over our pre-text transferred model.
no code implementations • 21 Jan 2023 • Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Michael Wise, Ajmal Mian
We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data.
1 code implementation • CVPR 2023 • Zechuan Li, Hongshan Yu, Zhengeng Yang, Tongjia Chen, Naveed Akhtar
In this work, we propose AShapeFormer, a semantics-guided object-level shape encoding module for 3D object detection.
no code implementations • 23 Nov 2022 • Rohit Gupta, Naveed Akhtar, Gaurav Kumar Nayak, Ajmal Mian, Mubarak Shah
By using a nearly disjoint dataset to train the substitute model, our method removes the requirement that the substitute model be trained using the same dataset as the target model, and leverages queries to the target model to retain the fooling rate benefits provided by query-based methods.
2 code implementations • 21 Nov 2022 • Congliang Li, ShiJie Sun, XiangYu Song, HuanSheng Song, Naveed Akhtar, Ajmal Saeed Mian
Using the labeling method, we provide the KITTI-6DoF dataset with $\sim7. 5$K annotated frames.
no code implementations • 7 Oct 2022 • Anwaar Ulhaq, Naveed Akhtar
In this review, we present the most recent advances in diffusion models for vision, specifically focusing on the important design aspects that affect the computational efficiency of DMs.
no code implementations • 13 Sep 2022 • Anwaar Ulhaq, Naveed Akhtar, Ganna Pogrebna, Ajmal Mian
Finally, it provides a discussion on the challenges, outlook, and future avenues for this research direction.
no code implementations • IEEE International Conference on Visual Communications and Image Processing (VCIP) 2023 • Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar
Currently, action recognition is predominately performed on video data as processed by CNNs.
no code implementations • 22 Jul 2022 • Rohit Gupta, Naveed Akhtar, Ajmal Mian, Mubarak Shah
We establish that this is a result of the presence of false negative pairs in the training process, which increases model sensitivity to input perturbations.
1 code implementation • CVPR 2022 • Zhao Jin, Yinjie Lei, Naveed Akhtar, Haifeng Li, Munawar Hayat
With that, we develop a large-scale synthetic scene flow dataset GTA-SF.
no code implementations • 15 Feb 2022 • Jie Wang, Ghulam Mubashar Hassan, Naveed Akhtar
It provides a comprehensible gateway to the broader community to understand the recent developments in Neural Trojans.
2 code implementations • 3 Dec 2021 • Huan Lei, Naveed Akhtar, Mubarak Shah, Ajmal Mian
In this paper, we propose a series of modular operations for effective geometric feature learning from 3D triangle meshes.
no code implementations • 12 Aug 2021 • Fouzia Altaf, Syed M. S. Islam, Naveed Akhtar
We hope that our reproducible investigation will help in curbing hype-driven claims for the critical problem of COVID-19 diagnosis, and pave the way for a more transparent performance evaluation of techniques for CT-based COVID-19 detection.
no code implementations • 1 Aug 2021 • Naveed Akhtar, Ajmal Mian, Navid Kardan, Mubarak Shah
In [2], we reviewed the contributions made by the computer vision community in adversarial attacks on deep learning (and their defenses) until the advent of year 2018.
no code implementations • 7 Jul 2021 • Nayyer Aafaq, Naveed Akhtar, Wei Liu, Mubarak Shah, Ajmal Mian
In contrast, we propose a GAN-based algorithm for crafting adversarial examples for neural image captioning that mimics the internal representation of the CNN such that the resulting deep features of the input image enable a controlled incorrect caption generation through the recurrent network.
no code implementations • 20 Jun 2021 • Naveed Akhtar, Muhammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian
Exploring this phenomenon further, we alter the `adversarial' objective of our attack to use it as a tool to `explain' deep visual representation.
1 code implementation • CVPR 2021 • Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian
Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.
2 code implementations • CVPR 2021 • Huan Lei, Naveed Akhtar, Ajmal Mian
We present Picasso, a CUDA-based library comprising novel modules for deep learning over complex real-world 3D meshes.
no code implementations • 16 Feb 2021 • Fouzia Altaf, Syed M. S. Islam, Naeem K. Janjua, Naveed Akhtar
COVID-19 classification using chest Computed Tomography (CT) has been found pragmatically useful by several studies.
3 code implementations • ECCV 2020 • Shi-Jie Sun, Naveed Akhtar, Xiang-Yu Song, HuanSheng Song, Ajmal Mian, Mubarak Shah
Deep learning-based Multiple Object Tracking (MOT) currently relies on off-the-shelf detectors for tracking-by-detection. This results in deep models that are detector biased and evaluations that are detector influenced.
no code implementations • 26 Jun 2020 • Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian
On the other, deep learning has also been found vulnerable to adversarial attacks, which calls for new techniques to defend deep models against these attacks.
1 code implementation • 25 Feb 2020 • Camilo Pestana, Naveed Akhtar, Wei Liu, David Glance, Ajmal Mian
Our results show that our approach achieves the best balance between defence against adversarial attacks such as FGSM, PGD and DDN and maintaining the original accuracies of VGG-16, ResNet50 and DenseNet121 on clean images.
no code implementations • 21 Nov 2019 • Nayyer Aafaq, Naveed Akhtar, Wei Liu, Ajmal Mian
We perform extensive experiments by varying the constituent components of the video captioning framework, and quantify the performance gains that are possible by mere component selection.
3 code implementations • 20 Sep 2019 • Huan Lei, Naveed Akhtar, Ajmal Mian
We propose a spherical kernel for efficient graph convolution of 3D point clouds.
Ranked #5 on 3D Object Classification on ModelNet40
no code implementations • 14 Sep 2019 • Jian Liu, Naveed Akhtar, Ajmal Mian
We also explore the possibility of semantically imperceptible localized attacks with CIASA, and succeed in fooling the state-of-the-art skeleton action recognition models with high confidence.
no code implementations • 1 Jun 2019 • Jian Liu, Naveed Akhtar, Ajmal Mian
A major challenge in this regard is the lack of appropriately annotated video data for learning the desired deep models.
no code implementations • 27 May 2019 • Naveed Akhtar, Mohammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian
We introduce Label Universal Targeted Attack (LUTA) that makes a deep model predict a label of attacker's choice for `any' sample of a given source class with high probability.
no code implementations • CVPR 2019 • Huan Lei, Naveed Akhtar, Ajmal Mian
We propose an octree guided neural network architecture and spherical convolutional kernel for machine learning from arbitrary 3D point clouds.
Ranked #11 on 3D Part Segmentation on ShapeNet-Part
no code implementations • CVPR 2019 • Nayyer Aafaq, Naveed Akhtar, Wei Liu, Syed Zulqarnain Gilani, Ajmal Mian
The final representation is projected to a compact space and fed to a language model.
no code implementations • 15 Feb 2019 • Fouzia Altaf, Syed M. S. Islam, Naveed Akhtar, Naeem K. Janjua
Unique to this study is the Computer Vision/Machine Learning perspective taken on the advances of Deep Learning in Medical Imaging.
1 code implementation • 28 Oct 2018 • Shi-Jie Sun, Naveed Akhtar, HuanSheng Song, Ajmal Mian, Mubarak Shah
In this paper, we harness the power of deep learning for data association in tracking by jointly modelling object appearances and their affinities between different frames in an end-to-end fashion.
no code implementations • 21 May 2018 • Huan Lei, Naveed Akhtar, Ajmal Mian
We propose a neural network for 3D point cloud processing that exploits `spherical' convolution kernels and octree partitioning of space.
1 code implementation • 12 Apr 2018 • Shi-Jie Sun, Naveed Akhtar, HuanSheng Song, Chaoyang Zhang, Jian-Xin Li, Ajmal Mian
A thorough evaluation on PCDS demonstrates that our technique is able to count people in cluttered scenes with high accuracy at 45 fps on a 1. 7 GHz processor, and hence it can be deployed for effective real-time people counting for intelligent transportation systems.
no code implementations • 15 Jan 2018 • Naveed Akhtar, Ajmal Mian
We propose to recover spectral details from RGB images of known spectral quantization by modeling natural spectra under Gaussian Processes and combining them with the RGB images.
3 code implementations • 2 Jan 2018 • Naveed Akhtar, Ajmal Mian
This article presents the first comprehensive survey on adversarial attacks on deep learning in Computer Vision.
no code implementations • CVPR 2018 • Naveed Akhtar, Jian Liu, Ajmal Mian
A rigorous evaluation shows that our framework can defend the network classifiers against unseen adversarial perturbations in the real-world scenarios with up to 97. 5% success rate.
no code implementations • 16 Nov 2017 • Jian Liu, Naveed Akhtar, Ajmal Mian
The proposed action recognition exploits the representation in a hierarchical manner by first capturing the micro-temporal relations between the skeleton joints with the Skepxels and then exploiting their macro-temporal relations by computing the Fourier Temporal Pyramids over the CNN features of the skeletal images.
no code implementations • 15 Sep 2017 • Jian Liu, Naveed Akhtar, Ajmal Mian
The proposed technique capitalizes on the spatio-temporal information available in the two data streams to the extract action features that are largely insensitive to the viewpoint variations.
no code implementations • 4 Jul 2017 • Jian Liu, Naveed Akhtar, Ajmal Mian
We propose Human Pose Models that represent RGB and depth images of human poses independent of clothing textures, backgrounds, lighting conditions, body shapes and camera viewpoints.
Ranked #100 on Skeleton Based Action Recognition on NTU RGB+D
no code implementations • CVPR 2017 • Naveed Akhtar, Ajmal Mian, Fatih Porikli
To further encourage discrimination in the dictionary, our model uses separate (sets of) Bernoulli distributions to represent data from different classes.
no code implementations • 29 Nov 2015 • Naveed Akhtar, Faisal Shafait, Ajmal Mian
Many classification approaches first represent a test sample using the training samples of all the classes.
no code implementations • CVPR 2015 • Naveed Akhtar, Faisal Shafait, Ajmal Mian
We propose a hyperspectral image super resolution approach that fuses a high resolution image with the low resolution hyperspectral image using non-parametric Bayesian sparse representation.
no code implementations • 27 Mar 2015 • Naveed Akhtar, Faisal Shafait, Ajmal Mian
We propose a Bayesian approach to learn discriminative dictionaries for sparse representation of data.