Search Results for author: Naveed Akhtar

Found 66 papers, 17 papers with code

On the Fairness, Diversity and Reliability of Text-to-Image Generative Models

no code implementations21 Nov 2024 Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

The widespread availability of multimodal generative models has sparked critical discussions on their fairness, reliability, and potential for misuse.

Diversity Ethics +1

Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density

1 code implementation5 Jul 2024 Peiyu Yang, Naveed Akhtar, Mubarak Shah, Ajmal Mian

Trustworthy machine learning necessitates meticulous regulation of model reliance on non-robust features.

Suitability of KANs for Computer Vision: A preliminary investigation

no code implementations13 Jun 2024 Basim Azam, Naveed Akhtar

Kolmogorov-Arnold Networks (KANs) introduce a paradigm of neural modeling that implements learnable functions on the edges of the networks, diverging from the traditional node-centric activations in neural networks.

Kolmogorov-Arnold Networks

From CNNs to Transformers in Multimodal Human Action Recognition: A Survey

no code implementations22 May 2024 Muhammad Bilal Shaikh, Syed Mohammed Shamsul Islam, Douglas Chai, Naveed Akhtar

We analyze the classic and emerging techniques in this regard, while also highlighting the popular trends in the adaption of CNN and Transformer building blocks for the overall problem.

Action Recognition Temporal Action Localization

Backdoor-based Explainable AI Benchmark for High Fidelity Evaluation of Attribution Methods

no code implementations2 May 2024 Peiyu Yang, Naveed Akhtar, Jiantong Jiang, Ajmal Mian

This setup is ultimately employed for a comprehensive comparison of existing methods using our BackX benchmark.

Benchmarking

Manipulating and Mitigating Generative Model Biases without Retraining

no code implementations3 Apr 2024 Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

As a by-product, this control serves as a form of precise prompt engineering to generate images which are generally implausible using regular text prompts.

Backdoor Attack Language Modelling +1

ASCNet: Asymmetric Sampling Correction Network for Infrared Image Destriping

1 code implementation28 Jan 2024 Shuai Yuan, Hanlin Qin, Xiang Yan, Shiqi Yang, Shuowen Yang, Naveed Akhtar

In a real-world infrared imaging system, effectively learning a consistent stripe noise removal model is essential.

Feature Upsampling Image Reconstruction

SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target Detection

1 code implementation28 Jan 2024 Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed Akhtar, Ajmal Mian

In the proposed SCTBs, the outputs of all encoders are interacted with cross transformer to generate mixed features, which are redistributed to all decoders to effectively reinforce semantic differences between the target and clutter at full scales.

Quantifying Bias in Text-to-Image Generative Models

no code implementations20 Dec 2023 Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

Bias in text-to-image (T2I) models can propagate unfair social representations and may be used to aggressively market ideas or push controversial agendas.

Hallucination Marketing

On quantifying and improving realism of images generated with diffusion

no code implementations26 Sep 2023 Yunzhuo Chen, Naveed Akhtar, Nur Al Hasan Haldar, Ajmal Mian

Recent advances in diffusion models have led to a quantum leap in the quality of generative visual content.

Attribute Benchmarking

Text-image guided Diffusion Model for generating Deepfake celebrity interactions

no code implementations26 Sep 2023 Yunzhuo Chen, Nur Al Hasan Haldar, Naveed Akhtar, Ajmal Mian

To curb their exploitation for Deepfakes, it is imperative to first explore the extent to which diffusion models can be used to generate realistic content that is controllable with convenient prompts.

Face Swapping

PRAT: PRofiling Adversarial aTtacks

no code implementations20 Sep 2023 Rahul Ambati, Naveed Akhtar, Ajmal Mian, Yogesh Singh Rawat

Inspired by this, we introduce a novel problem of PRofiling Adversarial aTtacks (PRAT).

Adversarial Attack

MAiVAR-T: Multimodal Audio-image and Video Action Recognizer using Transformers

no code implementations1 Aug 2023 Muhammad Bilal Shaikh, Douglas Chai, Syed Mohammed Shamsul Islam, Naveed Akhtar

This model employs an intuitive approach for the combination of audio-image and video modalities, with a primary aim to escalate the effectiveness of multimodal human action recognition (MHAR).

Action Recognition Temporal Action Localization

BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models

1 code implementation31 Jul 2023 Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

Based on the penetration level, BAGM takes the form of a suite of attacks that are referred to as surface, shallow and deep attacks in this article.

Backdoor Attack Language Modelling +2

A Comprehensive Overview of Large Language Models

1 code implementation12 Jul 2023 Humza Naveed, Asad Ullah Khan, Shi Qiu, Muhammad Saqib, Saeed Anwar, Muhammad Usman, Naveed Akhtar, Nick Barnes, Ajmal Mian

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond.

Benchmarking

Towards credible visual model interpretation with path attribution

no code implementations23 May 2023 Naveed Akhtar, Muhammad A. A. K. Jalwana

We also devise a scheme to preclude the conditions in which visual model interpretation can invalidate the axiomatic properties of path attribution.

Rethinking interpretation: Input-agnostic saliency mapping of deep visual classifiers

no code implementations31 Mar 2023 Naveed Akhtar, Mohammad A. A. K. Jalwana

Addressing the gap, we introduce a new perspective of input-agnostic saliency mapping that computationally estimates the high-level features attributed by the model to its outputs.

A Survey of Explainable AI in Deep Visual Modeling: Methods and Metrics

no code implementations31 Jan 2023 Naveed Akhtar

Deep visual models have widespread applications in high-stake domains.

Pre-text Representation Transfer for Deep Learning with Limited Imbalanced Data : Application to CT-based COVID-19 Detection

no code implementations21 Jan 2023 Fouzia Altaf, Syed M. S. Islam, Naeem K. Janjua, Naveed Akhtar

We fuse the output of this layer with the predictions of a model induced with the traditional transfer learning performed over our pre-text transferred model.

Transfer Learning

Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps

no code implementations21 Jan 2023 Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Michael Wise, Ajmal Mian

We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data.

Autonomous Vehicles Outdoor Localization +1

Query Efficient Cross-Dataset Transferable Black-Box Attack on Action Recognition

no code implementations23 Nov 2022 Rohit Gupta, Naveed Akhtar, Gaurav Kumar Nayak, Ajmal Mian, Mubarak Shah

By using a nearly disjoint dataset to train the substitute model, our method removes the requirement that the substitute model be trained using the same dataset as the target model, and leverages queries to the target model to retain the fooling rate benefits provided by query-based methods.

Action Recognition

Efficient Diffusion Models for Vision: A Survey

no code implementations7 Oct 2022 Anwaar Ulhaq, Naveed Akhtar

In this review, we present the most recent advances in diffusion models for vision, specifically focusing on the important design aspects that affect the computational efficiency of DMs.

Computational Efficiency Survey

Vision Transformers for Action Recognition: A Survey

no code implementations13 Sep 2022 Anwaar Ulhaq, Naveed Akhtar, Ganna Pogrebna, Ajmal Mian

Finally, it provides a discussion on the challenges, outlook, and future avenues for this research direction.

Action Recognition Dimensionality Reduction +2

Contrastive Self-Supervised Learning Leads to Higher Adversarial Susceptibility

no code implementations22 Jul 2022 Rohit Gupta, Naveed Akhtar, Ajmal Mian, Mubarak Shah

We establish that this is a result of the presence of false negative pairs in the training process, which increases model sensitivity to input perturbations.

Adversarial Robustness Self-Supervised Learning +1

A Survey of Neural Trojan Attacks and Defenses in Deep Learning

no code implementations15 Feb 2022 Jie Wang, Ghulam Mubashar Hassan, Naveed Akhtar

It provides a comprehensible gateway to the broader community to understand the recent developments in Neural Trojans.

Deep Learning

Mesh Convolution with Continuous Filters for 3D Surface Parsing

2 code implementations3 Dec 2021 Huan Lei, Naveed Akhtar, Mubarak Shah, Ajmal Mian

In this paper, we propose a series of modular operations for effective geometric feature learning from 3D triangle meshes.

Scene Parsing Scene Segmentation

Resetting the baseline: CT-based COVID-19 diagnosis with Deep Transfer Learning is not as accurate as widely thought

no code implementations12 Aug 2021 Fouzia Altaf, Syed M. S. Islam, Naveed Akhtar

We hope that our reproducible investigation will help in curbing hype-driven claims for the critical problem of COVID-19 diagnosis, and pave the way for a more transparent performance evaluation of techniques for CT-based COVID-19 detection.

Computed Tomography (CT) COVID-19 Diagnosis +1

Advances in adversarial attacks and defenses in computer vision: A survey

no code implementations1 Aug 2021 Naveed Akhtar, Ajmal Mian, Navid Kardan, Mubarak Shah

In [2], we reviewed the contributions made by the computer vision community in adversarial attacks on deep learning (and their defenses) until the advent of year 2018.

Controlled Caption Generation for Images Through Adversarial Attacks

no code implementations7 Jul 2021 Nayyer Aafaq, Naveed Akhtar, Wei Liu, Mubarak Shah, Ajmal Mian

In contrast, we propose a GAN-based algorithm for crafting adversarial examples for neural image captioning that mimics the internal representation of the CNN such that the resulting deep features of the input image enable a controlled incorrect caption generation through the recurrent network.

Caption Generation Image Captioning +1

Attack to Fool and Explain Deep Networks

no code implementations20 Jun 2021 Naveed Akhtar, Muhammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian

Exploring this phenomenon further, we alter the `adversarial' objective of our attack to use it as a tool to `explain' deep visual representation.

Adversarial Attack Image Manipulation

CAMERAS: Enhanced Resolution And Sanity preserving Class Activation Mapping for image saliency

1 code implementation CVPR 2021 Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.

Picasso: A CUDA-based Library for Deep Learning over 3D Meshes

2 code implementations CVPR 2021 Huan Lei, Naveed Akhtar, Ajmal Mian

We present Picasso, a CUDA-based library comprising novel modules for deep learning over complex real-world 3D meshes.

Boosting Deep Transfer Learning for COVID-19 Classification

no code implementations16 Feb 2021 Fouzia Altaf, Syed M. S. Islam, Naeem K. Janjua, Naveed Akhtar

COVID-19 classification using chest Computed Tomography (CT) has been found pragmatically useful by several studies.

Classification Computed Tomography (CT) +4

Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking

3 code implementations ECCV 2020 Shi-Jie Sun, Naveed Akhtar, Xiang-Yu Song, HuanSheng Song, Ajmal Mian, Mubarak Shah

Deep learning-based Multiple Object Tracking (MOT) currently relies on off-the-shelf detectors for tracking-by-detection. This results in deep models that are detector biased and evaluations that are detector influenced.

Multiple Object Tracking Object

Orthogonal Deep Models As Defense Against Black-Box Attacks

no code implementations26 Jun 2020 Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

On the other, deep learning has also been found vulnerable to adversarial attacks, which calls for new techniques to defend deep models against these attacks.

Adversarial Perturbations Prevail in the Y-Channel of the YCbCr Color Space

1 code implementation25 Feb 2020 Camilo Pestana, Naveed Akhtar, Wei Liu, David Glance, Ajmal Mian

Our results show that our approach achieves the best balance between defence against adversarial attacks such as FGSM, PGD and DDN and maintaining the original accuracies of VGG-16, ResNet50 and DenseNet121 on clean images.

Empirical Autopsy of Deep Video Captioning Frameworks

no code implementations21 Nov 2019 Nayyer Aafaq, Naveed Akhtar, Wei Liu, Ajmal Mian

We perform extensive experiments by varying the constituent components of the video captioning framework, and quantify the performance gains that are possible by mere component selection.

Decoder Language Modelling +2

Adversarial Attack on Skeleton-based Human Action Recognition

no code implementations14 Sep 2019 Jian Liu, Naveed Akhtar, Ajmal Mian

We also explore the possibility of semantically imperceptible localized attacks with CIASA, and succeed in fooling the state-of-the-art skeleton action recognition models with high confidence.

Action Recognition Adversarial Attack +2

Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video

no code implementations1 Jun 2019 Jian Liu, Naveed Akhtar, Ajmal Mian

A major challenge in this regard is the lack of appropriately annotated video data for learning the desired deep models.

Label Universal Targeted Attack

no code implementations27 May 2019 Naveed Akhtar, Mohammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian

We introduce Label Universal Targeted Attack (LUTA) that makes a deep model predict a label of attacker's choice for `any' sample of a given source class with high probability.

Octree guided CNN with Spherical Kernels for 3D Point Clouds

no code implementations CVPR 2019 Huan Lei, Naveed Akhtar, Ajmal Mian

We propose an octree guided neural network architecture and spherical convolutional kernel for machine learning from arbitrary 3D point clouds.

3D Object Classification 3D Part Segmentation +1

Going Deep in Medical Image Analysis: Concepts, Methods, Challenges and Future Directions

no code implementations15 Feb 2019 Fouzia Altaf, Syed M. S. Islam, Naveed Akhtar, Naeem K. Janjua

Unique to this study is the Computer Vision/Machine Learning perspective taken on the advances of Deep Learning in Medical Imaging.

Anatomy BIG-bench Machine Learning +2

Deep Affinity Network for Multiple Object Tracking

1 code implementation28 Oct 2018 Shi-Jie Sun, Naveed Akhtar, HuanSheng Song, Ajmal Mian, Mubarak Shah

In this paper, we harness the power of deep learning for data association in tracking by jointly modelling object appearances and their affinities between different frames in an end-to-end fashion.

Benchmarking Multiple Object Tracking +3

Spherical Convolutional Neural Network for 3D Point Clouds

no code implementations21 May 2018 Huan Lei, Naveed Akhtar, Ajmal Mian

We propose a neural network for 3D point cloud processing that exploits `spherical' convolution kernels and octree partitioning of space.

3D Object Classification General Classification +1

Benchmark data and method for real-time people counting in cluttered scenes using depth sensors

1 code implementation12 Apr 2018 Shi-Jie Sun, Naveed Akhtar, HuanSheng Song, Chaoyang Zhang, Jian-Xin Li, Ajmal Mian

A thorough evaluation on PCDS demonstrates that our technique is able to count people in cluttered scenes with high accuracy at 45 fps on a 1. 7 GHz processor, and hence it can be deployed for effective real-time people counting for intelligent transportation systems.

Benchmarking

Hyperspectral recovery from RGB images using Gaussian Processes

no code implementations15 Jan 2018 Naveed Akhtar, Ajmal Mian

We propose to recover spectral details from RGB images of known spectral quantization by modeling natural spectra under Gaussian Processes and combining them with the RGB images.

Gaussian Processes Quantization

Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey

3 code implementations2 Jan 2018 Naveed Akhtar, Ajmal Mian

This article presents the first comprehensive survey on adversarial attacks on deep learning in Computer Vision.

Deep Learning Self-Driving Cars

Defense against Universal Adversarial Perturbations

no code implementations CVPR 2018 Naveed Akhtar, Jian Liu, Ajmal Mian

A rigorous evaluation shows that our framework can defend the network classifiers against unseen adversarial perturbations in the real-world scenarios with up to 97. 5% success rate.

Skepxels: Spatio-temporal Image Representation of Human Skeleton Joints for Action Recognition

no code implementations16 Nov 2017 Jian Liu, Naveed Akhtar, Ajmal Mian

The proposed action recognition exploits the representation in a hierarchical manner by first capturing the micro-temporal relations between the skeleton joints with the Skepxels and then exploiting their macro-temporal relations by computing the Fourier Temporal Pyramids over the CNN features of the skeletal images.

Action Analysis Action Recognition +1

Viewpoint Invariant Action Recognition using RGB-D Videos

no code implementations15 Sep 2017 Jian Liu, Naveed Akhtar, Ajmal Mian

The proposed technique capitalizes on the spatio-temporal information available in the two data streams to the extract action features that are largely insensitive to the viewpoint variations.

Action Recognition Temporal Action Localization +1

Learning Human Pose Models from Synthesized Data for Robust RGB-D Action Recognition

no code implementations4 Jul 2017 Jian Liu, Naveed Akhtar, Ajmal Mian

We propose Human Pose Models that represent RGB and depth images of human poses independent of clothing textures, backgrounds, lighting conditions, body shapes and camera viewpoints.

Action Recognition Skeleton Based Action Recognition +1

Joint Discriminative Bayesian Dictionary and Classifier Learning

no code implementations CVPR 2017 Naveed Akhtar, Ajmal Mian, Fatih Porikli

To further encourage discrimination in the dictionary, our model uses separate (sets of) Bernoulli distributions to represent data from different classes.

Action Recognition Temporal Action Localization

Bayesian Sparse Representation for Hyperspectral Image Super Resolution

no code implementations CVPR 2015 Naveed Akhtar, Faisal Shafait, Ajmal Mian

We propose a hyperspectral image super resolution approach that fuses a high resolution image with the low resolution hyperspectral image using non-parametric Bayesian sparse representation.

Hyperspectral Image Super-Resolution Image Super-Resolution

Discriminative Bayesian Dictionary Learning for Classification

no code implementations27 Mar 2015 Naveed Akhtar, Faisal Shafait, Ajmal Mian

We propose a Bayesian approach to learn discriminative dictionaries for sparse representation of data.

Action Recognition Classification +3

Cannot find the paper you are looking for? You can Submit a new open access paper.