no code implementations • 20 Jan 2020 • Moshiur R. Farazi, Salman H. Khan, Nick Barnes
However, modelling the visual and semantic features in a high dimensional (joint embedding) space is computationally expensive, and more complex models often result in trivial improvements in the VQA accuracy.
no code implementations • 9 Dec 2019 • Taimur Hassan, Salman H. Khan, Samet Akcay, Mohammed Bennamoun, Naoufel Werghi
In the last two decades, luggage scanning has globally become one of the prime aviation security concerns.
1 code implementation • NeurIPS 2019 • Jathushan Rajasegaran, Munawar Hayat, Salman H. Khan, Fahad Shahbaz Khan, Ling Shao
In order to maintain an equilibrium between previous and newly acquired knowledge, we propose a simple controller to dynamically balance the model plasticity.
Ranked #7 on Continual Learning on F-CelebA (10 tasks)
no code implementations • 9 Aug 2019 • Moshiur R. Farazi, Salman H. Khan, Nick Barnes
Visual Question Answering (VQA) models employ attention mechanisms to discover image locations that are most relevant for answering a specific question.
no code implementations • CVPR 2019 • Salman H. Khan, Yulan Guo, Munawar Hayat, Nick Barnes
Using the primitive parts for shapes as attributes, a parameterized 3D representation is modeled in the first stage.
2 code implementations • NeurIPS 2019 • Muzammal Naseer, Salman H. Khan, Harris Khan, Fahad Shahbaz Khan, Fatih Porikli
To this end, we propose a framework capable of launching highly transferable attacks that crafts adversarial patterns to mislead networks trained on wholly different domains.
1 code implementation • 7 Jan 2019 • Aamir Mustafa, Salman H. Khan, Munawar Hayat, Jianbing Shen, Ling Shao
The proposed scheme is simple and has the following advantages: (1) it does not require any model training or parameter optimization, (2) it complements other existing defense mechanisms, (3) it is agnostic to the attacked model and attack type and (4) it provides superior performance across all popular attack algorithms.
no code implementations • 30 Nov 2018 • Moshiur R. Farazi, Salman H. Khan, Nick Barnes
To evaluate our model, we propose a new split for VQA, separating Unknown visual and semantic concepts from the training set.
1 code implementation • 22 Nov 2018 • Muzammal Naseer, Salman H. Khan, Shafin Rahman, Fatih Porikli
Deep neural networks (DNNs) can be easily fooled by adding human imperceptible perturbations to the images.
5 code implementations • 3 Jul 2018 • Muzammal Naseer, Salman H. Khan, Fatih Porikli
Deep neural networks (DNNs) have shown vulnerability to adversarial attacks, i. e., carefully perturbed inputs designed to mislead the network at inference time.
no code implementations • 11 May 2018 • Moshiur R. Farazi, Salman H. Khan
Existing attention mechanisms either attend to local image grid or object level features for Visual Question Answering (VQA).
1 code implementation • 27 Apr 2018 • Salman H. Khan, Munawar Hayat, Nick Barnes
Our model simultaneously learns to match the data, reconstruction loss and the latent distributions of real and fake images to improve the quality of generated samples.
no code implementations • 9 Mar 2018 • Muzammal Naseer, Salman H. Khan, Fatih Porikli
With the availability of low-cost and compact 2. 5/3D visual sensing devices, computer vision community is experiencing a growing interest in visual scene understanding of indoor environments.
no code implementations • ICCV 2017 • Salman H. Khan, Munawar Hayat, Fatih Porikli
To the best of our knowledge, this is the first attempt to use deep learning based spectral features explicitly for image classification task.
no code implementations • CVPR 2017 • Munawar Hayat, Salman H. Khan, Naoufel Werghi, Roland Goecke
We validate the proposed scheme on template based unconstrained face identification.
no code implementations • 27 Jun 2017 • Shafin Rahman, Salman H. Khan, Fatih Porikli
Then, it learns how to combine these directions to obtain the principal direction for each unseen class such that the CAPD of the test image is aligned with the semantic embedding of the true class, and opposite to the other classes.
no code implementations • 7 Jun 2016 • Salman H. Khan, Xuming He, Fatih Porikli, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
We apply a constrained mean-field algorithm to estimate the pixel-level labels, and use the estimated labels to update the parameters of the CNN in an iterative EM framework.
no code implementations • ICCV 2015 • Senjian An, Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel
The contractive constraints ensure that the achieved separating margin in the input space is larger than or equal to the separating margin in the output layer.
no code implementations • 14 Aug 2015 • Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
Class imbalance is a common problem in the case of real-world object detection and classification tasks.
no code implementations • 18 Jun 2015 • Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Senjian An
This paper introduces a new learnable feature descriptor called "spatial layout and scale invariant convolutional activations" to deal with these challenges.
no code implementations • 17 Jun 2015 • Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel
To this end, we introduce a new large-scale dataset of 1300 object categories which are commonly present in indoor scenes.
no code implementations • CVPR 2015 • Salman H. Khan, Xuming He, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
Objects' spatial layout estimation and clutter identification are two important tasks to understand indoor scenes.