Search Results for author: Ajmal Mian

Found 109 papers, 29 papers with code

Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data

no code implementations10 Apr 2024 Aakash Kumar, Chen Chen, Ajmal Mian, Neils Lobo, Mubarak Shah

Our method requires only a small number of 3D points, that can be obtained from a low-cost, low-resolution sensor.

3D Object Detection Autonomous Driving +1

Severity Controlled Text-to-Image Generative Model Bias Manipulation

no code implementations3 Apr 2024 Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

Charting the susceptibility of T2I models to such manipulation, we first expose the new possibility of a dynamic and computationally efficient exploitation of model bias by targeting the embedded language models.

Backdoor Attack Prompt Engineering

Towards Temporally Consistent Referring Video Object Segmentation

1 code implementation28 Mar 2024 Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Mubarak Shah, Ajmal Mian

Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining consistent object segmentation due to temporal context variability and the presence of other visually similar objects.

Ranked #3 on Referring Video Object Segmentation on Refer-YouTube-VOS (using extra training data)

Object Referring Video Object Segmentation +4

3D Object Detection from Point Cloud via Voting Step Diffusion

no code implementations21 Mar 2024 Haoran Hou, Mingtao Feng, Zijie Wu, Weisheng Dong, Qing Zhu, Yaonan Wang, Ajmal Mian

In this work, we focus on the distributional properties of point clouds and formulate the voting process as generating new points in the high-density region of the distribution of object centers.

3D Object Detection Object +2

External Knowledge Enhanced 3D Scene Generation from Sketch

no code implementations21 Mar 2024 Zijie Wu, Mingtao Feng, Yaonan Wang, He Xie, Weisheng Dong, Bo Miao, Ajmal Mian

Generating realistic 3D scenes is challenging due to the complexity of room layouts and object geometries. We propose a sketch based knowledge enhanced diffusion architecture (SEK) for generating customized, diverse, and plausible 3D scenes.

Denoising Object +1

Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling

no code implementations21 Mar 2024 Yong He, Hongshan Yu, Muhammad Ibrahim, Xiaoyan Liu, Tongjia Chen, Anwaar Ulhaq, Ajmal Mian

This strategy allows various transformer blocks to share the same position information over the same resolution points, thereby reducing network parameters and training time without compromising accuracy. Experimental comparisons with existing methods on multiple datasets demonstrate the efficacy of SMTransformer and skip-attention-based up-sampling for point cloud processing tasks, including semantic segmentation and classification.

Position Segmentation +1

Beyond Skeletons: Integrative Latent Mapping for Coherent 4D Sequence Generation

no code implementations20 Mar 2024 Qitong Yang, Mingtao Feng, Zijie Wu, ShiJie Sun, Weisheng Dong, Yaonan Wang, Ajmal Mian

To address this, we propose a novel framework that generates coherent 4D sequences with animation of 3D shapes under given conditions with dynamic evolution of shape and color over time through integrative latent mapping.

SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target Detection

no code implementations28 Jan 2024 Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed Akhtar, Ajmal Mian

In the proposed SCTBs, the outputs of all encoders are interacted with cross transformer to generate mixed features, which are redistributed to all decoders to effectively reinforce semantic differences between the target and clutter at full scales.

Quantifying Bias in Text-to-Image Generative Models

no code implementations20 Dec 2023 Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

Bias in text-to-image (T2I) models can propagate unfair social representations and may be used to aggressively market ideas or push controversial agendas.

Hallucination Marketing

Text-image guided Diffusion Model for generating Deepfake celebrity interactions

no code implementations26 Sep 2023 Yunzhuo Chen, Nur Al Hasan Haldar, Naveed Akhtar, Ajmal Mian

To curb their exploitation for Deepfakes, it is imperative to first explore the extent to which diffusion models can be used to generate realistic content that is controllable with convenient prompts.

Face Swapping

On quantifying and improving realism of images generated with diffusion

no code implementations26 Sep 2023 Yunzhuo Chen, Naveed Akhtar, Nur Al Hasan Haldar, Ajmal Mian

Recent advances in diffusion models have led to a quantum leap in the quality of generative visual content.

Attribute Benchmarking

PRAT: PRofiling Adversarial aTtacks

no code implementations20 Sep 2023 Rahul Ambati, Naveed Akhtar, Ajmal Mian, Yogesh Singh Rawat

Inspired by this, we introduce a novel problem of PRofiling Adversarial aTtacks (PRAT).

Adversarial Attack

Dual Student Networks for Data-Free Model Stealing

no code implementations18 Sep 2023 James Beetham, Navid Kardan, Ajmal Mian, Mubarak Shah

To this end, the two main challenges are estimating gradients of the target model without access to its parameters, and generating a diverse set of training samples that thoroughly explores the input space.

Quantum-Inspired Machine Learning: a Survey

no code implementations22 Aug 2023 Larry Huynh, Jin Hong, Ajmal Mian, Hajime Suzuki, Yanqiu Wu, Seyit Camtepe

Quantum-inspired Machine Learning (QiML) is a burgeoning field, receiving global attention from researchers for its potential to leverage principles of quantum mechanics within classical computational frameworks.

Quantum Machine Learning

Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation

no code implementations ICCV 2023 Zijie Wu, Yaonan Wang, Mingtao Feng, He Xie, Ajmal Mian

In this paper, we propose a sketch and text guided probabilistic diffusion model for colored point cloud generation that conditions the denoising process jointly with a hand drawn sketch of the object and its textual description.

Denoising Image Generation +1

BAGM: A Backdoor Attack for Manipulating Text-to-Image Generative Models

1 code implementation31 Jul 2023 Jordan Vice, Naveed Akhtar, Richard Hartley, Ajmal Mian

Based on the penetration level, BAGM takes the form of a suite of attacks that are referred to as surface, shallow and deep attacks in this article.

Backdoor Attack Language Modelling +2

Spectrum-guided Multi-granularity Referring Video Object Segmentation

1 code implementation ICCV 2023 Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

To address the drift problem, we propose a Spectrum-guided Multi-granularity (SgMg) approach, which performs direct segmentation on the encoded features and employs visual details to further optimize the masks.

 Ranked #1 on Referring Expression Segmentation on J-HMDB (using extra training data)

Object Referring Expression Segmentation +4

A Comprehensive Overview of Large Language Models

1 code implementation12 Jul 2023 Humza Naveed, Asad Ullah Khan, Shi Qiu, Muhammad Saqib, Saeed Anwar, Muhammad Usman, Naveed Akhtar, Nick Barnes, Ajmal Mian

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond.

Benchmarking

Human Gesture and Gait Analysis for Autism Detection

no code implementations17 Apr 2023 Sania Zahan, Zulqarnain Gilani, Ghulam Mubashar Hassan, Ajmal Mian

Autism diagnosis presents a major challenge due to the vast heterogeneity of the condition and the elusive nature of early detection.

Full Point Encoding for Local Feature Aggregation in 3D Point Clouds

no code implementations8 Mar 2023 Yong He, Hongshan Yu, Zhengeng Yang, Xiaoyan Liu, Wei Sun, Ajmal Mian

In particular, we achieve state-of-the-art semantic segmentation results of 76% mIoU on S3DIS 6-fold and 72. 2% on S3DIS Area5.

object-detection Object Detection +2

Q-Cogni: An Integrated Causal Reinforcement Learning Framework

no code implementations26 Feb 2023 Cris Cunha, Wei Liu, Tim French, Ajmal Mian

We present Q-Cogni, an algorithmically integrated causal reinforcement learning framework that redesigns Q-Learning with an autonomous causal structure discovery method to improve the learning process with causal inference.

Causal Inference Decision Making +3

Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps

no code implementations21 Jan 2023 Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, Michael Wise, Ajmal Mian

We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data.

Autonomous Vehicles Outdoor Localization +1

Learning Sparse Temporal Video Mapping for Action Quality Assessment in Floor Gymnastics

no code implementations15 Jan 2023 Sania Zahan, Ghulam Mubashar Hassan, Ajmal Mian

Athlete performance measurement in sports videos requires modeling long sequences since the entire spatio-temporal progression contributes dominantly to the performance.

Action Quality Assessment

3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud

no code implementations CVPR 2023 Mingtao Feng, Haoran Hou, Liang Zhang, Zijie Wu, Yulan Guo, Ajmal Mian

In-depth understanding of a 3D scene not only involves locating/recognizing individual objects, but also requires to infer the relationships and interactions among them.

Fast Parallel Exact Inference on Bayesian Networks: Poster

1 code implementation8 Dec 2022 Jiantong Jiang, Zeyi Wen, Atif Mansoor, Ajmal Mian

Bayesian networks (BNs) are attractive, because they are graphical and interpretable machine learning models.

Interpretable Machine Learning

Fast Parallel Bayesian Network Structure Learning

1 code implementation8 Dec 2022 Jiantong Jiang, Zeyi Wen, Ajmal Mian

The mainstream BN structure learning methods require performing a large number of conditional independence (CI) tests.

Query Efficient Cross-Dataset Transferable Black-Box Attack on Action Recognition

no code implementations23 Nov 2022 Rohit Gupta, Naveed Akhtar, Gaurav Kumar Nayak, Ajmal Mian, Mubarak Shah

By using a nearly disjoint dataset to train the substitute model, our method removes the requirement that the substitute model be trained using the same dataset as the target model, and leverages queries to the target model to retain the fooling rate benefits provided by query-based methods.

Action Recognition

3DMODT: Attention-Guided Affinities for Joint Detection & Tracking in 3D Point Clouds

no code implementations1 Nov 2022 Jyoti Kini, Ajmal Mian, Mubarak Shah

We propose a method for joint detection and tracking of multiple objects in 3D point clouds, a task conventionally treated as a two-step process comprising object detection followed by data association.

object-detection Object Detection +1

Vision Transformers for Action Recognition: A Survey

no code implementations13 Sep 2022 Anwaar Ulhaq, Naveed Akhtar, Ganna Pogrebna, Ajmal Mian

Finally, it provides a discussion on the challenges, outlook, and future avenues for this research direction.

Action Recognition Dimensionality Reduction +1

Domain-invariant Prototypes for Semantic Segmentation

no code implementations12 Aug 2022 Zhengeng Yang, Hongshan Yu, Wei Sun, Li-Cheng, Ajmal Mian

In this paper, we present an easy-to-train framework that learns domain-invariant prototypes for domain adaptive semantic segmentation.

Domain Adaptation Few-Shot Learning +2

Contrastive Self-Supervised Learning Leads to Higher Adversarial Susceptibility

no code implementations22 Jul 2022 Rohit Gupta, Naveed Akhtar, Ajmal Mian, Mubarak Shah

We establish that this is a result of the presence of false negative pairs in the training process, which increases model sensitivity to input perturbations.

Adversarial Robustness Self-Supervised Learning +1

Region Aware Video Object Segmentation with Deep Motion Modeling

no code implementations21 Jul 2022 Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

Current semi-supervised video object segmentation (VOS) methods usually leverage the entire features of one frame to predict object masks and update memory.

Object Segmentation +3

Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection

1 code implementation28 Apr 2022 Mingtao Feng, Kendong Liu, Liang Zhang, Hongshan Yu, Yaonan Wang, Ajmal Mian

Saliency detection with light field images is becoming attractive given the abundant cues available, however, this comes at the expense of large-scale pixel level annotated data which is expensive to generate.

Saliency Prediction

Visual Attention Methods in Deep Learning: An In-Depth Survey

no code implementations16 Apr 2022 Mohammed Hassanin, Saeed Anwar, Ibrahim Radwan, Fahad S Khan, Ajmal Mian

Inspired by the human cognitive system, attention is a mechanism that imitates the human cognitive awareness about specific information, amplifying critical details to focus more on the essential aspects of data.

Deep Attention

Bi-CLKT: Bi-Graph Contrastive Learning based Knowledge Tracing

no code implementations22 Jan 2022 XiangYu Song, JianXin Li, Qi Lei, Wei Zhao, Yunliang Chen, Ajmal Mian

The goal of Knowledge Tracing (KT) is to estimate how well students have mastered a concept based on their historical learning of related exercises.

Contrastive Learning Knowledge Tracing +1

Learning From Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection

1 code implementation CVPR 2022 Mingtao Feng, Kendong Liu, Liang Zhang, Hongshan Yu, Yaonan Wang, Ajmal Mian

Saliency detection with light field images is becoming attractive given the abundant cues available, however, this comes at the expense of large-scale pixel level annotated data which is expensive to generate.

Saliency Prediction

Mesh Convolution with Continuous Filters for 3D Surface Parsing

2 code implementations3 Dec 2021 Huan Lei, Naveed Akhtar, Mubarak Shah, Ajmal Mian

In this paper, we propose a series of modular operations for effective geometric feature learning from 3D triangle meshes.

Scene Parsing Scene Segmentation

Object-to-Scene: Learning to Transfer Object Knowledge to Indoor Scene Recognition

1 code implementation1 Aug 2021 Bo Miao, Liguang Zhou, Ajmal Mian, Tin Lun Lam, Yangsheng Xu

The final results in this work show that OTS successfully extracts object features and learns object relations from the segmentation network.

Object Scene Recognition

Advances in adversarial attacks and defenses in computer vision: A survey

no code implementations1 Aug 2021 Naveed Akhtar, Ajmal Mian, Navid Kardan, Mubarak Shah

In [2], we reviewed the contributions made by the computer vision community in adversarial attacks on deep learning (and their defenses) until the advent of year 2018.

Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation

1 code implementation27 Jul 2021 Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

We propose a self-supervised spatio-temporal matching method, coined Motion-Aware Mask Propagation (MAMP), for video object segmentation.

Segmentation Semantic Segmentation +2

Controlled Caption Generation for Images Through Adversarial Attacks

no code implementations7 Jul 2021 Nayyer Aafaq, Naveed Akhtar, Wei Liu, Mubarak Shah, Ajmal Mian

In contrast, we propose a GAN-based algorithm for crafting adversarial examples for neural image captioning that mimics the internal representation of the CNN such that the resulting deep features of the input image enable a controlled incorrect caption generation through the recurrent network.

Caption Generation Image Captioning +1

Attack to Fool and Explain Deep Networks

no code implementations20 Jun 2021 Naveed Akhtar, Muhammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian

Exploring this phenomenon further, we alter the `adversarial' objective of our attack to use it as a tool to `explain' deep visual representation.

Adversarial Attack Image Generation +1

CAMERAS: Enhanced Resolution And Sanity preserving Class Activation Mapping for image saliency

1 code implementation CVPR 2021 Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.

Survey: Image Mixing and Deleting for Data Augmentation

1 code implementation13 Jun 2021 Humza Naveed, Saeed Anwar, Munawar Hayat, Kashif Javed, Ajmal Mian

One such method is augmentation which introduces different types of corruption in the data to prevent the model from overfitting and to memorize patterns present in the data.

Image Augmentation Image Classification +2

Physical world assistive signals for deep neural network classifiers -- neither defense nor attack

no code implementations3 May 2021 Camilo Pestana, Wei Liu, David Glance, Robyn Owens, Ajmal Mian

We discuss how we can exploit these insights to re-think, or avoid, some patterns that might contribute to, or degrade, the detectability of objects in the real-world.

Free-form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud

1 code implementation ICCV 2021 Mingtao Feng, Zhen Li, Qi Li, Liang Zhang, Xiangdong Zhang, Guangming Zhu, HUI ZHANG, Yaonan Wang, Ajmal Mian

There are three main challenges in 3D object grounding: to find the main focus in the complex and diverse description; to understand the point cloud scene; and to locate the target object.

Object

Picasso: A CUDA-based Library for Deep Learning over 3D Meshes

2 code implementations CVPR 2021 Huan Lei, Naveed Akhtar, Ajmal Mian

We present Picasso, a CUDA-based library comprising novel modules for deep learning over complex real-world 3D meshes.

LSDAT: Low-Rank and Sparse Decomposition for Decision-based Adversarial Attack

no code implementations19 Mar 2021 Ashkan Esmaeili, Marzieh Edraki, Nazanin Rahnavard, Mubarak Shah, Ajmal Mian

It is set forth that the proposed sparse perturbation is the most aligned sparse perturbation with the shortest path from the input sample to the decision boundary for some initial adversarial sample (the best sparse approximation of shortest path, likely to fool the model).

Adversarial Attack Computational Efficiency +1

Deep Learning Based 3D Segmentation: A Survey

no code implementations9 Mar 2021 Yong He, Hongshan Yu, Xiaoyan Liu, Zhengeng Yang, Wei Sun, Ajmal Mian

This paper fills the gap and provides a comprehensive survey of the recent progress made in deep learning based 3D segmentation.

Autonomous Driving Point Cloud Segmentation +2

Self-supervised Learning with Fully Convolutional Networks

no code implementations18 Dec 2020 Zhengeng Yang, Hongshan Yu, Yong He, Zhi-Hong Mao, Ajmal Mian

By learning to solve a Jigsaw Puzzle problem with 25 patches and transferring the learned features to semantic segmentation task on Cityscapes dataset, we achieve a 5. 8 percentage point improvement over the baseline model that initialized from random values.

Segmentation Self-Supervised Learning +1

Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty

1 code implementation5 Nov 2020 Camilo Pestana, Wei Liu, David Glance, Ajmal Mian

We propose three metrics to determine the proportion of robust images in a dataset and provide scoring to determine the dataset bias.

Adversarial Attack Benchmarking

Image Colorization: A Survey and Dataset

1 code implementation25 Aug 2020 Saeed Anwar, Muhammad Tahir, Chongyi Li, Ajmal Mian, Fahad Shahbaz Khan, Abdul Wahab Muzaffar

Image colorization is the process of estimating RGB colors for grayscale images or video frames to improve their aesthetic and perceptual quality.

Benchmarking Colorization +1

Fast ORB-SLAM without Keypoint Descriptors

no code implementations22 Aug 2020 Qiang Fu, Hongshan Yu, Xiaolong Wang, Zhengeng Yang, Hong Zhang, Ajmal Mian

ORB-SLAM2 \cite{orbslam2} is a benchmark method in this domain, however, it consumes significant time for computing descriptors that never get reused unless a frame is selected as a keyframe.

Robotics Computational Geometry I.4.0; I.4.9

Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking

3 code implementations ECCV 2020 Shi-Jie Sun, Naveed Akhtar, Xiang-Yu Song, HuanSheng Song, Ajmal Mian, Mubarak Shah

Deep learning-based Multiple Object Tracking (MOT) currently relies on off-the-shelf detectors for tracking-by-detection. This results in deep models that are detector biased and evaluations that are detector influenced.

Multiple Object Tracking Object

Odyssey: Creation, Analysis and Detection of Trojan Models

1 code implementation16 Jul 2020 Marzieh Edraki, Nazmul Karim, Nazanin Rahnavard, Ajmal Mian, Mubarak Shah

We propose a detector that is based on the analysis of the intrinsic DNN properties; that are affected due to the Trojaning process.

Data Poisoning

Orthogonal Deep Models As Defense Against Black-Box Attacks

no code implementations26 Jun 2020 Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

On the other, deep learning has also been found vulnerable to adversarial attacks, which calls for new techniques to defend deep models against these attacks.

Minimum Potential Energy of Point Cloud for Robust Global Registration

no code implementations11 Jun 2020 Zijie Wu, Yaonan Wang, Qing Zhu, Jianxu Mao, Haotian Wu, Mingtao Feng, Ajmal Mian

Different from the most existing point set registration methods which usually extract the descriptors to find correspondences between point sets, our proposed MPE alignment method is able to handle large scale raw data offset without depending on traditional descriptors extraction, whether for the local or global registration methods.

Adversarial Perturbations Prevail in the Y-Channel of the YCbCr Color Space

1 code implementation25 Feb 2020 Camilo Pestana, Naveed Akhtar, Wei Liu, David Glance, Ajmal Mian

Our results show that our approach achieves the best balance between defence against adversarial attacks such as FGSM, PGD and DDN and maintaining the original accuracies of VGG-16, ResNet50 and DenseNet121 on clean images.

Relation Graph Network for 3D Object Detection in Point Clouds

no code implementations30 Nov 2019 Mingtao Feng, Syed Zulqarnain Gilani, Yaonan Wang, Liang Zhang, Ajmal Mian

Convolutional Neural Networks (CNNs) have emerged as a powerful strategy for most object detection tasks on 2D images.

3D Object Detection Object +3

Empirical Autopsy of Deep Video Captioning Frameworks

no code implementations21 Nov 2019 Nayyer Aafaq, Naveed Akhtar, Wei Liu, Ajmal Mian

We perform extensive experiments by varying the constituent components of the video captioning framework, and quantify the performance gains that are possible by mere component selection.

Language Modelling Video Captioning +1

Point Attention Network for Semantic Segmentation of 3D Point Clouds

no code implementations27 Sep 2019 Mingtao Feng, Liang Zhang, Xuefei Lin, Syed Zulqarnain Gilani, Ajmal Mian

We propose a point attention network that learns rich local shape features and their contextual correlations for 3D point cloud semantic segmentation.

Point Cloud Segmentation Semantic Segmentation

Adversarial Attack on Skeleton-based Human Action Recognition

no code implementations14 Sep 2019 Jian Liu, Naveed Akhtar, Ajmal Mian

We also explore the possibility of semantically imperceptible localized attacks with CIASA, and succeed in fooling the state-of-the-art skeleton action recognition models with high confidence.

Action Recognition Adversarial Attack +2

Temporally Coherent Full 3D Mesh Human Pose Recovery from Monocular Video

no code implementations1 Jun 2019 Jian Liu, Naveed Akhtar, Ajmal Mian

A major challenge in this regard is the lack of appropriately annotated video data for learning the desired deep models.

Label Universal Targeted Attack

1 code implementation27 May 2019 Naveed Akhtar, Mohammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian

We introduce Label Universal Targeted Attack (LUTA) that makes a deep model predict a label of attacker's choice for `any' sample of a given source class with high probability.

Converting a Common Document Scanner to a Multispectral Scanner

no code implementations17 Apr 2019 Zohaib Khan, Faisal Shafait, Ajmal Mian

We propose the construction of a prototype scanner designed to capture multispectral images of documents.

Multidimensional ground reaction forces and moments from wearable sensor accelerations via deep learning

no code implementations18 Mar 2019 William R. Johnson, Ajmal Mian, Mark A. Robinson, Jasper Verheul, David G. Lloyd, Jacqueline A. Alderson

Competing convolutional neural network (CNN) deep learning models were trained using laboratory-derived stance phase GRF/M data and simulated sensor accelerations for running and sidestepping maneuvers derived from nearly half a million legacy motion trials.

Octree guided CNN with Spherical Kernels for 3D Point Clouds

no code implementations CVPR 2019 Huan Lei, Naveed Akhtar, Ajmal Mian

We propose an octree guided neural network architecture and spherical convolutional kernel for machine learning from arbitrary 3D point clouds.

3D Object Classification 3D Part Segmentation +1

Deep Affinity Network for Multiple Object Tracking

1 code implementation28 Oct 2018 Shi-Jie Sun, Naveed Akhtar, HuanSheng Song, Ajmal Mian, Mubarak Shah

In this paper, we harness the power of deep learning for data association in tracking by jointly modelling object appearances and their affinities between different frames in an end-to-end fashion.

Benchmarking Multiple Object Tracking +3

On-field player workload exposure and knee injury risk monitoring via deep learning

no code implementations21 Sep 2018 William R. Johnson, Ajmal Mian, David G. Lloyd, Jacqueline A. Alderson

In sports analytics, an understanding of accurate on-field 3D knee joint moments (KJM) could provide an early warning system for athlete workload exposure and knee injury risk.

Sports Analytics

Unsupervised Deep Multi-focus Image Fusion

no code implementations19 Jun 2018 Xiang Yan, Syed Zulqarnain Gilani, Hanlin Qin, Ajmal Mian

Convolutional neural networks have recently been used for multi-focus image fusion.

SSIM

Video Description: A Survey of Methods, Datasets and Evaluation Metrics

no code implementations1 Jun 2018 Nayyer Aafaq, Ajmal Mian, Wei Liu, Syed Zulqarnain Gilani, Mubarak Shah

Video description is the automatic generation of natural language sentences that describe the contents of a given video.

Language Modelling Video Description

Spherical Convolutional Neural Network for 3D Point Clouds

no code implementations21 May 2018 Huan Lei, Naveed Akhtar, Ajmal Mian

We propose a neural network for 3D point cloud processing that exploits `spherical' convolution kernels and octree partitioning of space.

3D Object Classification General Classification +1

Deep Keyframe Detection in Human Action Videos

no code implementations26 Apr 2018 Xiang Yan, Syed Zulqarnain Gilani, Hanlin Qin, Mingtao Feng, Liang Zhang, Ajmal Mian

Detecting representative frames in videos based on human actions is quite challenging because of the combined factors of human pose in action and the background.

Benchmark data and method for real-time people counting in cluttered scenes using depth sensors

1 code implementation12 Apr 2018 Shi-Jie Sun, Naveed Akhtar, HuanSheng Song, Chaoyang Zhang, Jian-Xin Li, Ajmal Mian

A thorough evaluation on PCDS demonstrates that our technique is able to count people in cluttered scenes with high accuracy at 45 fps on a 1. 7 GHz processor, and hence it can be deployed for effective real-time people counting for intelligent transportation systems.

Benchmarking

Hyperspectral recovery from RGB images using Gaussian Processes

no code implementations15 Jan 2018 Naveed Akhtar, Ajmal Mian

We propose to recover spectral details from RGB images of known spectral quantization by modeling natural spectra under Gaussian Processes and combining them with the RGB images.

Gaussian Processes Quantization

Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey

3 code implementations2 Jan 2018 Naveed Akhtar, Ajmal Mian

This article presents the first comprehensive survey on adversarial attacks on deep learning in Computer Vision.

Self-Driving Cars

Defense against Universal Adversarial Perturbations

no code implementations CVPR 2018 Naveed Akhtar, Jian Liu, Ajmal Mian

A rigorous evaluation shows that our framework can defend the network classifiers against unseen adversarial perturbations in the real-world scenarios with up to 97. 5% success rate.

Learning from Millions of 3D Scans for Large-scale 3D Face Recognition

1 code implementation CVPR 2018 Syed Zulqarnain Gilani, Ajmal Mian

Unlike 2D photographs, 3D facial scans cannot be sourced from the web causing a bottleneck in the development of deep 3D face recognition networks and datasets.

Face Recognition

Skepxels: Spatio-temporal Image Representation of Human Skeleton Joints for Action Recognition

no code implementations16 Nov 2017 Jian Liu, Naveed Akhtar, Ajmal Mian

The proposed action recognition exploits the representation in a hierarchical manner by first capturing the micro-temporal relations between the skeleton joints with the Skepxels and then exploiting their macro-temporal relations by computing the Fourier Temporal Pyramids over the CNN features of the skeletal images.

Action Analysis Action Recognition +1

Viewpoint Invariant Action Recognition using RGB-D Videos

no code implementations15 Sep 2017 Jian Liu, Naveed Akhtar, Ajmal Mian

The proposed technique capitalizes on the spatio-temporal information available in the two data streams to the extract action features that are largely insensitive to the viewpoint variations.

Action Recognition Temporal Action Localization +1

Learning Human Pose Models from Synthesized Data for Robust RGB-D Action Recognition

no code implementations4 Jul 2017 Jian Liu, Naveed Akhtar, Ajmal Mian

We propose Human Pose Models that represent RGB and depth images of human poses independent of clothing textures, backgrounds, lighting conditions, body shapes and camera viewpoints.

Action Recognition Skeleton Based Action Recognition +1

Joint Discriminative Bayesian Dictionary and Classifier Learning

no code implementations CVPR 2017 Naveed Akhtar, Ajmal Mian, Fatih Porikli

To further encourage discrimination in the dictionary, our model uses separate (sets of) Bernoulli distributions to represent data from different classes.

Action Recognition Temporal Action Localization

3D Action Recognition From Novel Viewpoints

no code implementations CVPR 2016 Hossein Rahmani, Ajmal Mian

We propose a human pose representation model that transfers human poses acquired from different unknown views to a view-invariant high-level space.

3D Action Recognition Clustering

Learning a Deep Model for Human Action Recognition from Novel Viewpoints

no code implementations2 Feb 2016 Hossein Rahmani, Ajmal Mian, Mubarak Shah

The strength of our technique is that we learn a single R-NKTM for all actions and all viewpoints for knowledge transfer of any real human action video without the need for re-training or fine-tuning the model.

Action Recognition Temporal Action Localization +1

Shape-Based Automatic Detection of a Large Number of 3D Facial Landmarks

no code implementations CVPR 2015 Syed Zulqarnain Gilani, Faisal Shafait, Ajmal Mian

Our approach does not use texture and is completely shape based in order to detect landmarks that are morphologically significant.

Bayesian Sparse Representation for Hyperspectral Image Super Resolution

no code implementations CVPR 2015 Naveed Akhtar, Faisal Shafait, Ajmal Mian

We propose a hyperspectral image super resolution approach that fuses a high resolution image with the low resolution hyperspectral image using non-parametric Bayesian sparse representation.

Hyperspectral Image Super-Resolution Image Super-Resolution

Discriminative Bayesian Dictionary Learning for Classification

no code implementations27 Mar 2015 Naveed Akhtar, Faisal Shafait, Ajmal Mian

We propose a Bayesian approach to learn discriminative dictionaries for sparse representation of data.

Action Recognition Classification +3

Representation Learning with Deep Extreme Learning Machines for Efficient Image Set Classification

no code implementations9 Mar 2015 Muhammad Uzair, Faisal Shafait, Bernard Ghanem, Ajmal Mian

Efficient and accurate joint representation of a collection of images, that belong to the same class, is a major research challenge for practical image set classification.

Classification General Classification +1

Dense 3D Face Correspondence

no code implementations19 Oct 2014 Syed Zulqarnain Gilani, Ajmal Mian, Faisal Shafait, Ian Reid

A deformable model (K3DM) is constructed from the dense corresponded faces and an algorithm is proposed for morphing the K3DM to fit unseen faces.

Face Recognition

Histogram of Oriented Principal Components for Cross-View Action Recognition

no code implementations24 Sep 2014 Hossein Rahmani, Arif Mahmood, Du Huynh, Ajmal Mian

We propose the Histogram of Oriented Principal Components (HOPC) descriptor that is robust to noise, viewpoint, scale and action speed variations.

3D Action Recognition

HOPC: Histogram of Oriented Principal Components of 3D Pointclouds for Action Recognition

no code implementations17 Aug 2014 Hossein Rahmani, Arif Mahmood, Du. Q. Huynh, Ajmal Mian

In contrast, we directly process the pointclouds and propose a new technique for action recognition which is more robust to noise, action speed and viewpoint variations.

3D Action Recognition Keypoint Detection

Optimizing Auto-correlation for Fast Target Search in Large Search Space

no code implementations14 Jul 2014 Arif Mahmood, Ajmal Mian, Robyn Owens

To this end we propose an Efficient Group Size (EGS) algorithm which minimizes the number of similarity computations for a particular search image.

Template Matching

Semi-supervised Spectral Clustering for Image Set Classification

no code implementations CVPR 2014 Arif Mahmood, Ajmal Mian, Robyn Owens

We present an image set classification algorithm based on unsupervised clustering of labeled training and unlabeled test data where labels are only used in the stopping criterion.

Classification Clustering +1

Multispectral Palmprint Encoding and Recognition

no code implementations6 Feb 2014 Zohaib Khan, Faisal Shafait, Yiqun Hu, Ajmal Mian

Comprehensive experiments for both identification and verification scenarios are performed on two public datasets -- one captured with a contact-based sensor (PolyU dataset), and the other with a contact-free sensor (CASIA dataset).

Cannot find the paper you are looking for? You can Submit a new open access paper.