no code implementations • 13 Nov 2024 • ChengYuan Zhang, Yilin Zhang, Lei Zhu, Deyin Liu, Lin Wu, Bo Li, Shichao Zhang, Mohammed Bennamoun, Farid Boussaid
This paper introduces a novel framework for unified incremental few-shot object detection (iFSOD) and instance segmentation (iFSIS) using the Transformer architecture.
1 code implementation • 27 Oct 2024 • Bo Miao, Mingtao Feng, Zijie Wu, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian
We introduce Referring Human Pose and Mask Estimation (R-HPM) in the wild, where either a text or positional prompt specifies the person of interest in an image.
no code implementations • 18 Oct 2024 • Ahmad Obeid, Said Boumaraf, Anabia Sohail, Taimur Hassan, Sajid Javed, Jorge Dias, Mohammed Bennamoun, Naoufel Werghi
Recent years witnessed remarkable progress in computational histopathology, largely fueled by deep learning.
no code implementations • 5 Oct 2024 • Maria Marrium, Arif Mahmood, Mohammed Bennamoun
Consequently, Noisy Labels Learning (NLL) has become a critical research field for Convolutional Neural Networks (CNNs), though it remains less explored for Vision Transformers (ViTs).
1 code implementation • 22 Aug 2024 • Tahmina Khanam, Hamid Laga, Mohammed Bennamoun, Guanjin Wang, Ferdous Sohel, Farid Boussaid, Guan Wang, Anuj Srivastava
In this paper, we propose a novel mathematical representation of the shape space of such trajectories, a Riemannian metric on that space, and computational tools for fast and accurate spatiotemporal registration and geodesics computation between 4D tree-shaped structures.
no code implementations • 27 Jul 2024 • Ashkan Taghipour, Morteza Ghahremani, Mohammed Bennamoun, Aref Miri Rekavandi, Zinuo Li, Hamid Laga, Farid Boussaid
This paper investigates the role of CLIP image embeddings within the Stable Video Diffusion (SVD) framework, focusing on their impact on video generation quality and computational efficiency.
1 code implementation • 6 Jul 2024 • Qi Wang, Zhou Xu, Yuming Lin, Jingtao Ye, Hongsheng Li, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun, Liang Zhang
By setting a new benchmark in the field, we challenge the current limitations of neuromorphic data processing and invite a surge of new approaches in event-based action recognition techniques, which paves the way for future explorations in neuromorphic computing and beyond.
no code implementations • 28 Jun 2024 • Uchitha Rajapaksha, Ferdous Sohel, Hamid Laga, Dean Diepeveen, Mohammed Bennamoun
Estimating depth from single RGB images and videos is of widespread interest due to its applications in many areas, including autonomous driving, 3D reconstruction, digital entertainment, and robotics.
no code implementations • 10 Jun 2024 • Nicholas J. Pritchard, Andreas Wicenec, Mohammed Bennamoun, Richard Dodson
This study underscores the potential of RFI detection as a benchmark problem for SNN researchers, emphasizing the efficacy of SNNs in addressing complex time-series segmentation tasks in radio astronomy.
1 code implementation • CVPR 2024 • Sajid Javed, Arif Mahmood, Iyyakutti Iyappan Ganapathi, Fayaz Ali Dharejo, Naoufel Werghi, Mohammed Bennamoun
This paper proposes Comprehensive Pathology Language Image Pre-training (CPLIP), a new unsupervised technique designed to enhance the alignment of images and text in histopathology for tasks such as classification and segmentation.
no code implementations • CVPR 2024 • Ning Wang, Guangming Zhu, HS Li, Liang Zhang, Syed Afaq Ali Shah, Mohammed Bennamoun
Extensive experiments on two complex video action datasets, Charades & CAD-120, validates the improved performance and interpretability of our LaIAR framework.
1 code implementation • 28 Mar 2024 • Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Mubarak Shah, Ajmal Mian
Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining consistent object segmentation due to temporal context variability and the presence of other visually similar objects.
Ranked #2 on Referring Video Object Segmentation on MeViS
no code implementations • 2 Mar 2024 • Lian Xu, Mohammed Bennamoun, Farid Boussaid, Wanli Ouyang, Ferdous Sohel, Dan Xu
We propose AuxSegNet+, a weakly supervised auxiliary learning framework to explore the rich information from these saliency maps and the significant inter-task correlation between saliency detection and semantic segmentation.
1 code implementation • 27 Feb 2024 • Ashkan Taghipour, Morteza Ghahremani, Mohammed Bennamoun, Aref Miri Rekavandi, Hamid Laga, Farid Boussaid
To address these deficiencies, we introduce the Box-it-to-Bind-it (B2B) module - a novel, training-free approach for improving spatial control and semantic accuracy in text-to-image (T2I) diffusion models.
1 code implementation • 17 Feb 2024 • Thang-Anh-Quan Nguyen, Amine Bourki, Mátyás Macudzinski, Anthony Brunel, Mohammed Bennamoun
This review thoroughly examines the role of semantically-aware Neural Radiance Fields (NeRFs) in visual scene understanding, covering an analysis of over 250 scholarly papers.
1 code implementation • 8 Jan 2024 • Huanyu Liu, JianFeng Cai, Tingjia Zhang, Hongsheng Li, Siyuan Wang, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun, Liang Zhang
Automated conversion methods are essential to overcome manual conversion challenges.
1 code implementation • 24 Nov 2023 • Nicholas J. Pritchard, Andreas Wicenec, Mohammed Bennamoun, Richard Dodson
This work demonstrates the viability of SNNs as a promising avenue for machine-learning-based RFI detection in radio telescopes by establishing a minimal performance baseline on traditional and nascent satellite-based RFI sources and is the first work to our knowledge to apply SNNs in astronomy.
Ranked #1 on Semantic Segmentation on LOFAR RFI Detection
no code implementations • 23 Nov 2023 • Do Huu Dat, Po Yuan Mao, Tien Hoang Nguyen, Wray Buntine, Mohammed Bennamoun
In our paper, we propose a novel framework that for the first time combines the Modern Hopfield Network with a Mixture of Experts (HOMOE) to classify the compositions of previously unseen objects.
1 code implementation • 10 Sep 2023 • Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed Bennamoun
Transformers have rapidly gained popularity in computer vision, especially in the field of object recognition and detection.
no code implementations • 1 Sep 2023 • Juan lu, Mohammed Bennamoun, Jonathon Stewart, JasonK. Eshraghian, Yanbin Liu, Benjamin Chow, Frank M. Sanfilippo, Girish Dwivedi
Diagnostic investigation has an important role in risk stratification and clinical decision making of patients with suspected and documented Coronary Artery Disease (CAD).
1 code implementation • 6 Aug 2023 • Lian Xu, Mohammed Bennamoun, Farid Boussaid, Hamid Laga, Wanli Ouyang, Dan Xu
Building upon the observation that the attended regions of the one-class token in the standard vision transformer can contribute to a class-agnostic localization map, we explore the potential of the transformer model to capture class-specific attention for class-discriminative object localization by learning multiple class tokens.
Object Localization Weakly supervised Semantic Segmentation +1
1 code implementation • ICCV 2023 • Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian
To address the drift problem, we propose a Spectrum-guided Multi-granularity (SgMg) approach, which performs direct segmentation on the encoded features and employs visual details to further optimize the masks.
Ranked #1 on Referring Expression Segmentation on J-HMDB (using extra training data)
no code implementations • 14 Apr 2023 • Nicholas J. Pritchard, Andreas Wicenec, Mohammed Bennamoun, Richard Dodson
In particular, spiking neural networks hold the potential to advance artificial intelligence as the basis of third-generation neural networks.
no code implementations • 9 Mar 2023 • Hao Tang, Aref Miri Rekavandi, Dharjinder Rooprai, Girish Dwivedi, Frank Sanfilippo, Farid Boussaid, Mohammed Bennamoun
This study investigates the effectiveness of Explainable Artificial Intelligence (XAI) techniques in predicting suicide risks and identifying the dominant causes for such behaviours.
no code implementations • ICCV 2023 • Zhiheng Fu, Longguang Wang, Lian Xu, Zhiyong Wang, Hamid Laga, Yulan Guo, Farid Boussaid, Mohammed Bennamoun
In this paper, we thus propose an unsupervised viewpoint representation learning scheme for 3D point cloud completion without explicit viewpoint estimation.
no code implementations • CVPR 2023 • Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Dan Xu
Weakly supervised dense object localization (WSDOL) relies generally on Class Activation Mapping (CAM), which exploits the correlation between the class weights of the image classifier and the pixel-level features.
1 code implementation • 24 Nov 2022 • Jinshuai Bai, Laith Alzubaidi, Qingxia Wang, Ellen Kuhl, Mohammed Bennamoun, Yuantong Gu
Deep learning (DL) relies heavily on data, and the quality of data influences its performance significantly.
1 code implementation • 12 Oct 2022 • Yanbin Liu, Girish Dwivedi, Farid Boussaid, Mohammed Bennamoun
Generative models such as generative adversarial networks and autoencoders have gained a great deal of attention in the medical field due to their excellent data generation capability.
1 code implementation • 17 Sep 2022 • Laurent Jospin, Allen Antony, Lian Xu, Hamid Laga, Farid Boussaid, Mohammed Bennamoun
In this paper, we propose the Active-Passive SimStereo dataset and a corresponding benchmark to evaluate the performance gap between passive and active stereo images for stereo matching algorithms.
no code implementations • 12 Sep 2022 • Laurent Valentin Jospin, Hamid Laga, Farid Boussaid, Mohammed Bennamoun
A major focus of recent developments in stereo vision has been on how to obtain accurate dense disparity maps in passive stereo vision.
no code implementations • 8 Aug 2022 • Yanbin Liu, Girish Dwivedi, Farid Boussaid, Frank Sanfilippo, Makoto Yamada, Mohammed Bennamoun
Novel 3D network architectures are proposed for both the generator and discriminator of the GAN model to significantly reduce the number of parameters while maintaining the quality of image generation.
no code implementations • 26 Jul 2022 • Aref Miri Rekavandi, Lian Xu, Farid Boussaid, Abd-Krim Seghouane, Stephen Hoefs, Mohammed Bennamoun
Small object detection (SOD) in optical images and videos is a challenging problem that even state-of-the-art generic object detection methods fail to accurately localize and identify such objects.
no code implementations • 24 Jul 2022 • Litao Yu, Jian Zhang, Mohammed Bennamoun, Xiaojun Chang, Vute Sirivivatnanon, Ali Nezhad
Concrete workability measure is mostly determined based on subjective assessment of a certified assessor with visual inspections.
no code implementations • 21 Jul 2022 • Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian
Current semi-supervised video object segmentation (VOS) methods usually leverage the entire features of one frame to predict object masks and update memory.
Ranked #16 on Semi-Supervised Video Object Segmentation on DAVIS 2017 (val) (using extra training data)
no code implementations • 9 Jul 2022 • Lin Wu, Deyin Liu, Wenying Zhang, Dapeng Chen, ZongYuan Ge, Farid Boussaid, Mohammed Bennamoun, Jialie Shen
In this paper, we present a pseudo-pair based self-similarity learning approach for unsupervised person re-ID without human annotations.
no code implementations • 9 Jul 2022 • Lin Wu, Lingqiao Liu, Yang Wang, Zheng Zhang, Farid Boussaid, Mohammed Bennamoun
It is a challenging and practical problem since the query images often suffer from resolution degradation due to the different capturing conditions from real-world cameras.
no code implementations • 9 Jul 2022 • Deyin Liu, Lin Wu, Haifeng Zhao, Farid Boussaid, Mohammed Bennamoun, Xianghua Xie
Moreover, adversarially training a defense model in general cannot produce interpretable predictions towards the inputs with perturbations, whilst a highly interpretable robust model is required by different domain experts to understand the behaviour of a DNN.
no code implementations • 13 Jun 2022 • Sami Barchid, José Mennesson, Jason Eshraghian, Chaabane Djéraba, Mohammed Bennamoun
Spiking neural networks have shown much promise as an energy-efficient alternative to artificial neural networks.
1 code implementation • 24 Mar 2022 • Mohammed Hassanin, Abdelwahed Khamiss, Mohammed Bennamoun, Farid Boussaid, Ibrahim Radwan
3D human pose estimation can be handled by encoding the geometric dependencies between the body parts and enforcing the kinematic constraints.
Ranked #29 on 3D Human Pose Estimation on Human3.6M
1 code implementation • CVPR 2022 • Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Dan Xu
To this end, we propose a Multi-class Token Transformer, termed as MCTformer, which uses multiple class tokens to learn interactions between the class tokens and the patch tokens.
no code implementations • 5 Feb 2022 • Guofeng Mei, Litao Yu, Qiang Wu, Jian Zhang, Mohammed Bennamoun
This paper proposes a general unsupervised approach, named \textbf{ConClu}, to perform the learning of point-wise and global features by jointly leveraging point-level clustering and instance-level contrasting.
no code implementations • 2 Feb 2022 • Juan lu, Rebecca Hutchens, Joseph Hung, Mohammed Bennamoun, Brendan McQuillan, Tom Briffa, Ferdous Sohel, Kevin Murray, Jonathon Stewart, Benjamin Chow, Frank Sanfilippo, Girish Dwivedi
Conclusions Multilabel ML models can outperform clinical risk stratification scores for predicting the risk of major bleeding and death in non-valvular AF patients.
no code implementations • 7 Jan 2022 • Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun
Then we use an RNN on the spatial relations to predict the spatio-temporal relations of reviewers in the group.
1 code implementation • 7 Jan 2022 • Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi
Screening cluttered and occluded contraband items from baggage X-ray scans is a cumbersome task even for the expert security staff.
no code implementations • 3 Jan 2022 • Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun
In this paper, we provide a comprehensive survey of recent achievements in this field brought about by deep learning techniques.
no code implementations • 29 Dec 2021 • Guofeng Mei, Xiaoshui Huang, Litao Yu, Jian Zhang, Mohammed Bennamoun
Generating a set of high-quality correspondences or matches is one of the most critical steps in point cloud registration.
no code implementations • 25 Dec 2021 • Isaac Ronald Ward, Ling Wang, Juan lu, Mohammed Bennamoun, Girish Dwivedi, Frank M Sanfilippo
Using XAI, we quantified the contribution that specific drugs had on these ACS predictions, thus creating an XAI-based technique for pharmacovigilance monitoring, using ACS as an example of the adverse outcome to detect.
Explainable artificial intelligence Explainable Artificial Intelligence (XAI) +2
1 code implementation • 2 Dec 2021 • Laurent Valentin Jospin, Farid Boussaid, Hamid Laga, Mohammed Bennamoun
In this paper, we show that closed form formulae for subpixel disparity computation for the case of one dimensional matching, e. g., in the case of rectified stereo images where the search space is of one dimension, exists when using the standard NCC, SSD and SAD cost functions.
no code implementations • 10 Nov 2021 • Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun
Many studies proposed approaches based on user behaviors and review text to address the challenges of fraud detection.
3 code implementations • 27 Sep 2021 • Jason K. Eshraghian, Max Ward, Emre Neftci, Xinxin Wang, Gregor Lenz, Girish Dwivedi, Mohammed Bennamoun, Doo Seok Jeong, Wei D. Lu
This paper serves as a tutorial and perspective showing how to apply the lessons learnt from several decades of research in deep learning, gradient descent, backpropagation and neuroscience to biologically plausible spiking neural neural networks.
no code implementations • 23 Aug 2021 • Nima Mirnateghi, Syed Afaq Ali Shah, Mohammed Bennamoun
In practice, the vulnerability of deep learning systems against carefully perturbed images, known as adversarial examples, poses a dire security threat in the physical world applications.
1 code implementation • 22 Aug 2021 • Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi
Furthermore, to the best of our knowledge, this is the first contour instance segmentation framework that leverages multi-scale information to recognize cluttered and concealed contraband data from the colored and grayscale security X-ray imagery.
1 code implementation • 27 Jul 2021 • Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian
We propose a self-supervised spatio-temporal matching method, coined Motion-Aware Mask Propagation (MAMP), for video object segmentation.
1 code implementation • ICCV 2021 • Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel, Dan Xu
Motivated by the significant inter-task correlation, we propose a novel weakly supervised multi-task framework termed as AuxSegNet, to leverage saliency detection and multi-label image classification as auxiliary tasks to improve the primary task of semantic segmentation using only image-level ground-truth labels.
1 code implementation • 15 Jul 2021 • Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi
Identifying potential threats concealed within the baggage is of prime concern for the security staff.
1 code implementation • 24 Jun 2021 • Johann Li, Guangming Zhu, Cong Hua, Mingtao Feng, BasheerBennamoun, Ping Li, Xiaoyuan Lu, Juan Song, Peiyi Shen, Xu Xu, Lin Mei, Liang Zhang, Syed Afaq Ali Shah, Mohammed Bennamoun
Thus, as comprehensive as possible, this paper provides a collection of medical image datasets with their associated challenges for deep learning research.
no code implementations • 20 Jun 2021 • Naveed Akhtar, Muhammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian
Exploring this phenomenon further, we alter the `adversarial' objective of our attack to use it as a tool to `explain' deep visual representation.
1 code implementation • CVPR 2021 • Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian
Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input.
1 code implementation • 2 Jun 2021 • Syed Saiq Hussain, Muhammad Usman, Taha Hasan Masood Siddique, Imran Naseem, Roberto Togneri, Mohammed Bennamoun
In this research a novel stochastic gradient descent based learning approach for the radial basis function neural networks (RBFNN) is proposed.
no code implementations • 25 May 2021 • Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun
Social reviews are indispensable resources for modern consumers' decision making.
no code implementations • 23 Jan 2021 • Hamid Laga, Marcel Padilla, Ian H. Jermyn, Sebastian Kurtek, Mohammed Bennamoun, Anuj Srivastava
With this formulation, the statistical analysis of 4D surfaces can be cast as the problem of analyzing trajectories embedded in a nonlinear Riemannian manifold.
no code implementations • 24 Dec 2020 • Delphine Poux, Benjamin Allaert, Nacim Ihaddadene, Ioan Marius Bilasco, Chaabane Djeraba, Mohammed Bennamoun
To handle occlusions, solutions based on the reconstruction of the occluded part of the face have been proposed.
Dynamic Facial Expression Recognition Facial Expression Recognition +2
1 code implementation • 24 Dec 2020 • Naeha Sharif, Lyndon White, Mohammed Bennamoun, Wei Liu, Syed Afaq Ali Shah
Automatic evaluation metrics hold a fundamental importance in the development and fine-grained analysis of captioning systems.
no code implementations • 24 Dec 2020 • Naeha Sharif, Lyndon White, Mohammed Bennamoun, Wei Liu, Syed Afaq Ali Shah
The area of automatic image caption evaluation is still undergoing intensive research to address the needs of generating captions which can meet adequacy and fluency requirements.
no code implementations • 24 Dec 2020 • Naeha Sharif, Mohammed Bennamoun, Wei Liu, Syed Afaq Ali Shah
In this work we address this common limitation of IC systems in dealing with rare words in the corpora.
no code implementations • 22 Dec 2020 • Zehua Sun, Qiuhong Ke, Hossein Rahmani, Mohammed Bennamoun, Gang Wang, Jun Liu
Human Action Recognition (HAR) aims to understand human behavior and assign a label to each action.
no code implementations • 1 Dec 2020 • Saqib Ejaz Awan, Mohammed Bennamoun, Ferdous Sohel, Frank M Sanfilippo, Girish Dwivedi
State-of-the-art imputation approaches, such as Generative Adversarial Imputation Nets (GAIN), model the distribution of observed data to approximate the missing values.
1 code implementation • 11 Oct 2020 • Isaac Ronald Ward, Jack Joyner, Casey Lickfold, Yulan Guo, Mohammed Bennamoun
Graph neural networks (GNNs) have recently grown in popularity in the field of artificial intelligence (AI) due to their unique ability to ingest relatively unstructured data types as input data.
1 code implementation • 28 Sep 2020 • Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi
Detecting baggage threats is one of the most difficult tasks, even for expert officers.
4 code implementations • 14 Jul 2020 • Laurent Valentin Jospin, Wray Buntine, Farid Boussaid, Hamid Laga, Mohammed Bennamoun
Modern deep learning methods constitute incredibly powerful tools to tackle a myriad of challenging problems.
1 code implementation • 6 Jul 2020 • Syed Muhammad Atif, Shujaat Khan, Imran Naseem, Roberto Togneri, Mohammed Bennamoun
A simple yet effective architectural design of radial basis function neural networks (RBFNN) makes them amongst the most popular conventional neural networks.
no code implementations • 26 Jun 2020 • Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian
On the other, deep learning has also been found vulnerable to adversarial attacks, which calls for new techniques to defend deep models against these attacks.
no code implementations • 11 Jun 2020 • Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun
However, the lack of trusted labeled data has limited the performance of the current solutions in detecting fraud reviews.
no code implementations • 10 Jun 2020 • Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun
In this research, instead of focusing only on one component, detecting either fraud reviews or fraud users (fraudsters), vector representations are learnt for each component, enabling multi-component classification.
no code implementations • 1 Jun 2020 • Hamid Laga, Laurent Valentin Jospin, Farid Boussaid, Mohammed Bennamoun
Motivated by their growing success in solving various 2D and 3D vision problems, deep learning for stereo-based depth estimation has attracted growing interest from the community, with more than 150 papers published in this area between 2014 and 2019.
Ranked #1 on Monocular Depth Estimation on Make3D (RMSE metric)
no code implementations • 29 May 2020 • Uzair Nadeem, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel
We use this concept to directly localize images in a 3D point cloud.
no code implementations • 14 Apr 2020 • Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi
In the last two decades, baggage scanning has globally become one of the prime aviation security concerns.
no code implementations • 30 Jan 2020 • Liang Zhang, Yufei Liu, Hang Xiao, Lu Yang, Guangming Zhu, Syed Afaq Shah, Mohammed Bennamoun, Peiyi Shen
Scene text detection has received attention for years and achieved an impressive performance across various benchmarks.
1 code implementation • 30 Jan 2020 • Liang Zhang, Xudong Wang, Hongsheng Li, Guangming Zhu, Peiyi Shen, Ping Li, Xiaoyuan Lu, Syed Afaq Ali Shah, Mohammed Bennamoun
To solve these problems mentioned above, we propose a novel graph self-adaptive pooling method with the following objectives: (1) to construct a reasonable pooled graph topology, structure and feature information of the graph are considered simultaneously, which provide additional veracity and objectivity in node selection; and (2) to make the pooled nodes contain sufficiently effective graph information, node feature information is aggregated before discarding the unimportant nodes; thus, the selected nodes contain information from neighbor nodes, which can enhance the use of features of the unselected nodes.
3 code implementations • 27 Dec 2019 • Yulan Guo, Hanyun Wang, Qingyong Hu, Hao liu, Li Liu, Mohammed Bennamoun
To stimulate future research, this paper presents a comprehensive review of recent progress in deep learning methods for point clouds.
no code implementations • 9 Dec 2019 • Taimur Hassan, Salman H. Khan, Samet Akcay, Mohammed Bennamoun, Naoufel Werghi
In the last two decades, luggage scanning has globally become one of the prime aviation security concerns.
1 code implementation • 30 Nov 2019 • Shervin Minaee, Amirali Abdolrashidi, Hang Su, Mohammed Bennamoun, David Zhang
Deep learning-based models have been very successful in achieving state-of-the-art results in many of the computer vision, speech recognition, and natural language processing tasks in the last few years.
no code implementations • IEEE Transactions on Image Processing 2019 • Qiuhong Ke, Mohammed Bennamoun, Hossein Rahmani, Senjian An, Ferdous Sohel, Farid Boussaid
Human actions represented with 3D skeleton sequences are robust to clustered backgrounds and illumination changes.
Ranked #4 on Skeleton Based Action Recognition on SYSU 3D
no code implementations • 22 Jul 2019 • Isaac Ronald Ward, Hamid Laga, Mohammed Bennamoun
Deep learning techniques, coupled with the availability of large training datasets, have now revolutionized the field of computer vision, including RGB-D object detection, achieving an unprecedented level of performance.
no code implementations • 26 Jun 2019 • Ammar Mahmood, Ana Giraldo Ospina, Mohammed Bennamoun, Senjian An, Ferdous Sohel, Farid Boussaid, Renae Hovey, Robert B. Fisher, Gary Kendrick
Across the globe, remote image data is rapidly being collected for the assessment of benthic communities from shallow to extremely deep waters on continental slopes to the abyssal seas.
no code implementations • 15 Jun 2019 • Xian-Feng Han, Hamid Laga, Mohammed Bennamoun
Given this new era of rapid evolution, this article provides a comprehensive survey of the recent developments in this field.
no code implementations • 14 Jun 2019 • Uzair Nadeem, Mohammad A. A. K. Jalwana, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel
We use this concept to localize the position and orientation (pose) of the camera of a query image in dense point clouds.
no code implementations • 27 May 2019 • Naveed Akhtar, Mohammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian
We introduce Label Universal Targeted Attack (LUTA) that makes a deep model predict a label of attacker's choice for `any' sample of a given source class with high probability.
no code implementations • 9 May 2019 • Shujaat Khan, Imran Naseem, Roberto Togneri, Mohammed Bennamoun
In this paper, we propose a novel adaptive kernel for the radial basis function (RBF) neural networks.
no code implementations • 28 Apr 2019 • Isaac Ronald Ward, M. A. Asim K. Jalwana, Mohammed Bennamoun
This work investigates the impact of the loss function on the performance of Neural Networks, in the context of a monocular, RGB-only, image localization task.
no code implementations • 25 Apr 2019 • Benjamin Allaert, Isaac Ronald Ward, Ioan Marius Bilasco, Chaabane Djeraba, Mohammed Bennamoun
Optical flow techniques are becoming increasingly performant and robust when estimating motion in a scene, but their performance has yet to be proven in the area of facial expression recognition.
no code implementations • 26 Feb 2019 • Md Moniruzzaman, S. M. Shamsul Islam, Paul Lavery, Mohammed Bennamoun, C. Peng Lam
The detection and mapping of underwater vegetation, especially seagrass has drawn the attention of the research community as early as the nineteen eighties.
1 code implementation • NeurIPS 2018 • Liang Zhang, Guangming Zhu, Lin Mei, Peiyi Shen, Syed Afaq Ali Shah, Mohammed Bennamoun
On this basis, a new variant of LSTM is derived, in which the convolutional structures are only embedded into the input-to-state transition of LSTM.
no code implementations • 25 Sep 2018 • Shujaat Khan, Imran Naseem, Roberto Togneri, Mohammed Bennamoun
In extreme cold weather, living organisms produce Antifreeze Proteins (AFPs) to counter the otherwise lethal intracellular formation of ice.
no code implementations • ECCV 2018 • Naeha Sharif, Lyndon White, Mohammed Bennamoun, Syed Afaq Ali Shah
The automatic evaluation of image descriptions is an intricate task, and it is highly important in the development and fine-grained analysis of captioning systems.
2 code implementations • 3 Aug 2018 • Lyndon White, Roberto Togneri, Wei Liu, Mohammed Bennamoun
We present DataDeps. jl: a julia package for the reproducible handling of static datasets to enhance the repeatability of scripts used in the data and computational sciences.
Software Engineering
no code implementations • ACL 2018 • Naeha Sharif, Lyndon White, Mohammed Bennamoun, Syed Afaq Ali Shah
The evaluation of image caption quality is a challenging task, which requires the assessment of two main aspects in a caption: adequacy and fluency.
1 code implementation • ACL 2018 • Lyndon White, Roberto Togneri, Wei Liu, Mohammed Bennamoun
Our tool detects the main character that each section is from the POV of, and allows the user to generate a new ebook with only those sections.
no code implementations • 26 Mar 2018 • Uzair Nadeem, Syed Afaq Ali Shah, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel
Class specific gallery subspaces are used to estimate regression models for each image of the test image set.
no code implementations • IEEE Transactions on Image Processing ( Volume: 27 , Issue: 6 , June 2018 ) 2018 • Qiuhong Ke, Mohammed Bennamoun, Senjian An, Ferdous Sohel, Farid Boussaid
This paper presents a new representation of skeleton sequences for 3D action recognition.
Ranked #67 on Skeleton Based Action Recognition on NTU RGB+D 120
no code implementations • 14 Nov 2017 • Senjian An, Farid Boussaid, Mohammed Bennamoun, Ferdous Sohel
By introducing sign constraints on the weights, this paper proposes sign constrained rectifier networks (SCRNs), whose training can be solved efficiently by the well known majorization-minimization (MM) algorithms.
no code implementations • ICCV 2017 • Hossein Rahmani, Mohammed Bennamoun
Depth sensors open up possibilities of dealing with the human action recognition problem by providing 3D human skeleton data and depth images of the scene.
1 code implementation • 27 Sep 2017 • Lyndon White, Roberto Togneri, Wei Liu, Mohammed Bennamoun
Color names are often made up of multiple words.
no code implementations • 24 Aug 2017 • Senjian An, Mohammed Bennamoun, Farid Boussaid
To show the superior compressive power of deep rectifier networks over shallow rectifier networks, we prove that the maximum boundary resolution of a single hidden layer rectifier network classifier grows exponentially with the number of units when this number is smaller than the dimension of the patterns.
no code implementations • IEEE Signal Processing Letters ( Volume: 24 , Issue: 6 , June 2017 ) 2017 • Qiuhong Ke, Senjian An, Mohammed Bennamoun, Ferdous Sohel, Farid Boussaid
Given a skeleton sequence, the spatial structure of the skeleton joints in each frame and the temporal information between multiple frames are two important factors for action recognition.
Ranked #110 on Skeleton Based Action Recognition on NTU RGB+D
no code implementations • 30 Mar 2017 • Senjian An, Farid Boussaid, Mohammed Bennamoun, Jiankun Hu
Similarly, for a residual net and a conventional rectifier net with the same structure except for the skip connections in the residual net, the corresponding single hidden layer representation of the residual net is much more complex than the corresponding single hidden layer representation of the conventional net.
no code implementations • CVPR 2017 • Qiuhong Ke, Mohammed Bennamoun, Senjian An, Ferdous Sohel, Farid Boussaid
This paper presents a new method for 3D action recognition with skeleton sequences (i. e., 3D trajectories of human skeleton joints).
Ranked #71 on Skeleton Based Action Recognition on NTU RGB+D 120
no code implementations • 10 Jan 2017 • Syed Afaq Ali Shah, Uzair Nadeem, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
We estimate regression models for each test image using the class specific gallery subspaces.
no code implementations • 21 Nov 2016 • Ammar Mahmood, Mohammed Bennamoun, Senjian An, Ferdous Sohel
Deep residual networks have recently emerged as the state-of-the-art architecture in image segmentation and object detection.
no code implementations • 18 Aug 2016 • Qiuhong Ke, Mohammed Bennamoun, Senjian An, Farid Bossaid, Ferdous Sohel
The structural models, including the spatial and the temporal models, are learned with Long Short Term Memory (LSTM) networks to capture the dependency of the global and local contexts of each RGB frame and each optical flow image, respectively.
no code implementations • 7 Jun 2016 • Salman H. Khan, Xuming He, Fatih Porikli, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
We apply a constrained mean-field algorithm to estimate the pixel-level labels, and use the estimated labels to update the parameters of the CNN in an iterative EM framework.
no code implementations • ICCV 2015 • Senjian An, Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel
The contractive constraints ensure that the achieved separating margin in the input space is larger than or equal to the separating margin in the output layer.
no code implementations • ICCV 2015 • Chao Sui, Mohammed Bennamoun, Roberto Togneri
This paper presents a novel feature learning method for visual speech recognition using Deep Boltzmann Machines (DBM).
no code implementations • 14 Aug 2015 • Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
Class imbalance is a common problem in the case of real-world object detection and classification tasks.
no code implementations • 18 Jun 2015 • Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Senjian An
This paper introduces a new learnable feature descriptor called "spatial layout and scale invariant convolutional activations" to deal with these challenges.
no code implementations • 17 Jun 2015 • Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel
To this end, we introduce a new large-scale dataset of 1300 object categories which are commonly present in indoor scenes.
no code implementations • CVPR 2015 • Salman H. Khan, Xuming He, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
Objects' spatial layout estimation and clutter identification are two important tasks to understand indoor scenes.
no code implementations • CVPR 2014 • Salman Hameed Khan, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri
We present a practical framework to automatically detect shadows in real world scenes from a single photograph.
no code implementations • CVPR 2014 • Munawar Hayat, Mohammed Bennamoun, Senjian An
We propose a deep learning framework for image set classification with application to face recognition.
no code implementations • 11 Apr 2013 • Yulan Guo, Ferdous Sohel, Mohammed Bennamoun, Min Lu, Jianwei Wan
The performance of the proposed LRF, RoPS descriptor and object recognition algorithm was rigorously tested on a number of popular and publicly available datasets.