Search Results for author: Farid Boussaid

Found 30 papers, 8 papers with code

Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation

no code implementations2 Mar 2024 Lian Xu, Mohammed Bennamoun, Farid Boussaid, Wanli Ouyang, Ferdous Sohel, Dan Xu

We propose AuxSegNet+, a weakly supervised auxiliary learning framework to explore the rich information from these saliency maps and the significant inter-task correlation between saliency detection and semantic segmentation.

Auxiliary Learning Multi-Label Image Classification +5

Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models

no code implementations27 Feb 2024 Ashkan Taghipour, Morteza Ghahremani, Mohammed Bennamoun, Aref Miri Rekavandi, Hamid Laga, Farid Boussaid

To address these deficiencies, we introduce the Box-it-to-Bind-it (B2B) module - a novel, training-free approach for improving spatial control and semantic accuracy in text-to-image (T2I) diffusion models.

Attribute

Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art

1 code implementation10 Sep 2023 Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed Bennamoun

Transformers have rapidly gained popularity in computer vision, especially in the field of object recognition and detection.

Object object-detection +2

MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation

1 code implementation6 Aug 2023 Lian Xu, Mohammed Bennamoun, Farid Boussaid, Hamid Laga, Wanli Ouyang, Dan Xu

Building upon the observation that the attended regions of the one-class token in the standard vision transformer can contribute to a class-agnostic localization map, we explore the potential of the transformer model to capture class-specific attention for class-discriminative object localization by learning multiple class tokens.

Object Localization Weakly supervised Semantic Segmentation +1

Analysis and Evaluation of Explainable Artificial Intelligence on Suicide Risk Assessment

no code implementations9 Mar 2023 Hao Tang, Aref Miri Rekavandi, Dharjinder Rooprai, Girish Dwivedi, Frank Sanfilippo, Farid Boussaid, Mohammed Bennamoun

This study investigates the effectiveness of Explainable Artificial Intelligence (XAI) techniques in predicting suicide risks and identifying the dominant causes for such behaviours.

Data Augmentation Decision Making +2

VAPCNet: Viewpoint-Aware 3D Point Cloud Completion

no code implementations ICCV 2023 Zhiheng Fu, Longguang Wang, Lian Xu, Zhiyong Wang, Hamid Laga, Yulan Guo, Farid Boussaid, Mohammed Bennamoun

In this paper, we thus propose an unsupervised viewpoint representation learning scheme for 3D point cloud completion without explicit viewpoint estimation.

Point Cloud Completion Representation Learning +1

Learning Multi-Modal Class-Specific Tokens for Weakly Supervised Dense Object Localization

no code implementations CVPR 2023 Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Dan Xu

Weakly supervised dense object localization (WSDOL) relies generally on Class Activation Mapping (CAM), which exploits the correlation between the class weights of the image classifier and the pixel-level features.

Object Localization Representation Learning +2

3D Brain and Heart Volume Generative Models: A Survey

1 code implementation12 Oct 2022 Yanbin Liu, Girish Dwivedi, Farid Boussaid, Mohammed Bennamoun

Generative models such as generative adversarial networks and autoencoders have gained a great deal of attention in the medical field due to their excellent data generation capability.

Denoising

Active-Passive SimStereo -- Benchmarking the Cross-Generalization Capabilities of Deep Learning-based Stereo Methods

no code implementations17 Sep 2022 Laurent Jospin, Allen Antony, Lian Xu, Hamid Laga, Farid Boussaid, Mohammed Bennamoun

In this paper, we propose the Active-Passive SimStereo dataset and a corresponding benchmark to evaluate the performance gap between passive and active stereo images for stereo matching algorithms.

Benchmarking Stereo Matching

Bayesian Learning for Disparity Map Refinement for Semi-Dense Active Stereo Vision

no code implementations12 Sep 2022 Laurent Valentin Jospin, Hamid Laga, Farid Boussaid, Mohammed Bennamoun

A major focus of recent developments in stereo vision has been on how to obtain accurate dense disparity maps in passive stereo vision.

Disparity Estimation

Inflating 2D Convolution Weights for Efficient Generation of 3D Medical Images

no code implementations8 Aug 2022 Yanbin Liu, Girish Dwivedi, Farid Boussaid, Frank Sanfilippo, Makoto Yamada, Mohammed Bennamoun

Novel 3D network architectures are proposed for both the generator and discriminator of the GAN model to significantly reduce the number of parameters while maintaining the quality of image generation.

Image Generation Medical Image Generation

A Guide to Image and Video based Small Object Detection using Deep Learning : Case Study of Maritime Surveillance

no code implementations26 Jul 2022 Aref Miri Rekavandi, Lian Xu, Farid Boussaid, Abd-Krim Seghouane, Stephen Hoefs, Mohammed Bennamoun

Small object detection (SOD) in optical images and videos is a challenging problem that even state-of-the-art generic object detection methods fail to accurately localize and identify such objects.

Decision Making Object +2

Learning Resolution-Adaptive Representations for Cross-Resolution Person Re-Identification

no code implementations9 Jul 2022 Lin Wu, Lingqiao Liu, Yang Wang, Zheng Zhang, Farid Boussaid, Mohammed Bennamoun

It is a challenging and practical problem since the query images often suffer from resolution degradation due to the different capturing conditions from real-world cameras.

Person Re-Identification Super-Resolution

Pseudo-Pair based Self-Similarity Learning for Unsupervised Person Re-identification

no code implementations9 Jul 2022 Lin Wu, Deyin Liu, Wenying Zhang, Dapeng Chen, ZongYuan Ge, Farid Boussaid, Mohammed Bennamoun, Jialie Shen

In this paper, we present a pseudo-pair based self-similarity learning approach for unsupervised person re-ID without human annotations.

Unsupervised Person Re-Identification

Jacobian Norm with Selective Input Gradient Regularization for Improved and Interpretable Adversarial Defense

no code implementations9 Jul 2022 Deyin Liu, Lin Wu, Haifeng Zhao, Farid Boussaid, Mohammed Bennamoun, Xianghua Xie

Moreover, adversarially training a defense model in general cannot produce interpretable predictions towards the inputs with perturbations, whilst a highly interpretable robust model is required by different domain experts to understand the behaviour of a DNN.

Adversarial Defense

CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation

1 code implementation24 Mar 2022 Mohammed Hassanin, Abdelwahed Khamiss, Mohammed Bennamoun, Farid Boussaid, Ibrahim Radwan

3D human pose estimation can be handled by encoding the geometric dependencies between the body parts and enforcing the kinematic constraints.

3D Human Pose Estimation

Multi-class Token Transformer for Weakly Supervised Semantic Segmentation

1 code implementation CVPR 2022 Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Dan Xu

To this end, we propose a Multi-class Token Transformer, termed as MCTformer, which uses multiple class tokens to learn interactions between the class tokens and the patch tokens.

Object Object Localization +2

Generalized Closed-form Formulae for Feature-based Subpixel Alignment in Patch-based Matching

1 code implementation2 Dec 2021 Laurent Valentin Jospin, Farid Boussaid, Hamid Laga, Mohammed Bennamoun

In this paper, we show that closed form formulae for subpixel disparity computation for the case of one dimensional matching, e. g., in the case of rectified stereo images where the search space is of one dimension, exists when using the standard NCC, SSD and SAD cost functions.

Optical Flow Estimation Patch Matching +1

Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation

1 code implementation ICCV 2021 Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel, Dan Xu

Motivated by the significant inter-task correlation, we propose a novel weakly supervised multi-task framework termed as AuxSegNet, to leverage saliency detection and multi-label image classification as auxiliary tasks to improve the primary task of semantic segmentation using only image-level ground-truth labels.

Auxiliary Learning Multi-Label Image Classification +6

A Survey on Deep Learning Techniques for Stereo-based Depth Estimation

no code implementations1 Jun 2020 Hamid Laga, Laurent Valentin Jospin, Farid Boussaid, Mohammed Bennamoun

Motivated by their growing success in solving various 2D and 3D vision problems, deep learning for stereo-based depth estimation has attracted growing interest from the community, with more than 150 papers published in this area between 2014 and 2019.

 Ranked #1 on Monocular Depth Estimation on Make3D (RMSE metric)

Autonomous Driving Monocular Depth Estimation +2

Automatic Hierarchical Classification of Kelps using Deep Residual Features

no code implementations26 Jun 2019 Ammar Mahmood, Ana Giraldo Ospina, Mohammed Bennamoun, Senjian An, Ferdous Sohel, Farid Boussaid, Renae Hovey, Robert B. Fisher, Gary Kendrick

Across the globe, remote image data is rapidly being collected for the assessment of benthic communities from shallow to extremely deep waters on continental slopes to the abyssal seas.

Binary Classification Classification +1

Exploiting Layerwise Convexity of Rectifier Networks with Sign Constrained Weights

no code implementations14 Nov 2017 Senjian An, Farid Boussaid, Mohammed Bennamoun, Ferdous Sohel

By introducing sign constraints on the weights, this paper proposes sign constrained rectifier networks (SCRNs), whose training can be solved efficiently by the well known majorization-minimization (MM) algorithms.

On the Compressive Power of Deep Rectifier Networks for High Resolution Representation of Class Boundaries

no code implementations24 Aug 2017 Senjian An, Mohammed Bennamoun, Farid Boussaid

To show the superior compressive power of deep rectifier networks over shallow rectifier networks, we prove that the maximum boundary resolution of a single hidden layer rectifier network classifier grows exponentially with the number of units when this number is smaller than the dimension of the patterns.

General Classification

From Deep to Shallow: Transformations of Deep Rectifier Networks

no code implementations30 Mar 2017 Senjian An, Farid Boussaid, Mohammed Bennamoun, Jiankun Hu

Similarly, for a residual net and a conventional rectifier net with the same structure except for the skip connections in the residual net, the corresponding single hidden layer representation of the residual net is much more complex than the corresponding single hidden layer representation of the conventional net.

Contractive Rectifier Networks for Nonlinear Maximum Margin Classification

no code implementations ICCV 2015 Senjian An, Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel

The contractive constraints ensure that the achieved separating margin in the input space is larger than or equal to the separating margin in the output layer.

Classification General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.