RGB-D Salient Object Detection

56 papers with code • 8 benchmarks • 5 datasets

RGB-D Salient object detection (SOD) aims at distinguishing the most visually distinctive objects or regions in a scene from the given RGB and Depth data. It has a wide range of applications, including video/image segmentation, object recognition, visual tracking, foreground maps evaluation, image retrieval, content-aware image editing, information discovery, photosynthesis, and weakly supervised semantic segmentation. Here, depth information plays an important complementary role in finding salient objects. Online benchmark: http://dpfan.net/d3netbenchmark.

( Image credit: Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks, TNNLS20 )

Benchmarks

Add a Result

These leaderboards are used to track progress in RGB-D Salient Object Detection

Dataset	Best Model	Compare
NJU2K	DFormer-L	See all
SIP	DFormer-L	See all
NLPR	DFormer-L	See all
STERE	DFormer-L	See all
DES	DFormer-L	See all
LFSD	UCNet-CVAE	See all
RGBD135	DASNet	See all
NJUD	VST	See all

Libraries

Use these libraries to find RGB-D Salient Object Detection models and implementations

taozh2017/RGBD-SODsurvey

5 papers

320

kerenfu/JLDCF

3 papers

Datasets

Latest papers

Most implemented Social Latest No code

DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation

VCIP-RGBD/DFormer • • 18 Sep 2023

We present DFormer, a novel RGB-D pretraining framework to learn transferable representations for RGB-D segmentation tasks.

109

18 Sep 2023

Paper
Code

Point-aware Interaction and CNN-induced Refinement Network for RGB-D Salient Object Detection

rmcong/picr-net_acmmm23 • • 17 Aug 2023

By integrating complementary information from RGB image and depth map, the ability of salient object detection (SOD) for complex and challenging scenes can be improved.

17 Aug 2023

Paper
Code

Mutual Information Regularization for Weakly-supervised RGB-D Salient Object Detection

baneitixiaomai/mirv • • 6 Jun 2023

In particular, following the principle of disentangled representation learning, we introduce a mutual information upper bound with a mutual information minimization regularizer to encourage the disentangled representation of each modality for salient object detection.

06 Jun 2023

Paper
Code

CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient Object Detection

Lin-Qinwei/CIR-Net-MindSpore.git • • 6 Oct 2022

Focusing on the issue of how to effectively capture and utilize cross-modality information in RGB-D salient object detection (SOD) task, we present a convolutional neural network (CNN) model, named CIR-Net, based on the novel cross-modality interaction and refinement.

06 Oct 2022

Paper
Code

Depth Quality-Inspired Feature Manipulation for Efficient RGB-D and Video Salient Object Detection

zwbx/DFM-Net • • 8 Aug 2022

Inspired by the fact that depth quality is a key factor influencing the accuracy, we propose an efficient depth quality-inspired feature manipulation (DQFM) process, which can dynamically filter depth features according to depth quality.

08 Aug 2022

Paper
Code

SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection

Hydragon516/SPSN • • 16 Jul 2022

However, despite advances in deep learning-based methods, RGB-D SOD is still challenging due to the large domain gap between an RGB image and the depth map and low-quality depth maps.

16 Jul 2022

Paper
Code

TANet: Transformer-based Asymmetric Network for RGB-D Salient Object Detection

lc012463/tanet • 4 Jul 2022

We employ the powerful feature extraction capability of Transformer (PVTv2) to extract global semantic information from RGB data and design a lightweight CNN backbone (LWDepthNet) to extract spatial structure information from depth data without pre-training.

04 Jul 2022

Paper
Code