Search Results for author: Deng-Ping Fan

Found 82 papers, 71 papers with code

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions

9 code implementations ICCV 2021 Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao

Unlike the recently-proposed Transformer model (e.g., ViT) that is specially designed for image classification, we propose Pyramid Vision Transformer (PVT), which overcomes the difficulties of porting Transformer to various dense prediction tasks.

Image Classification Instance Segmentation +3

Highly Accurate Dichotomous Image Segmentation

1 code implementation 6 Mar 2022 Xuebin Qin, Hang Dai, Xiaobin Hu, Deng-Ping Fan, Ling Shao, Luc Van Gool

We present a systematic study on a new task called dichotomous image segmentation (DIS), which aims to segment highly accurate objects from natural images.

3D Reconstruction Dichotomous Image Segmentation +3

Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers

2 code implementations 16 Aug 2021 Bo Dong, Wenhai Wang, Deng-Ping Fan, Jinpeng Li, Huazhu Fu, Ling Shao

Unlike existing CNN-based methods, we adopt a transformer encoder, which learns more powerful and robust representations.

Medical Image Segmentation

Boundary-Aware Segmentation Network for Mobile and Web Applications

5 code implementations 12 Jan 2021 Xuebin Qin, Deng-Ping Fan, Chenyang Huang, Cyril Diagne, Zichen Zhang, Adrià Cabeza Sant'Anna, Albert Suàrez, Martin Jagersand, Ling Shao

In this paper, we propose a simple yet powerful Boundary-Aware Segmentation Network (BASNet), which comprises a predict-refine architecture and a hybrid loss, for highly accurate image segmentation.

Camouflaged Object Segmentation Image Segmentation +3

Practical Blind Image Denoising via Swin-Conv-UNet and Data Synthesis

2 code implementations 24 Mar 2022 Kai Zhang, Yawei Li, Jingyun Liang, JieZhang Cao, Yulun Zhang, Hao Tang, Deng-Ping Fan, Radu Timofte, Luc van Gool

While recent years have witnessed a dramatic upsurge of exploiting deep neural networks toward solving image denoising, existing methods mostly rely on simple noise assumptions, such as additive white Gaussian noise (AWGN), JPEG compression noise and camera sensor noise, and a general-purpose blind denoising method for real images remains unsolved.

Image Denoising Image-to-Image Translation

Camouflaged Object Detection

2 code implementations CVPR 2020 Deng-Ping Fan, Ge-Peng Ji, Guolei Sun, Ming-Ming Cheng, Jianbing Shen, Ling Shao

We present a comprehensive study on a new task named camouflaged object detection (COD), which aims to identify objects that are "seamlessly" embedded in their surroundings.

Ranked #10 on Camouflaged Object Segmentation on CAMO (using extra training data)

Anomaly Detection Camouflaged Object Segmentation +3

RGB-D Salient Object Detection: A Survey

9 code implementations 1 Aug 2020 Tao Zhou, Deng-Ping Fan, Ming-Ming Cheng, Jianbing Shen, Ling Shao

Further, considering that the light field can also provide depth maps, we review SOD models and popular benchmark datasets from this domain as well.

Attribute Object +4

Siamese Network for RGB-D Salient Object Detection and Beyond

2 code implementations 26 Aug 2020 Keren Fu, Deng-Ping Fan, Ge-Peng Ji, Qijun Zhao, Jianbing Shen, Ce Zhu

Inspired by the observation that RGB and depth modalities actually present certain commonality in distinguishing salient objects, a novel joint learning and densely cooperative fusion (JL-DCF) architecture is designed to learn from both RGB and depth inputs through a shared network backbone, known as the Siamese architecture.

object-detection RGB-D Salient Object Detection +2

Uncertainty Inspired RGB-D Saliency Detection

4 code implementations 7 Sep 2020 Jing Zhang, Deng-Ping Fan, Yuchao Dai, Saeed Anwar, Fatemeh Saleh, Sadegh Aliakbarian, Nick Barnes

Our framework includes two main models: 1) a generator model, which maps the input image and latent variable to stochastic saliency prediction, and 2) an inference model, which gradually updates the latent variable by sampling it from the true or approximate posterior distribution.

RGB-D Salient Object Detection RGB Salient Object Detection +1

Specificity-preserving RGB-D Saliency Detection

3 code implementations ICCV 2021 Tao Zhou, Deng-Ping Fan, Geng Chen, Yi Zhou, Huazhu Fu

To effectively fuse cross-modal features in the shared learning network, we propose a cross-enhanced integration module (CIM) and then propagate the fused feature to the next layer for integrating cross-level information.

object-detection Object Detection +4

Kaleido-BERT: Vision-Language Pre-training on Fashion Domain

1 code implementation CVPR 2021 Mingchen Zhuge, Dehong Gao, Deng-Ping Fan, Linbo Jin, Ben Chen, Haoming Zhou, Minghui Qiu, Ling Shao

We present a new vision-language (VL) pre-training model dubbed Kaleido-BERT, which introduces a novel kaleido strategy for fashion cross-modality representations from transformers.

Image Retrieval Retrieval +1

Concealed Object Detection

1 code implementation 20 Feb 2021 Deng-Ping Fan, Ge-Peng Ji, Ming-Ming Cheng, Ling Shao

We present the first systematic study on concealed object detection (COD), which aims to identify objects that are "perfectly" embedded in their background.

Camouflaged Object Segmentation Dichotomous Image Segmentation +2

Shifting More Attention to Video Salient Object Detection

1 code implementation CVPR 2019 Deng-Ping Fan, Wenguan Wang, Ming-Ming Cheng, Jianbing Shen

This is the first work that explicitly emphasizes the challenge of saliency shift, i.e., the video salient object(s) may dynamically change.

Object object-detection +2

Deep Gradient Learning for Efficient Camouflaged Object Detection

1 code implementation 25 May 2022 Ge-Peng Ji, Deng-Ping Fan, Yu-Cheng Chou, Dengxin Dai, Alexander Liniger, Luc van Gool

This paper introduces DGNet, a novel deep framework that exploits object gradient supervision for camouflaged object detection (COD).

Defect Detection Object +4

Bifurcated backbone strategy for RGB-D salient object detection

2 code implementations 6 Jul 2020 Yingjie Zhai, Deng-Ping Fan, Jufeng Yang, Ali Borji, Ling Shao, Junwei Han, Liang Wang

In particular, first, we propose to regroup the multi-level features into teacher and student features using a bifurcated backbone strategy (BBS).

Object object-detection +3

BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network

1 code implementation ECCV 2020 Deng-Ping Fan, Yingjie Zhai, Ali Borji, Jufeng Yang, Ling Shao

In particular, we 1) propose a bifurcated backbone strategy (BBS) to split the multi-level features into teacher and student features, and 2) utilize a depth-enhanced module (DEM) to excavate informative parts of depth cues from the channel and spatial views.

object-detection RGB-D Salient Object Detection +1

Simultaneously Localize, Segment and Rank the Camouflaged Objects

1 code implementation CVPR 2021 Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan

With the above understanding about camouflaged objects, we present the first ranking based COD network (Rank-Net) to simultaneously localize, segment and rank camouflaged objects.

object-detection Object Detection

Towards Deeper Understanding of Camouflaged Object Detection

1 code implementation 23 May 2022 Yunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Nick Barnes, Deng-Ping Fan

With the above understanding about camouflaged objects, we present the first triple-task learning framework to simultaneously localize, segment, and rank camouflaged objects, indicating the conspicuousness level of camouflage.

Object object-detection +1

Segment Anything Model for Medical Images?

1 code implementation 28 Apr 2023 Yuhao Huang, Xin Yang, Lian Liu, Han Zhou, Ao Chang, Xinrui Zhou, Rusi Chen, Junxuan Yu, Jiongquan Chen, Chaoyu Chen, Sijing Liu, Haozhe Chi, Xindi Hu, Kejuan Yue, Lei LI, Vicente Grau, Deng-Ping Fan, Fajin Dong, Dong Ni

To fully validate SAM's performance on medical data, we collected and sorted 53 open-source datasets and built a large medical segmentation dataset with 18 modalities, 84 objects, 125 object-modality paired targets, 1050K 2D images, and 6033K masks.

Image Segmentation Medical Image Segmentation +3

Full-Duplex Strategy for Video Object Segmentation

1 code implementation ICCV 2021 Ge-Peng Ji, Deng-Ping Fan, Keren Fu, Zhe Wu, Jianbing Shen, Ling Shao

Previous video object segmentation approaches mainly focus on using simplex solutions between appearance and motion, limiting feature collaboration efficiency among and across these two cues.

Object Salient Object Detection +6

Advances in Deep Concealed Scene Understanding

1 code implementation 21 Apr 2023 Deng-Ping Fan, Ge-Peng Ji, Peng Xu, Ming-Ming Cheng, Christos Sakaridis, Luc van Gool

Concealed scene understanding (CSU) is a hot computer vision topic aiming to perceive objects exhibiting camouflage.

Scene Understanding Semantic Segmentation

RGB-D Saliency Detection via Cascaded Mutual Information Minimization

1 code implementation ICCV 2021 Jing Zhang, Deng-Ping Fan, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao

In this paper, we introduce a novel multi-stage cascaded learning framework via mutual information minimization to "explicitly" model the multi-modal information between RGB image and depth data.

Saliency Detection Thermal Image Segmentation

Structure-measure: A New Way to Evaluate Foreground Maps

1 code implementation ICCV 2017 Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, Ali Borji

Our new measure simultaneously evaluates region-aware and object-aware structural similarity between a SM and a GT map.

Object object-detection +5

Salient Objects in Clutter

2 code implementations 7 May 2021 Deng-Ping Fan, Jing Zhang, Gang Xu, Ming-Ming Cheng, Ling Shao

This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.

Image Augmentation Object +4

Light Field Salient Object Detection: A Review and Benchmark

1 code implementation 10 Oct 2020 Keren Fu, Yao Jiang, Ge-Peng Ji, Tao Zhou, Qijun Zhao, Deng-Ping Fan

Secondly, we benchmark nine representative light field SOD models together with several cutting-edge RGB-D SOD models on four widely used light field datasets, from which insightful discussions and analyses, including a comparison between light field SOD and RGB-D SOD models, are achieved.

Benchmarking Object +4

Referring Camouflaged Object Detection

1 code implementation 13 Jun 2023 Xuying Zhang, Bowen Yin, Zheng Lin, Qibin Hou, Deng-Ping Fan, Ming-Ming Cheng

We consider the problem of referring camouflaged object detection (Ref-COD), a new task that aims to segment specified camouflaged objects based on a small set of referring images with salient target objects.

Object object-detection +1

Camouflaged Object Segmentation with Distraction Mining

1 code implementation CVPR 2021 Haiyang Mei, Ge-Peng Ji, Ziqi Wei, Xin Yang, Xiaopeng Wei, Deng-Ping Fan

In this paper, we strive to embrace challenges towards effective and efficient COS. To this end, we develop a bio-inspired framework, termed Positioning and Focus Network (PFNet), which mimics the process of predation in nature.

Camouflaged Object Segmentation Dichotomous Image Segmentation +3

Re-thinking Co-Salient Object Detection

2 code implementations 7 Jul 2020 Deng-Ping Fan, Tengpeng Li, Zheng Lin, Ge-Peng Ji, Dingwen Zhang, Ming-Ming Cheng, Huazhu Fu, Jianbing Shen

CoSOD is an emerging and rapidly growing extension of salient object detection (SOD), which aims to detect the co-occurring salient objects in a group of images.

Benchmarking Co-Salient Object Detection +3

Group Collaborative Learning for Co-Salient Object Detection

1 code implementation CVPR 2021 Qi Fan, Deng-Ping Fan, Huazhu Fu, Chi Keung Tang, Ling Shao, Yu-Wing Tai

We present a novel group collaborative learning framework (GCoNet) capable of detecting co-salient objects in real time (16ms), by simultaneously mining consensus representations at group level based on the two necessary criteria: 1) intra-group compactness to better formulate the consistency among co-salient objects by capturing their inherent shared attributes using our novel group affinity module; 2) inter-group separability to effectively suppress the influence of noisy objects on the output by introducing our new group collaborating module conditioning the inconsistent consensus.

Co-Salient Object Detection Object +2

Implicit Motion Handling for Video Camouflaged Object Detection

1 code implementation CVPR 2022 Xuelian Cheng, Huan Xiong, Deng-Ping Fan, Yiran Zhong, Mehrtash Harandi, Tom Drummond, ZongYuan Ge

We propose a new video camouflaged object detection (VCOD) framework that can exploit both short-term dynamics and long-term temporal consistency to detect camouflaged objects from video frames.

Camouflaged Object Segmentation Motion Estimation +4

Dense Uncertainty Estimation

1 code implementation 13 Oct 2021 Jing Zhang, Yuchao Dai, Mochu Xiang, Deng-Ping Fan, Peyman Moghadam, Mingyi He, Christian Walder, Kaihao Zhang, Mehrtash Harandi, Nick Barnes

Deep neural networks can be roughly divided into deterministic neural networks and stochastic neural networks. The former is usually trained to achieve a mapping from input space to output space via maximum likelihood estimation for the weights, which leads to deterministic predictions during testing.

Decision Making

JCS: An Explainable COVID-19 Diagnosis System by Joint Classification and Segmentation

1 code implementation 15 Apr 2020 Yu-Huan Wu, Shang-Hua Gao, Jie Mei, Jun Xu, Deng-Ping Fan, Rong-Guo Zhang, Ming-Ming Cheng

The chest CT scan test provides a valuable complementary tool to the RT-PCR test, and it can identify the patients in the early-stage with high sensitivity.

COVID-19 Diagnosis General Classification +2

Mutual Graph Learning for Camouflaged Object Detection

1 code implementation CVPR 2021 Qiang Zhai, Xin Li, Fan Yang, Chenglizhao Chen, Hong Cheng, Deng-Ping Fan

Automatically detecting/segmenting object(s) that blend in with their surroundings is difficult for current models.

Graph Learning Object +2

Indiscernible Object Counting in Underwater Scenes

1 code implementation CVPR 2023 Guolei Sun, Zhaochong An, Yun Liu, Ce Liu, Christos Sakaridis, Deng-Ping Fan, Luc van Gool

We further advance the frontier of this field by systematically studying a new challenge named indiscernible object counting (IOC), the goal of which is to count objects that are blended with respect to their surroundings.

Benchmarking Object +2

GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector

2 code implementations 30 May 2022 Peng Zheng, Huazhu Fu, Deng-Ping Fan, Qi Fan, Jie Qin, Yu-Wing Tai, Chi-Keung Tang, Luc van Gool

In this paper, we present a novel end-to-end group collaborative learning network, termed GCoNet+, which can effectively and efficiently (250 fps) identify co-salient objects in natural scenes.

Co-Salient Object Detection Object +2

Bilateral Reference for High-Resolution Dichotomous Image Segmentation

1 code implementation 7 Jan 2024 Peng Zheng, Dehong Gao, Deng-Ping Fan, Li Liu, Jorma Laaksonen, Wanli Ouyang, Nicu Sebe

It comprises two essential components: the localization module (LM) and the reconstruction module (RM) with our proposed bilateral reference (BiRef).

Ranked #1 on RGB Salient Object Detection on HRSOD (using extra training data)

Camouflaged Object Segmentation Dichotomous Image Segmentation +3

Generative Transformer for Accurate and Reliable Salient Object Detection

2 code implementations 20 Apr 2021 Yuxin Mao, Jing Zhang, Zhexiong Wan, Yuchao Dai, Aixuan Li, Yunqiu Lv, Xinyu Tian, Deng-Ping Fan, Nick Barnes

For the former, we apply transformer to a deterministic model, and explain that the effective structure modeling and global context modeling abilities lead to its superior performance compared with the CNN based frameworks.

Attribute Camouflaged Object Segmentation +8

Enhanced-alignment Measure for Binary Foreground Map Evaluation

2 code implementations 26 May 2018 Deng-Ping Fan, Cheng Gong, Yang Cao, Bo Ren, Ming-Ming Cheng, Ali Borji

The existing binary foreground map (FM) measures address various types of errors in either pixel-wise or structural ways.

Instructive Feature Enhancement for Dichotomous Medical Image Segmentation

1 code implementation 6 Jun 2023 Lian Liu, Han Zhou, Jiongquan Chen, Sijing Liu, Wenlong Shi, Dong Ni, Deng-Ping Fan, Xin Yang

Deep neural networks have been widely applied in dichotomous medical image segmentation (DMIS) of many anatomical structures in several modalities, achieving promising performance.

Image Segmentation Medical Image Segmentation +2

Masked Vision-Language Transformer in Fashion

1 code implementation 27 Oct 2022 Ge-Peng Ji, Mingcheng Zhuge, Dehong Gao, Deng-Ping Fan, Christos Sakaridis, Luc van Gool

We present a masked vision-language transformer (MVLT) for fashion-specific multi-modal representation.

Image Reconstruction Retrieval

How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges

1 code implementation 27 Jul 2023 Haotong Qin, Ge-Peng Ji, Salman Khan, Deng-Ping Fan, Fahad Shahbaz Khan, Luc van Gool

Google's Bard has emerged as a formidable competitor to OpenAI's ChatGPT in the field of conversational AI.

LSANet: Feature Learning on Point Sets by Local Spatial Aware Layer

1 code implementation 14 May 2019 Lin-Zhuo Chen, Xuan-Yi Li, Deng-Ping Fan, Kai Wang, Shao-Ping Lu, Ming-Ming Cheng

We design a novel Local Spatial Aware (LSA) layer, which can learn to generate Spatial Distribution Weights (SDWs) hierarchically based on the spatial relationship in local region for spatial independent operations, to establish the relationship between these operations and spatial distribution, thus capturing the local geometric structure sensitively. We further propose the LSANet, which is based on LSA layer, aggregating the spatial information with associated features in each layer of the network better in network design. The experiments show that our LSANet can achieve on par or better performance than the state-of-the-art methods when evaluating on the challenging benchmark datasets.

CamDiff: Camouflage Image Augmentation via Diffusion Model

1 code implementation 11 Apr 2023 Xue-Jing Luo, Shuo Wang, Zongwei Wu, Christos Sakaridis, Yun Cheng, Deng-Ping Fan, Luc van Gool

Specifically, we leverage the latent diffusion model to synthesize salient objects in camouflaged scenes, while using the zero-shot image classification ability of the Contrastive Language-Image Pre-training (CLIP) model to prevent synthesis failures and ensure the synthesized object aligns with the input prompt.

Image Augmentation Image Classification +3

Bilateral Attention Network for RGB-D Salient Object Detection

1 code implementation 30 Apr 2020 Zhao Zhang, Zheng Lin, Jun Xu, Wenda Jin, Shao-Ping Lu, Deng-Ping Fan

To better explore salient information in both foreground and background regions, this paper proposes a Bilateral Attention Network (BiANet) for the RGB-D SOD task.

Object object-detection +3

Taking a Deeper Look at Co-Salient Object Detection

1 code implementation CVPR 2020 Deng-Ping Fan, Zheng Lin, Ge-Peng Ji, Dingwen Zhang, Huazhu Fu, Ming-Ming Cheng

Co-salient object detection (CoSOD) is a newly emerging and rapidly growing branch of salient object detection (SOD), which aims to detect the co-occurring salient objects in multiple images.

Benchmarking Co-Salient Object Detection +3

From Semantic Categories to Fixations: A Novel Weakly-Supervised Visual-Auditory Saliency Detection Approach

1 code implementation CVPR 2021 Guotao Wang, Chenglizhao Chen, Deng-Ping Fan, Aimin Hao, Hong Qin

Thanks to the rapid advances in the deep learning techniques and the wide availability of large-scale training sets, the performances of video saliency detection models have been improving steadily and significantly.

Video Saliency Detection

Weakly Supervised Visual-Auditory Fixation Prediction with Multigranularity Perception

1 code implementation 27 Dec 2021 Guotao Wang, Chenglizhao Chen, Deng-Ping Fan, Aimin Hao, Hong Qin

Moreover, we distill knowledge from these regions to obtain complete new spatial-temporal-audio (STA) fixation prediction (FP) networks, enabling broad applications in cases where video tags are not available.

Video Saliency Detection

Semantic Edge Detection with Diverse Deep Supervision

1 code implementation 9 Apr 2018 Yun Liu, Ming-Ming Cheng, Deng-Ping Fan, Le Zhang, Jiawang Bian, DaCheng Tao

Semantic edge detection (SED), which aims at jointly extracting edges as well as their category information, has far-reaching applications in domains such as semantic segmentation, object proposal generation, and object recognition.

Edge Detection Object Proposal Generation +2

CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation

1 code implementation 16 Jul 2023 Jialun Pei, Tao Jiang, He Tang, Nian Liu, Yueming Jin, Deng-Ping Fan, Pheng-Ann Heng

We propose a novel approach for RGB-D salient instance segmentation using a dual-branch cross-modal feature calibration architecture called CalibNet.

Instance Segmentation Semantic Segmentation

Probabilistic Model Distillation for Semantic Correspondence

1 code implementation CVPR 2021 Xin Li, Deng-Ping Fan, Fan Yang, Ao Luo, Hong Cheng, Zicheng Liu

We address this problem with the use of a novel Probabilistic Model Distillation (PMD) approach which transfers knowledge learned by a probabilistic teacher model on synthetic data to a static student model with the use of unlabeled real image pairs.

Representation Learning Semantic correspondence

Face Sketch Synthesis Style Similarity: A New Structure Co-occurrence Texture Measure

1 code implementation 9 Apr 2018 Deng-Ping Fan, Shengchuan Zhang, Yu-Huan Wu, Ming-Ming Cheng, Bo Ren, Rongrong Ji, Paul L. Rosin

However, human perception of the similarity of two sketches will consider both structure and texture as essential factors and is not sensitive to slight ("pixel-level") mismatches.

Face Sketch Synthesis

Scoot: A Perceptual Metric for Facial Sketches

1 code implementation ICCV 2019 Deng-Ping Fan, Shengchuan Zhang, Yu-Huan Wu, Yun Liu, Ming-Ming Cheng, Bo Ren, Paul L. Rosin, Rongrong Ji

In this paper, we design a perceptual metric, called Structure Co-Occurrence Texture (Scoot), which simultaneously considers the block-level spatial structure and co-occurrence texture statistics.

Face Sketch Synthesis SSIM

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model

1 code implementation 4 Jan 2024 Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma

To this end, we propose Scalable Bias-Mode Attention Mask (BA-SAM) to enhance SAM's adaptability to varying image resolutions while eliminating the need for structure modifications.

WinDB: HMD-free and Distortion-free Panoptic Video Fixation Learning

2 code implementations 23 May 2023 Guotao Wang, Chenglizhao Chen, Aimin Hao, Hong Qin, Deng-Ping Fan

The main reason is that there always exist "blind zooms" when using HMD to collect fixations since the users cannot keep spinning their heads to explore the entire panoptic scene all the time.

Acquiring Weak Annotations for Tumor Localization in Temporal and Volumetric Data

1 code implementation 23 Oct 2023 Yu-Cheng Chou, Bowen Li, Deng-Ping Fan, Alan Yuille, Zongwei Zhou

In summary, this research proposes an efficient annotation strategy for tumor detection and localization that is less accurate than per-pixel annotations but useful for creating large-scale datasets for screening tumors in various medical modalities.

Weakly-supervised Learning

Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground

no code implementations ECCV 2018 Deng-Ping Fan, Ming-Ming Cheng, Jiang-Jiang Liu, Shang-Hua Gao, Qibin Hou, Ali Borji

Our analysis identifies a serious design bias of existing SOD datasets which assumes that each image contains at least one clearly outstanding salient object in low clutter.

Attribute Object +3

Identifying Concealed Objects from Videos

no code implementations 29 Sep 2021 Xuelian Cheng, Huan Xiong, Deng-Ping Fan, Yiran Zhong, Mehrtash Harandi, Tom Drummond, ZongYuan Ge

The proposed SLT-Net leverages on both short-term dynamics and long-term temporal consistency to detect concealed objects in continuous video frames.

object-detection Object Detection

QR-CLIP: Introducing Explicit Open-World Knowledge for Location and Time Reasoning

no code implementations 2 Feb 2023 Weimin Shi, Mingchen Zhuge, Dehong Gao, Zhong Zhou, Ming-Ming Cheng, Deng-Ping Fan

Daily images may convey abstract meanings that require us to memorize and infer profound information from them.

World Knowledge

SAM Struggles in Concealed Scenes -- Empirical Study on "Segment Anything"

no code implementations 12 Apr 2023 Ge-Peng Ji, Deng-Ping Fan, Peng Xu, Ming-Ming Cheng, BoWen Zhou, Luc van Gool

Segmenting anything is a ground-breaking step toward artificial general intelligence, and the Segment Anything Model (SAM) greatly fosters the foundation models for computer vision.

OnUVS: Online Feature Decoupling Framework for High-Fidelity Ultrasound Video Synthesis

no code implementations 16 Aug 2023 Han Zhou, Dong Ni, Ao Chang, Xinrui Zhou, Rusi Chen, Yanlin Chen, Lian Liu, Jiamin Liang, Yuhao Huang, Tong Han, Zhe Liu, Deng-Ping Fan, Xin Yang

Second, to better preserve the integrity and textural information of US images, we implemented a dual-decoder that decouples the content and textural features in the generator.

Edge-aware Feature Aggregation Network for Polyp Segmentation

no code implementations 19 Sep 2023 Tao Zhou, Yizhe Zhang, Geng Chen, Yi Zhou, Ye Wu, Deng-Ping Fan

Besides, a Scale-aware Convolution Module (SCM) is proposed to learn scale-aware features by using dilated convolutions with different ratios, in order to effectively deal with scale variation.

Segmentation

Large Model Based Referring Camouflaged Object Detection

no code implementations 28 Nov 2023 Shupeng Cheng, Ge-Peng Ji, Pengda Qin, Deng-Ping Fan, BoWen Zhou, Peng Xu

Our motivation is to make full use of the semantic intelligence and intrinsic knowledge of recent Multimodal Large Language Models (MLLMs) to decompose this complex task in a human-like way.

Object object-detection +2

Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes

no code implementations 27 Jan 2024 Diandian Guo, Deng-Ping Fan, Tongyu Lu, Christos Sakaridis, Luc van Gool

The estimation of implicit cross-frame correspondences and the high computational cost have long been major challenges in video semantic segmentation (VSS) for driving scenes.

Motion Estimation Segmentation +2

Effectiveness Assessment of Recent Large Vision-Language Models

no code implementations 7 Mar 2024 Yao Jiang, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan

The advent of large vision-language models (LVLMs) represents a noteworthy advancement towards the pursuit of artificial general intelligence.

Anomaly Detection Attribute +7

Latent Semantic Consensus For Deterministic Geometric Model Fitting

1 code implementation 11 Mar 2024 Guobao Xiao, Jun Yu, Jiayi Ma, Deng-Ping Fan, Ling Shao

The principle of LSC is to preserve the latent semantic consensus in both data points and model hypotheses.
