Search Results for author: Dingwen Zhang

Found 55 papers, 20 papers with code

AUG: A New Dataset and An Efficient Model for Aerial Image Urban Scene Graph Generation

no code implementations • 11 Apr 2024 • Yansheng Li, Kun Li, Yongjun Zhang, LinLin Wang, Dingwen Zhang

To fill in the gap of the overhead view dataset, this paper constructs and releases an aerial image urban scene graph generation (AUG) dataset.

Graph Generation Relationship Detection +1

Paper
Add Code

GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time

no code implementations • 15 Mar 2024 • Hao Li, Yuanyuan Gao, Chenming Wu, Dingwen Zhang, Yalun Dai, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han

Specifically, we design a novel joint learning framework that consists of an Iterative Pose Optimization Network (IPO-Net) and a Generalizable 3D-Gaussians (G-3DG) model.

Generalizable Novel View Synthesis Novel View Synthesis

Paper
Add Code

Continual All-in-One Adverse Weather Removal with Knowledge Replay on a Unified Network Structure

1 code implementation • 12 Mar 2024 • De Cheng, Yanling Ji, Dong Gong, Yan Li, Nannan Wang, Junwei Han, Dingwen Zhang

It considers the characteristics of the image restoration task with multiple degenerations in continual learning, and the knowledge for different degenerations can be shared and accumulated in the unified network structure.

Continual Learning Image Restoration +2

Paper
Code

SA-MixNet: Structure-aware Mixup and Invariance Learning for Scribble-supervised Road Extraction in Remote Sensing Images

no code implementations • 3 Mar 2024 • Jie Feng, Hao Huang, Junpeng Zhang, Weisheng Dong, Dingwen Zhang, Licheng Jiao

To eliminate the reliance on such priors, we propose a novel Structure-aware Mixup and Invariance Learning framework (SA-MixNet) for weakly supervised road extraction that improves the model invariance in a data-driven manner.

Paper
Add Code

Revisiting the Power of Prompt for Visual Tuning

1 code implementation • 4 Feb 2024 • Yuzhu Wang, Lechao Cheng, Chaowei Fang, Dingwen Zhang, Manni Duan, Meng Wang

Inspired by the observation that the prompt tokens tend to share high mutual information with patch tokens, we propose initializing prompts with downstream token prototypes.

Ranked #1 on Visual Prompt Tuning on VTAB-1k(Structured<8>)

Visual Prompt Tuning

Paper
Code

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

1 code implementation • 25 Nov 2023 • Ziyang Luo, Nian Liu, Wangbo Zhao, Xuguang Yang, Dingwen Zhang, Deng-Ping Fan, Fahad Khan, Junwei Han

Salient object detection (SOD) and camouflaged object detection (COD) are related yet distinct binary mapping tasks.

Model Optimization object-detection +3

Paper
Code

GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding

no code implementations • 20 Nov 2023 • Hao Li, Dingwen Zhang, Yalun Dai, Nian Liu, Lechao Cheng, Jingfeng Li, Jingdong Wang, Junwei Han

Applying NeRF to downstream perception tasks for scene understanding and representation is becoming increasingly popular.

Instance Segmentation Scene Understanding +2

Paper
Add Code

SegGPT Meets Co-Saliency Scene

no code implementations • 8 May 2023 • Yi Liu, Shoukun Xu, Dingwen Zhang, Jungong Han

Co-salient object detection targets at detecting co-existed salient objects among a group of images.

Co-Salient Object Detection Object +2

Paper
Add Code

Mitigating Undisciplined Over-Smoothing in Transformer for Weakly Supervised Semantic Segmentation

no code implementations • 4 May 2023 • Jingxuan He, Lechao Cheng, Chaowei Fang, Dingwen Zhang, Zhangye Wang, Wei Chen

A surge of interest has emerged in weakly supervised semantic segmentation due to its remarkable efficiency in recent years.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Paper
Add Code

Revisiting Long-tailed Image Classification: Survey and Benchmarks with New Evaluation Metrics

no code implementations • 3 Feb 2023 • Chaowei Fang, Dingwen Zhang, Wen Zheng, Xue Li, Le Yang, Lechao Cheng, Junwei Han

We set up novel evaluation benchmarks based on a series of testing sets with evolving distributions.

Ranked #64 on Long-tail Learning on CIFAR-100-LT (ρ=100)

Image Classification Long-tail Learning

Paper
Add Code

Boosting Low-Data Instance Segmentation by Unsupervised Pre-training with Saliency Prompt

no code implementations • CVPR 2023 • Hao Li, Dingwen Zhang, Nian Liu, Lechao Cheng, Yalun Dai, Chao Zhang, Xinggang Wang, Junwei Han

Inspired by the recent success of the Prompting technique, we introduce a new pre-training method that boosts QEIS models by giving Saliency Prompt for queries/kernels.

Instance Segmentation Semantic Segmentation +1

Paper
Add Code

Compound Batch Normalization for Long-tailed Image Classification

no code implementations • 2 Dec 2022 • Lechao Cheng, Chaowei Fang, Dingwen Zhang, Guanbin Li, Gang Huang

It can model the feature space more comprehensively and reduce the dominance of head classes.

Classification Image Classification

Paper
Add Code

Combating Noisy Labels in Long-Tailed Image Classification

no code implementations • 1 Sep 2022 • Chaowei Fang, Lechao Cheng, Huiyan Qi, Dingwen Zhang

Most existing methods that cope with noisy labels usually assume that the class distributions are well balanced, which has insufficient capacity to deal with the practical scenarios where training samples have imbalanced distributions.

Classification Image Classification

Paper
Add Code

Deep 3D Vessel Segmentation based on Cross Transformer Network

1 code implementation • 22 Aug 2022 • Chengwei Pan, Baolian Qi, Gangming Zhao, Jiaheng Liu, Chaowei Fang, Dingwen Zhang, Jinpeng Li

In CTN, a transformer module is constructed in parallel to a U-Net to learn long-distance dependencies between different anatomical regions; and these dependencies are communicated to the U-Net at multiple stages to endow it with global awareness.

Computed Tomography (CT) Segmentation

Paper
Code

Computer-aided Tuberculosis Diagnosis with Attribute Reasoning Assistance

1 code implementation • 1 Jul 2022 • Chengwei Pan, Gangming Zhao, Junjie Fang, Baolian Qi, Jiaheng Liu, Chaowei Fang, Dingwen Zhang, Jinpeng Li, Yizhou Yu

Although deep learning algorithms have been intensively developed for computer-aided tuberculosis diagnosis (CTD), they mainly depend on carefully annotated datasets, leading to much time and resource consumption.

Attribute Relational Reasoning +1

Paper
Code

Structured Attention Composition for Temporal Action Localization

2 code implementations • 20 May 2022 • Le Yang, Junwei Han, Tao Zhao, Nian Liu, Dingwen Zhang

To tackle this issue, we make an early effort to study temporal action localization from the perspective of multi-modality feature learning, based on the observation that different actions exhibit specific preferences to appearance or motion modality.

Action Detection Temporal Action Localization

264

Paper
Code

Robust Single Image Dehazing Based on Consistent and Contrast-Assisted Reconstruction

no code implementations • 29 Mar 2022 • De Cheng, Yan Li, Dingwen Zhang, Nannan Wang, Xinbo Gao, Jiande Sun

To properly address this problem, we propose a novel density-variational learning framework to improve the robustness of the image dehzing model assisted by a variety of negative hazy images, to better deal with various complex hazy scenarios.

Image Dehazing Single Image Dehazing

Paper
Add Code

Cross-Modality High-Frequency Transformer for MR Image Super-Resolution

no code implementations • 29 Mar 2022 • Chaowei Fang, Dingwen Zhang, Liang Wang, Yulun Zhang, Lechao Cheng, Junwei Han

Improving the resolution of magnetic resonance (MR) image data is critical to computer-aided diagnosis and brain function analysis.

Image Super-Resolution Vocal Bursts Intensity Prediction

Paper
Add Code

Hybrid Routing Transformer for Zero-Shot Learning

no code implementations • 29 Mar 2022 • De Cheng, Gerong Wang, Bo wang, Qiang Zhang, Jungong Han, Dingwen Zhang

This design makes the presented transformer model a hybrid of 1) top-down and bottom-up attention pathways and 2) dynamic and static routing pathways.

Attribute Zero-Shot Learning

Paper
Add Code

Learning Self-Supervised Low-Rank Network for Single-Stage Weakly and Semi-Supervised Semantic Segmentation

1 code implementation • 19 Mar 2022 • Junwen Pan, Pengfei Zhu, Kaihua Zhang, Bing Cao, Yu Wang, Dingwen Zhang, Junwei Han, QinGhua Hu

Semantic segmentation with limited annotations, such as weakly supervised semantic segmentation (WSSS) and semi-supervised semantic segmentation (SSSS), is a challenging task that has attracted much attention recently.

Ranked #34 on Weakly-Supervised Semantic Segmentation on COCO 2014 val

Pseudo Label Segmentation +3

Paper
Code

Colar: Effective and Efficient Online Action Detection by Consulting Exemplars

1 code implementation • CVPR 2022 • Le Yang, Junwei Han, Dingwen Zhang

Based on the exemplar-consultation mechanism, the long-term dependencies can be captured by regarding historical frames as exemplars, while the category-level modeling can be achieved by regarding representative frames from a category as exemplars.

Ranked #6 on Online Action Detection on TVSeries

Online Action Detection

264

Paper
Code

Cross-Modality Deep Feature Learning for Brain Tumor Segmentation

no code implementations • 7 Jan 2022 • Dingwen Zhang, Guohai Huang, Qiang Zhang, Jungong Han, Junwei Han, Yizhou Yu

Recent advances in machine learning and prevalence of digital medical images have opened up an opportunity to address the challenging brain tumor segmentation (BTS) task by using deep convolutional neural networks.

Brain Tumor Segmentation Segmentation +1

Paper
Add Code

Robust Region Feature Synthesizer for Zero-Shot Object Detection

1 code implementation • CVPR 2022 • Peiliang Huang, Junwei Han, De Cheng, Dingwen Zhang

Zero-shot object detection aims at incorporating class semantic vectors to realize the detection of (both seen and) unseen classes given an unconstrained test image.

Ranked #2 on Zero-Shot Object Detection on PASCAL VOC'07

Generalized Zero-Shot Object Detection Object +2

Paper
Code

Incremental Cross-view Mutual Distillation for Self-supervised Medical CT Synthesis

no code implementations • CVPR 2022 • Chaowei Fang, Liang Wang, Dingwen Zhang, Jun Xu, Yixuan Yuan, Junwei Han

Under this circumstance, the models learned from different views can distill valuable knowledge to guide the learning processes of each other.

Self-Supervised Learning

Paper
Add Code

Pixel Distillation: A New Knowledge Distillation Scheme for Low-Resolution Image Recognition

no code implementations • 17 Dec 2021 • Guangyu Guo, Longfei Han, Junwei Han, Dingwen Zhang

To this end, we make a pioneering effort to distill helpful knowledge from a heavy network model learned from high-resolution (HR) images to a compact network model that will handle LR images, thus advancing the current knowledge distillation technique with the novel pixel distillation.

Knowledge Distillation Model Compression +1

Paper
Add Code

Weakly Supervised Semantic Segmentation via Alternative Self-Dual Teaching

no code implementations • 17 Dec 2021 • Dingwen Zhang, Wenyuan Zeng, Guangyu Guo, Chaowei Fang, Lechao Cheng, Ming-Ming Cheng, Junwei Han

Current weakly supervised semantic segmentation (WSSS) frameworks usually contain the separated mask-refinement model and the main semantic region mining model.

Knowledge Distillation Weakly supervised Semantic Segmentation +1

Paper
Add Code

Background-Click Supervision for Temporal Action Localization

1 code implementation • 24 Nov 2021 • Le Yang, Junwei Han, Tao Zhao, Tianwei Lin, Dingwen Zhang, Jianxin Chen

Weakly supervised temporal action localization aims at learning the instance-level action pattern from the video-level labels, where a significant challenge is action-context confusion.

Position Weakly-supervised Temporal Action Localization +1

220

Paper
Code

Light Field Saliency Detection with Dual Local Graph Learning andReciprocative Guidance

1 code implementation • 2 Oct 2021 • Nian Liu, Wangbo Zhao, Dingwen Zhang, Junwei Han, Ling Shao

On the other hand, instead of processing the twokinds of data separately, we build a novel dual graph modelto guide the focal stack fusion process using all-focus pat-terns.

Graph Learning Saliency Detection

Paper
Code

Single Image Dehazing with An Independent Detail-Recovery Network

no code implementations • 22 Sep 2021 • Yan Li, De Cheng, Jiande Sun, Dingwen Zhang, Nannan Wang, Xinbo Gao

In this paper, we propose a single image dehazing method with an independent Detail Recovery Network (DRN), which considers capturing the details from the input image over a separate network and then integrates them into a coarse dehazed image.

Image Dehazing Single Image Dehazing

Paper
Add Code

ABMDRNet: Adaptive-Weighted Bi-Directional Modality Difference Reduction Network for RGB-T Semantic Segmentation

no code implementations • CVPR 2021 • Qiang Zhang, Shenlu Zhao, Yongjiang Luo, Dingwen Zhang, Nianchang Huang, Jungong Han

Semantic segmentation models gain robustness against poor lighting conditions by virtue of complementary information from visible (RGB) and thermal images.

Ranked #27 on Thermal Image Segmentation on MFN Dataset

Image-to-Image Translation Segmentation +2

Paper
Add Code

Strengthen Learning Tolerance for Weakly Supervised Object Localization

1 code implementation • CVPR 2021 • Guangyu Guo, Junwei Han, Fang Wan, Dingwen Zhang

Weakly supervised object localization (WSOL) aims at learning to localize objects of interest by only using the image-level labels as the supervision.

Object Weakly-Supervised Object Localization

Paper
Code

A Structure-Aware Relation Network for Thoracic Diseases Detection and Segmentation

1 code implementation • 21 Apr 2021 • Jie Lian, Jingyu Liu, Shu Zhang, Kai Gao, Xiaoqing Liu, Dingwen Zhang, Yizhou Yu

Leveraging on constant structure and disease relations extracted from domain knowledge, we propose a structure-aware relation network (SAR-Net) extending Mask R-CNN.

Instance Segmentation Object Detection +2

Paper
Code

Weakly Supervised Object Localization and Detection: A Survey

no code implementations • 16 Apr 2021 • Dingwen Zhang, Junwei Han, Gong Cheng, Ming-Hsuan Yang

As an emerging and challenging problem in the computer vision community, weakly supervised object localization and detection plays an important role for developing new generation computer vision systems and has received significant attention in the past decade.

Object Weakly-Supervised Object Localization

Paper
Add Code

Few-Cost Salient Object Detection with Adversarial-Paced Learning

1 code implementation • NeurIPS 2020 • Dingwen Zhang, HaiBin Tian, Jungong Han

A fundamental challenge in training the existing deep saliency detection models is the requirement of large amounts of annotated data.

Object object-detection +3

Paper
Code

Onfocus Detection: Identifying Individual-Camera Eye Contact from Unconstrained Images

1 code implementation • 29 Mar 2021 • Dingwen Zhang, Bo wang, Gerong Wang, Qiang Zhang, Jiajia Zhang, Jungong Han, Zheng You

Onfocus detection aims at identifying whether the focus of the individual captured by a camera is on the camera or not.

Paper
Code

Densely Nested Top-Down Flows for Salient Object Detection

1 code implementation • 18 Feb 2021 • Chaowei Fang, HaiBin Tian, Dingwen Zhang, Qiang Zhang, Jungong Han, Junwei Han

To this end, this paper revisits the role of top-down modeling in salient object detection and designs a novel densely nested top-down flows (DNTDF)-based framework.

Object object-detection +2

Paper
Code

Salient Object Detection via Integrity Learning

3 code implementations • 19 Jan 2021 • Mingchen Zhuge, Deng-Ping Fan, Nian Liu, Dingwen Zhang, Dong Xu, Ling Shao

We define the concept of integrity at both a micro and macro level.

Object object-detection +2

263

Paper
Code

Light Field Saliency Detection With Dual Local Graph Learning and Reciprocative Guidance

no code implementations • ICCV 2021 • Nian Liu, Wangbo Zhao, Dingwen Zhang, Junwei Han, Ling Shao

In this paper, we model the information fusion within focal stack via graph networks.

Graph Learning object-detection +3

Paper
Add Code

Revisiting Anchor Mechanisms for Temporal Action Localization

1 code implementation • 22 Aug 2020 • Le Yang, Houwen Peng, Dingwen Zhang, Jianlong Fu, Junwei Han

To address this problem, this paper proposes a novel anchor-free action localization module that assists action localization by temporal points.

Temporal Action Localization

Paper
Code

Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization

no code implementations • 18 Aug 2020 • Tao Zhao, Junwei Han, Le Yang, Dingwen Zhang

The existing methods can be categorized into two localization-by-classification pipelines, i. e., the pre-classification pipeline and the post-classification pipeline.

Classification General Classification +2

Paper
Add Code

Exploring Rich and Efficient Spatial Temporal Interactions for Real Time Video Salient Object Detection

1 code implementation • 7 Aug 2020 • Chenglizhao Chen, Guotao Wang, Chong Peng, Dingwen Zhang, Yuming Fang, Hong Qin

In this way, even though the overall video saliency quality is heavily dependent on its spatial branch, however, the performance of the temporal branch still matter.

object-detection Salient Object Detection +2

Paper
Code

Re-thinking Co-Salient Object Detection

2 code implementations • 7 Jul 2020 • Deng-Ping Fan, Tengpeng Li, Zheng Lin, Ge-Peng Ji, Dingwen Zhang, Ming-Ming Cheng, Huazhu Fu, Jianbing Shen

CoSOD is an emerging and rapidly growing extension of salient object detection (SOD), which aims to detect the co-occurring salient objects in a group of images.

Ranked #7 on Co-Salient Object Detection on CoCA

Benchmarking Co-Salient Object Detection +3

Paper
Code

PoseFlow: A Deep Motion Representation for Understanding Human Behaviors in Videos

no code implementations • CVPR 2018 • Dingwen Zhang, Guangyu Guo, Dong Huang, Junwei Han

This "noisy" motion representation makes it very challenging for pose estimation and action recognition in real scenarios.

Action Recognition Optical Flow Estimation +3

Paper
Add Code

Reinforcement Cutting-Agent Learning for Video Object Segmentation

no code implementations • CVPR 2018 • Junwei Han, Le Yang, Dingwen Zhang, Xiaojun Chang, Xiaodan Liang

In this paper, we formulate this problem as a Markov Decision Process, where agents are learned to segment object regions under a deep reinforcement learning framework.

Decision Making Object +5

Paper
Add Code

Dilated Temporal Relational Adversarial Network for Generic Video Summarization

no code implementations • 30 Apr 2018 • Yu-jia Zhang, Michael Kampffmeyer, Xiaodan Liang, Dingwen Zhang, Min Tan, Eric P. Xing

Specifically, DTR-GAN learns a dilated temporal relational generator and a discriminator with three-player loss in an adversarial manner.

Generative Adversarial Network Video Summarization +1

Paper
Add Code

Unsupervised Object-Level Video Summarization with Online Motion Auto-Encoder

no code implementations • 2 Jan 2018 • Yu-jia Zhang, Xiaodan Liang, Dingwen Zhang, Min Tan, Eric P. Xing

Unsupervised video summarization plays an important role on digesting, browsing, and searching the ever-growing videos every day, and the underlying fine-grained semantic and motion information (i. e., objects of interest and their key motions) in online videos has been barely touched.

Object Unsupervised Video Summarization

Paper
Add Code

Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector

no code implementations • ICCV 2017 • Dingwen Zhang, Junwei Han, Yu Zhang

Based on this insight, we combine an intra-image fusion stream and a inter-image fusion stream in the proposed framework to generate the learning curriculum and pseudo ground-truth for supervising the training of the deep salient object detector.

Object object-detection +2

Paper
Add Code

SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos

no code implementations • CVPR 2017 • Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, Junwei Han

Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags.

Object Semantic Segmentation +3

Paper
Add Code

Learning Category-Specific 3D Shape Models From Weakly Labeled 2D Images

no code implementations • CVPR 2017 • Dingwen Zhang, Junwei Han, Yang Yang, Dong Huang

Recently, researchers have made great processes to build category-specific 3D shape models from 2D images with manual annotations consisting of class labels, keypoints, and ground truth figure-ground segmentations.

3D Shape Reconstruction Segmentation +2

Paper
Add Code

Bridging Saliency Detection to Weakly Supervised Object Detection Based on Self-paced Curriculum Learning

no code implementations • 3 Mar 2017 • Dingwen Zhang, Deyu Meng, Long Zhao, Junwei Han

Weakly-supervised object detection (WOD) is a challenging problems in computer vision.

Ranked #34 on Weakly Supervised Object Detection on PASCAL VOC 2007

Object object-detection +2

Paper
Add Code

Object Co-Segmentation via Graph Optimized-Flexible Manifold Ranking

no code implementations • CVPR 2016 • Rong Quan, Junwei Han, Dingwen Zhang, Feiping Nie

Aiming at automatically discovering the common objects contained in a set of relevant images and segmenting them as foreground simultaneously, object co-segmentation has become an active research topic in recent years.

Object Segmentation

Paper
Add Code

A Review of Co-saliency Detection Technique: Fundamentals, Applications, and Challenges

no code implementations • 24 Apr 2016 • Dingwen Zhang, Huazhu Fu, Junwei Han, Ali Borji, Xuelong. Li

Co-saliency detection is a newly emerging and rapidly growing research area in computer vision community.

Co-Salient Object Detection

Paper
Add Code

A Self-Paced Multiple-Instance Learning Framework for Co-Saliency Detection

no code implementations • ICCV 2015 • Dingwen Zhang, Deyu Meng, Chao Li, Lu Jiang, Qian Zhao, Junwei Han

As an interesting and emerging topic, co-saliency detection aims at simultaneously extracting common salient objects in a group of images.

Co-Salient Object Detection Multiple Instance Learning +1

Paper
Add Code

Co-Saliency Detection via Looking Deep and Wide

no code implementations • CVPR 2015 • Dingwen Zhang, Junwei Han, Chao Li, Jingdong Wang

In the proposed framework, the wide and deep information are explored for the object proposal windows extracted in each image, and the co-saliency scores are calculated by integrating the intra-image contrast and intra group consistency via a principled Bayesian formulation.

Co-Salient Object Detection Image Retrieval +1

Paper
Add Code

Predicting Eye Fixations Using Convolutional Neural Networks

no code implementations • CVPR 2015 • Nian Liu, Junwei Han, Dingwen Zhang, Shifeng Wen, Tianming Liu

It is believed that eye movements in free-viewing of natural scenes are directed by both bottom-up visual saliency and top-down visual factors.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.