Search Results for author: Henghui Ding

Found 72 papers, 36 papers with code

Context Contrasted Feature and Gated Multi-Scale Aggregation for Scene Segmentation

1 code implementation CVPR 2018 Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

In this paper, we first propose a novel context contrasted local feature that not only leverages the informative context but also spotlights the local information in contrast to the context.

Scene Segmentation Segmentation

Feature Boosting Network For 3D Pose Estimation

no code implementations15 Jan 2019 Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot

Learning a set of features that are reliable and discriminatively representative of the pose of a hand (or body) part is difficult due to the ambiguities, texture and illumination variation, and self-occlusion in the real application of 3D pose estimation.

3D Hand Pose Estimation 3D Pose Estimation

Toward Achieving Robust Low-Level and High-Level Scene Parsing

1 code implementation journal 2019 Bing Shuai, Henghui Ding, Ting Liu, Gang Wang, Xudong Jiang

Furthermore, we introduce a “dense skip” architecture to retain a rich set of low-level information from the pre-trained CNN, which is essential to improve the low-level parsing performance.

Scene Parsing Scene Segmentation +2

Boundary-Aware Feature Propagation for Scene Segmentation

1 code implementation ICCV 2019 Henghui Ding, Xudong Jiang, Ai Qun Liu, Nadia Magnenat Thalmann, Gang Wang

Furthermore, we propose a boundary-aware feature propagation (BFP) module to harvest and propagate the local features within their regions isolated by the learned boundaries in the UAG-structured image.

Scene Segmentation Segmentation

Semantic Correlation Promoted Shape-Variant Context for Segmentation

1 code implementation CVPR 2019 Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region.

Denoising Segmentation +1

Object 6D Pose Estimation with Non-local Attention

no code implementations20 Feb 2020 Jianhan Mei, Henghui Ding, Xudong Jiang

In this paper, we address the challenging task of estimating 6D object pose from a single RGB image.

6D Pose Estimation Object +2

Bi-directional Dermoscopic Feature Learning and Multi-scale Consistent Decision Fusion for Skin Lesion Segmentation

no code implementations20 Feb 2020 Xiaohong Wang, Xudong Jiang, Henghui Ding, Jun Liu

Accurate segmentation of skin lesion from dermoscopic images is a crucial part of computer-aided diagnosis of melanoma.

Lesion Segmentation Skin Lesion Segmentation

A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder

no code implementations ICCV 2021 Yujun Cai, Yiwei Wang, Yiheng Zhu, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Chuanxia Zheng, Sijie Yan, Henghui Ding, Xiaohui Shen, Ding Liu, Nadia Magnenat Thalmann

Notably, by considering this problem as a conditional generation process, we estimate a parametric distribution of the missing regions based on the input conditions, from which to sample and synthesize the full motion series.

motion prediction Motion Synthesis

Prototypical Matching and Open Set Rejection for Zero-Shot Semantic Segmentation

no code implementations ICCV 2021 HUI ZHANG, Henghui Ding

In this work, we present zero-shot semantic segmentation, which aims to identify not only the seen classes contained in training but also the novel classes that have never been seen.

Segmentation Semantic Segmentation +1

Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing

no code implementations ICCV 2021 Henghui Ding, HUI ZHANG, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang

In this work, we treat each respective region in an image as a whole, and capture the structure topology as well as the affinity among different regions.

Scene Parsing

Towards Enhancing Fine-grained Details for Image Matting

no code implementations22 Jan 2021 Chang Liu, Henghui Ding, Xudong Jiang

In this paper, we argue that recovering these microscopic details relies on low-level but high-definition texture features.

Image Matting

MINE: Towards Continuous Depth MPI with NeRF for Novel View Synthesis

1 code implementation ICCV 2021 Jiaxin Li, Zijian Feng, Qi She, Henghui Ding, Changhu Wang, Gim Hee Lee

In this paper, we propose MINE to perform novel view synthesis and depth estimation via dense 3D reconstruction from a single image.

3D Reconstruction Depth Estimation +1

Knowledge-aware Deep Framework for Collaborative Skin Lesion Segmentation and Melanoma Recognition

no code implementations7 Jun 2021 XiaoHong Wang, Xudong Jiang, Henghui Ding, Yuqian Zhao, Jun Liu

In this paper, we propose a novel knowledge-aware deep framework that incorporates some clinical knowledge into collaborative learning of two important melanoma diagnosis tasks, i. e., skin lesion segmentation and melanoma recognition.

Clinical Knowledge Lesion Segmentation +3

Recovering the Unbiased Scene Graphs from the Biased Ones

1 code implementation5 Jul 2021 Meng-Jiun Chiou, Henghui Ding, Hanshu Yan, Changhu Wang, Roger Zimmermann, Jiashi Feng

Given input images, scene graph generation (SGG) aims to produce comprehensive, graphical representations describing visual relationships among salient objects.

Missing Labels Scene Graph Classification +4

Improving Video Instance Segmentation via Temporal Pyramid Routing

1 code implementation28 Jul 2021 Xiangtai Li, Hao He, Yibo Yang, Henghui Ding, Kuiyuan Yang, Guangliang Cheng, Yunhai Tong, DaCheng Tao

To incorporate both temporal and scale information, we propose a Temporal Pyramid Routing (TPR) strategy to conditionally align and conduct pixel-level aggregation from a feature pyramid pair of two adjacent frames.

Instance Segmentation Panoptic Segmentation +2

M2IOSR: Maximal Mutual Information Open Set Recognition

no code implementations5 Aug 2021 Xin Sun, Henghui Ding, Chi Zhang, Guosheng Lin, Keck-Voon Ling

In this work, we aim to address the challenging task of open set recognition (OSR).

Open Set Learning

Few-Shot Segmentation with Global and Local Contrastive Learning

1 code implementation11 Aug 2021 Weide Liu, Zhonghua Wu, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin

To this end, we first propose a prior extractor to learn the query information from the unlabeled images with our proposed global-local contrastive learning.

Contrastive Learning Image Segmentation +2

Few-shot Segmentation with Optimal Transport Matching and Message Flow

no code implementations19 Aug 2021 Weide Liu, Chi Zhang, Henghui Ding, Tzu-Yi Hung, Guosheng Lin

In this work, we argue that every support pixel's information is desired to be transferred to all query pixels and propose a Correspondence Matching Network (CMNet) with an Optimal Transport Matching module to mine out the correspondence between the query and support images.

Few-Shot Semantic Segmentation Multi-Task Learning +2

Calibrating Class Activation Maps for Long-Tailed Visual Recognition

no code implementations29 Aug 2021 Chi Zhang, Guosheng Lin, Lvlong Lai, Henghui Ding, Qingyao Wu

First, we present a Class Activation Map Calibration (CAMC) module to improve the learning and prediction of network classifiers, by enforcing network prediction based on important image regions.

Representation Learning

Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning

no code implementations ICCV 2021 Chi Zhang, Henghui Ding, Guosheng Lin, Ruibo Li, Changhu Wang, Chunhua Shen

Inspired by the recent success in Automated Machine Learning literature (AutoML), in this paper, we present Meta Navigator, a framework that attempts to solve the aforementioned limitation in few-shot learning by seeking a higher-level strategy and proffer to automate the selection from various few-shot learning designs.

AutoML Few-Shot Learning

Structure-Aware Label Smoothing for Graph Neural Networks

no code implementations1 Dec 2021 Yiwei Wang, Yujun Cai, Yuxuan Liang, Wei Wang, Henghui Ding, Muhao Chen, Jing Tang, Bryan Hooi

Representing a label distribution as a one-hot vector is a common practice in training node classification models.

Classification Node Classification

Directed Graph Contrastive Learning

1 code implementation NeurIPS 2021 Zekun Tong, Yuxuan Liang, Henghui Ding, Yongxing Dai, Xinke Li, Changhu Wang

However, it is still in its infancy with two concerns: 1) changing the graph structure through data augmentation to generate contrastive views may mislead the message passing scheme, as such graph changing action deprives the intrinsic graph structural information, especially the directional structure in directed graphs; 2) since GCL usually uses predefined contrastive views with hand-picking parameters, it does not take full advantage of the contrastive information provided by data augmentation, resulting in incomplete structure information for models learning.

Contrastive Learning Data Augmentation

Adaptive Data Augmentation on Temporal Graphs

no code implementations NeurIPS 2021 Yiwei Wang, Yujun Cai, Yuxuan Liang, Henghui Ding, Changhu Wang, Siddharth Bhatia, Bryan Hooi

To address this issue, our idea is to transform the temporal graphs using data augmentation (DA) with adaptive magnitudes, so as to effectively augment the input features and preserve the essential semantic information.

Data Augmentation Node Classification

Time-Aware Neighbor Sampling for Temporal Graph Networks

no code implementations18 Dec 2021 Yiwei Wang, Yujun Cai, Yuxuan Liang, Henghui Ding, Changhu Wang, Bryan Hooi

In this work, we propose the TNS (Time-aware Neighbor Sampling) method: TNS learns from temporal information to provide an adaptive receptive neighborhood for every node at any time.

Node Classification

Learning Transferable Human-Object Interaction Detector With Natural Language Supervision

1 code implementation CVPR 2022 Suchen Wang, Yueqi Duan, Henghui Ding, Yap-Peng Tan, Kim-Hui Yap, Junsong Yuan

More specifically, we propose a new HOI visual encoder to detect the interacting humans and objects, and map them to a joint feature space to perform interaction recognition.

Human-Object Interaction Detection

Coarse-to-Fine Feature Mining for Video Semantic Segmentation

1 code implementation CVPR 2022 Guolei Sun, Yun Liu, Henghui Ding, Thomas Probst, Luc van Gool

To address this problem, we propose a Coarse-to-Fine Feature Mining (CFFM) technique to learn a unified presentation of static contexts and motional contexts.

Segmentation Semantic Segmentation +1

Instance-Specific Feature Propagation for Referring Segmentation

no code implementations26 Apr 2022 Chang Liu, Xudong Jiang, Henghui Ding

In this work, we propose a novel framework that simultaneously detects the target-of-interest via feature propagation and generates a fine-grained segmentation mask.

Instance Segmentation Segmentation +1

A Closer Look at Few-shot Image Generation

no code implementations CVPR 2022 Yunqing Zhao, Henghui Ding, Houjing Huang, Ngai-Man Cheung

Informed by our analysis and to slow down the diversity degradation of the target generator during adaptation, our second contribution proposes to apply mutual information (MI) maximization to retain the source domain's rich multi-level diversity information in the target domain generator.

10-shot image generation Contrastive Learning +1

Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging

1 code implementation20 May 2022 Yuanhao Cai, Jing Lin, Haoqian Wang, Xin Yuan, Henghui Ding, Yulun Zhang, Radu Timofte, Luc van Gool

In coded aperture snapshot spectral compressive imaging (CASSI) systems, hyperspectral image (HSI) reconstruction methods are employed to recover the spatial-spectral signal from a compressed measurement.

Compressive Sensing Image Reconstruction +1

Primitive3D: 3D Object Dataset Synthesis from Randomly Assembled Primitives

no code implementations CVPR 2022 Xinke Li, Henghui Ding, Zekun Tong, Yuwei Wu, Yeow Meng Chee

Further study suggests that our strategy can improve the model performance by pretraining and fine-tuning scheme, especially for the dataset with a small scale.

3D Object Classification Multi-Task Learning +1

Distilling Knowledge from Object Classification to Aesthetics Assessment

no code implementations2 Jun 2022 Jingwen Hou, Henghui Ding, Weisi Lin, Weide Liu, Yuming Fang

To deal with this dilemma, we propose to distill knowledge on semantic patterns for a vast variety of image contents from multiple pre-trained object classification (POC) models to an IAA model.

Classification Object

Long-tailed Recognition by Learning from Latent Categories

no code implementations2 Jun 2022 Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin

Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training.

Data Augmentation Long-tail Learning

Spatial Feature Mapping for 6DoF Object Pose Estimation

no code implementations3 Jun 2022 Jianhan Mei, Xudong Jiang, Henghui Ding

To address the problem of rotation symmetry ambiguity for objects, a spherical convolution is utilized and the spherical features are combined with the convolutional features that are mapped to the graph.

Object Pose Estimation

Tracking Every Thing in the Wild

1 code implementation26 Jul 2022 Siyuan Li, Martin Danelljan, Henghui Ding, Thomas E. Huang, Fisher Yu

Our experiments show that TETA evaluates trackers more comprehensively, and TETer achieves significant improvements on the challenging large-scale datasets BDD100K and TAO compared to the state-of-the-art.

Benchmarking Classification +2

Video Mask Transfiner for High-Quality Video Instance Segmentation

1 code implementation28 Jul 2022 Lei Ke, Henghui Ding, Martin Danelljan, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

While Video Instance Segmentation (VIS) has seen rapid progress, current approaches struggle to predict high-quality masks with accurate boundary details.

Instance Segmentation Semantic Segmentation +2

Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning

4 code implementations3 Oct 2022 Weicong Liang, Yuhui Yuan, Henghui Ding, Xiao Luo, WeiHong Lin, Ding Jia, Zheng Zhang, Chao Zhang, Han Hu

Vision transformers have recently achieved competitive results across various vision tasks but still suffer from heavy computation costs when processing a large number of tokens.

Clustering Depth Estimation +6

VLT: Vision-Language Transformer and Query Generation for Referring Segmentation

1 code implementation28 Oct 2022 Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang

We propose a Vision-Language Transformer (VLT) framework for referring segmentation to facilitate deep interactions among multi-modal information and enhance the holistic understanding to vision-language features.

Referring Expression Segmentation Referring Video Object Segmentation

Self-Regularized Prototypical Network for Few-Shot Semantic Segmentation

no code implementations30 Oct 2022 Henghui Ding, HUI ZHANG, Xudong Jiang

A direct yet effective prototype regularization on support set is proposed in SRPNet, in which the generated prototypes are evaluated and regularized on the support set itself.

Few-Shot Semantic Segmentation Segmentation +1

Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

2 code implementations ICCV 2023 Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy

Experiments on the COCO dataset with two settings: Open Vocabulary Instance Segmentation (OVIS) and Open Set Panoptic Segmentation (OSPS) demonstrate the superiority of the CGG.

Instance Segmentation Panoptic Segmentation +1

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

1 code implementation ICCV 2023 Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai

However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied.

Object Segmentation +3

Global Knowledge Calibration for Fast Open-Vocabulary Segmentation

no code implementations ICCV 2023 Kunyang Han, Yong liu, Jun Hao Liew, Henghui Ding, Yunchao Wei, Jiajun Liu, Yitong Wang, Yansong Tang, Yujiu Yang, Jiashi Feng, Yao Zhao

Recent advancements in pre-trained vision-language models, such as CLIP, have enabled the segmentation of arbitrary concepts solely from textual inputs, a process commonly referred to as open-vocabulary semantic segmentation (OVS).

Knowledge Distillation Open Vocabulary Semantic Segmentation +4

Federated Incremental Semantic Segmentation

1 code implementation CVPR 2023 Jiahua Dong, Duzhen Zhang, Yang Cong, Wei Cong, Henghui Ding, Dengxin Dai

Moreover, new clients collecting novel classes may join in the global training of FSS, which further exacerbates catastrophic forgetting.

Federated Learning Relation +2

Transformer-Based Visual Segmentation: A Survey

2 code implementations19 Apr 2023 Xiangtai Li, Henghui Ding, Haobo Yuan, Wenwei Zhang, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, Chen Change Loy

Recently, transformers, a type of neural network based on self-attention originally designed for natural language processing, have considerably surpassed previous convolutional or recurrent approaches in various vision processing tasks.

Autonomous Driving Point Cloud Segmentation +1

Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation

no code implementations24 May 2023 Chang Liu, Henghui Ding, Yulun Zhang, Xudong Jiang

However, the generic attention mechanism in Transformer only uses the language input for attention weight calculation, which does not explicitly fuse language features in its output.

Image Segmentation Semantic Segmentation

Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation

1 code implementation CVPR 2023 Shuting He, Henghui Ding, Wei Jiang

The inter-class relationships of semantic-related visual features are then required to be aligned with those in semantic space, thereby transferring semantic knowledge to visual feature learning.

Instance Segmentation Panoptic Segmentation +2

AdAM: Few-Shot Image Generation via Adaptation-Aware Kernel Modulation

no code implementations4 Jul 2023 Yunqing Zhao, Keshigeyan Chandrasegaran, Milad Abdollahzadeh, Chao Du, Tianyu Pang, Ruoteng Li, Henghui Ding, Ngai-Man Cheung

However, a major limitation of existing methods is that their knowledge preserving criteria consider only source domain/task and fail to consider target domain/adaptation in selecting source knowledge, casting doubt on their suitability for setups of different proximity between source and target domain.

Domain Adaptation Image Generation

Gradient-Semantic Compensation for Incremental Semantic Segmentation

no code implementations20 Jul 2023 Wei Cong, Yang Cong, Jiahua Dong, Gan Sun, Henghui Ding

To tackle the above challenges, in this paper, we propose a Gradient-Semantic Compensation (GSC) model, which surmounts incremental semantic segmentation from both gradient and semantic perspectives.

Segmentation Semantic Segmentation

Risk-optimized Outlier Removal for Robust 3D Point Cloud Classification

1 code implementation20 Jul 2023 Xinke Li, Junchi Lu, Henghui Ding, Changsheng Sun, Joey Tianyi Zhou, Chee Yeow Meng

With the growth of 3D sensing technology, deep learning system for 3D point clouds has become increasingly important, especially in applications like autonomous vehicles where safety is a primary concern.

3D Point Cloud Classification Autonomous Vehicles +4

MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

1 code implementation ICCV 2023 Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Chen Change Loy

To investigate the feasibility of using motion expressions to ground and segment objects in videos, we propose a large-scale dataset called MeViS, which contains numerous motion expressions to indicate target objects in complex environments.

Motion Expressions Guided Video Segmentation Object +6

GREC: Generalized Referring Expression Comprehension

1 code implementation30 Aug 2023 Shuting He, Henghui Ding, Chang Liu, Xudong Jiang

This dataset encompasses a range of expressions: those referring to multiple targets, expressions with no specific target, and the single-target expressions.

Generalized Referring Expression Comprehension Referring Expression +1

Region Generation and Assessment Network for Occluded Person Re-Identification

no code implementations7 Sep 2023 Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding

Then, to measure the importance of each generated region, we introduce a Region Assessment Module (RAM) that assigns confidence scores to different regions and reduces the negative impact of the occlusion regions by lower scores.

Person Re-Identification

Deep Geometrized Cartoon Line Inbetweening

1 code implementation ICCV 2023 Li SiYao, Tianpei Gu, Weiye Xiao, Henghui Ding, Ziwei Liu, Chen Change Loy

To preserve the precision and detail of the line drawings, we propose a new approach, AnimeInbet, which geometrizes raster line drawings into graphs of endpoints and reframes the inbetweening task as a graph fusion problem with vertex repositioning.

Learning-Based Biharmonic Augmentation for Point Cloud Classification

no code implementations10 Nov 2023 Jiacheng Wei, Guosheng Lin, Henghui Ding, Jie Hu, Kim-Hui Yap

Point cloud datasets often suffer from inadequate sample sizes in comparison to image datasets, making data augmentation challenging.

Classification Data Augmentation +1

VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search

no code implementations13 Nov 2023 Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, Henghui Ding

With the help of relational knowledge transfer, VGKT is capable of aligning semantic-group textual features with corresponding visual features without external tools and complex pairwise interaction.

Person Search Text based Person Retrieval +2

SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process

1 code implementation NeurIPS 2023 Mengyu Wang, Henghui Ding, Jun Hao Liew, Jiajun Liu, Yao Zhao, Yunchao Wei

We propose a model-agnostic solution called SegRefiner, which offers a novel perspective on this problem by interpreting segmentation refinement as a data generation process.

Denoising Dichotomous Image Segmentation +4

OMG-Seg: Is One Model Good Enough For All Segmentation?

1 code implementation18 Jan 2024 Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy

In this work, we address various segmentation tasks, each traditionally tackled by distinct or partially unified models.

Interactive Segmentation Panoptic Segmentation +3

Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition

1 code implementation3 Mar 2024 Kun-Yu Lin, Henghui Ding, Jiaming Zhou, Yi-Xing Peng, Zhilin Zhao, Chen Change Loy, Wei-Shi Zheng

To answer this, we establish a CROSS-domain Open-Vocabulary Action recognition benchmark named XOV-Action, and conduct a comprehensive evaluation of five state-of-the-art CLIP-based video learners under various types of domain gaps.

Open Vocabulary Action Recognition

Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment

no code implementations27 Mar 2024 Li SiYao, Tianpei Gu, Zhitao Yang, Zhengyu Lin, Ziwei Liu, Henghui Ding, Lei Yang, Chen Change Loy

We introduce a novel task within the field of 3D dance generation, termed dance accompaniment, which necessitates the generation of responsive movements from a dance partner, the "follower", synchronized with the lead dancer's movements and the underlying musical rhythm.

Cannot find the paper you are looking for? You can Submit a new open access paper.