Search Results for author: Xudong Jiang

Found 55 papers, 22 papers with code

PhraseClick: Toward Achieving Flexible Interactive Segmentation by Phrase and Click

no code implementations • ECCV 2020 • Henghui Ding, Scott Cohen, Brian Price, Xudong Jiang

We propose to employ phrase expressions as another interaction input to infer the attributes of the target object.

Mitigating the Curse of Dimensionality for Certified Robustness via Dual Randomized Smoothing

no code implementations • 15 Apr 2024 • Song Xia, Yu Yi, Xudong Jiang, Henghui Ding

The proposed Dual Randomized Smoothing (DRS) down-samples the input image into two sub-images and smooths the two sub-images in lower dimensions.

Spin-UP: Spin Light for Natural Light Uncalibrated Photometric Stereo

1 code implementation • 2 Apr 2024 • Zongrui Li, Zhan Lu, Haojie Yan, Boxin Shi, Gang Pan, Qian Zheng, Xudong Jiang

Natural Light Uncalibrated Photometric Stereo (NaUPS) relieves the strict environment and light assumptions in classical Uncalibrated Photometric Stereo (UPS) methods.

Inverse Rendering

Class-incremental Learning for Time Series: Benchmark and Evaluation

1 code implementation • 19 Feb 2024 • Zhongzheng Qiao, Quang Pham, Zhen Cao, Hoang H Le, P. N. Suganthan, Xudong Jiang, Ramasamy Savitha

Real-world environments are inherently non-stationary, frequently introducing new classes over time.

Benchmarking Class Incremental Learning +4

Skeleton-Guided Instance Separation for Fine-Grained Segmentation in Microscopy

no code implementations • 18 Jan 2024 • Jun Wang, Chengfeng Zhou, Zhaoyan Ming, Lina Wei, Xudong Jiang, Dahong Qian

One of the fundamental challenges in microscopy (MS) image analysis is instance segmentation (IS), particularly when segmenting cluster regions where multiple objects of varying sizes and shapes may be connected or even overlapped in arbitrary orientations.

Instance Segmentation Semantic Segmentation

DocMSU: A Comprehensive Benchmark for Document-level Multimodal Sarcasm Understanding

1 code implementation • 26 Dec 2023 • Hang Du, Guoshun Nan, Sicheng Zhang, Binzhu Xie, Junrui Xu, Hehe Fan, Qimei Cui, Xiaofeng Tao, Xudong Jiang

Multimodal Sarcasm Understanding (MSU) has a wide range of applications in the news field such as public opinion analysis and forgery detection.

Object Detection Sarcasm Detection +1

Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images

no code implementations • 26 Dec 2023 • Zhan Lu, Qian Zheng, Boxin Shi, Xudong Jiang

However, in the case of inputting sparse Low Dynamic Range (LDR) panoramic images, NeRF often degrades with under-constrained geometry and is unable to reconstruct HDR radiance from LDR inputs.

HDR Reconstruction Lighting Estimation

InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models

1 code implementation • 10 Dec 2023 • Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan, Yap-Peng Tan, Weipeng Hu

While recent advancements have introduced control over factors such as object localization, posture, and image contours, a crucial gap remains in our ability to control the interactions between objects in the generated content.

Human-Object Interaction Generation Object

Multi-View Spectrogram Transformer for Respiratory Sound Classification

no code implementations • 16 Nov 2023 • Wentao He, Yuchen Yan, Jianfeng Ren, Ruibin Bai, Xudong Jiang

Deep neural networks have been applied to audio spectrograms for respiratory sound classification.

Classification Sound Classification

VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search

no code implementations • 13 Nov 2023 • Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, Henghui Ding

With the help of relational knowledge transfer, VGKT is capable of aligning semantic-group textual features with corresponding visual features without external tools and complex pairwise interaction.

Ranked #6 on Text based Person Retrieval on CUHK-PEDES (using extra training data)

Person Search Text based Person Retrieval +2

GREC: Generalized Referring Expression Comprehension

1 code implementation • 30 Aug 2023 • Shuting He, Henghui Ding, Chang Liu, Xudong Jiang

This dataset encompasses a range of expressions: those referring to multiple targets, expressions with no specific target, and the single-target expressions.

Generalized Referring Expression Comprehension Referring Expression +1

162

MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

1 code implementation • ICCV 2023 • Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Chen Change Loy

To investigate the feasibility of using motion expressions to ground and segment objects in videos, we propose a large-scale dataset called MeViS, which contains numerous motion expressions to indicate target objects in complex environments.

Ranked #2 on Referring Video Object Segmentation on MeViS

Motion Expressions Guided Video Segmentation Object +6

457

Towards Open Vocabulary Learning: A Survey

1 code implementation • 28 Jun 2023 • Jianzong Wu, Xiangtai Li, Shilin Xu, Haobo Yuan, Henghui Ding, Yibo Yang, Xia Li, Jiangning Zhang, Yunhai Tong, Xudong Jiang, Bernard Ghanem, Dacheng Tao

To our knowledge, this is the first comprehensive literature review of open vocabulary learning.

Open Set Learning Out-of-Distribution Detection +3

634

GRES: Generalized Referring Expression Segmentation

2 code implementations • CVPR 2023 • Chang Liu, Henghui Ding, Xudong Jiang

Existing classic RES datasets and methods commonly support single-target expressions only, i.e., one expression refers to one target object.

Ranked #2 on Generalized Referring Expression Segmentation on gRefCOCO

Generalized Referring Expression Segmentation Referring Expression +1

647

Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN

1 code implementation • 31 May 2023 • Yangfan Hu, Qian Zheng, Xudong Jiang, Gang Pan

However, due to the quantization error and accumulating error, it often requires lots of time steps (high inference latency) to achieve high performance, which negates SNN's advantages.

Image Classification object-detection +3

Mask Attack Detection Using Vascular-weighted Motion-robust rPPG Signals

no code implementations • 25 May 2023 • Chenglin Yao, Jianfeng Ren, Ruibin Bai, Heshan Du, Jiang Liu, Xudong Jiang

Detecting 3D mask attacks on a face recognition system is challenging.

Face Alignment Face Anti-Spoofing +1

Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation

no code implementations • 24 May 2023 • Chang Liu, Henghui Ding, Yulun Zhang, Xudong Jiang

However, the generic attention mechanism in Transformer only uses the language input for attention weight calculation, which does not explicitly fuse language features in its output.

Image Segmentation Semantic Segmentation

Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation

1 code implementation • 23 May 2023 • Shuting He, Xudong Jiang, Wei Jiang, Henghui Ding

In this work, we address the challenging task of few-shot and zero-shot 3D point cloud semantic segmentation.

Few-shot 3D semantic segmentation Segmentation

DANI-Net: Uncalibrated Photometric Stereo by Differentiable Shadow Handling, Anisotropic Reflectance Modeling, and Neural Inverse Rendering

1 code implementation • CVPR 2023 • Zongrui Li, Qian Zheng, Boxin Shi, Gang Pan, Xudong Jiang

Although the ambiguity is alleviated on non-Lambertian objects, the problem is still difficult to solve for more general objects with complex shapes introducing irregular shadows and general materials with complex reflectance like anisotropic reflectance.

Inverse Rendering

Video Question Answering Using CLIP-Guided Visual-Text Attention

no code implementations • 6 Mar 2023 • Shuhong Ye, Weikai Kong, Chenglin Yao, Jianfeng Ren, Xudong Jiang

Specifically, we first extract video features using TimeSformer and text features using BERT from the target application domain, and then use CLIP to extract a pair of visual-text features from the general-knowledge domain through domain-specific learning.

General Knowledge Question Answering +1

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

1 code implementation • ICCV 2023 • Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai

However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied.

Object Segmentation +3

291

Decomposing 3D Neuroimaging into 2+1D Processing for Schizophrenia Recognition

no code implementations • 21 Nov 2022 • Mengjiao Hu, Xudong Jiang, Kang Sim, Juan Helen Zhou, Cuntai Guan

Deep learning has been successfully applied to recognizing both natural images and medical images.

Robust Reflection Removal with Flash-only Cues in the Wild

1 code implementation • 5 Nov 2022 • Chenyang Lei, Xudong Jiang, Qifeng Chen

We propose a simple yet effective reflection-free cue for robust reflection removal from a pair of flash and ambient (no-flash) images.

Reflection Removal

214

Self-Regularized Prototypical Network for Few-Shot Semantic Segmentation

no code implementations • 30 Oct 2022 • Henghui Ding, Hui Zhang, Xudong Jiang

A direct yet effective prototype regularization on support set is proposed in SRPNet, in which the generated prototypes are evaluated and regularized on the support set itself.

Few-Shot Semantic Segmentation Segmentation +1

VLT: Vision-Language Transformer and Query Generation for Referring Segmentation

1 code implementation • 28 Oct 2022 • Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang

We propose a Vision-Language Transformer (VLT) framework for referring segmentation to facilitate deep interactions among multi-modal information and enhance the holistic understanding of vision-language features.

Ranked #3 on Referring Video Object Segmentation on MeViS

Referring Expression Segmentation Referring Video Object Segmentation

334

A Max-relevance-min-divergence Criterion for Data Discretization with Applications on Naive Bayes

no code implementations • 20 Sep 2022 • Shihe Wang, Jianfeng Ren, Ruibin Bai, Yuan Yao, Xudong Jiang

Thus, we propose a Max-Dependency-Min-Divergence (MDmD) criterion that maximizes both the discriminant information and generalization ability of the discretized data.

Attribute Classification

Boosting the Discriminant Power of Naive Bayes

no code implementations • 20 Sep 2022 • Shihe Wang, Jianfeng Ren, Xiaoyu Lian, Ruibin Bai, Xudong Jiang

In this paper, we propose a feature augmentation method employing a stack auto-encoder to reduce the noise in the data and boost the discriminant power of naive Bayes.

NeIF: Representing General Reflectance as Neural Intrinsics Fields for Uncalibrated Photometric Stereo

no code implementations • 18 Aug 2022 • Zongrui Li, Qian Zheng, Feishi Wang, Boxin Shi, Gang Pan, Xudong Jiang

Uncalibrated photometric stereo (UPS) is challenging due to the inherent ambiguity brought by unknown light.

Spatial Feature Mapping for 6DoF Object Pose Estimation

no code implementations • 3 Jun 2022 • Jianhan Mei, Xudong Jiang, Henghui Ding

To address the problem of rotation symmetry ambiguity for objects, a spherical convolution is utilized and the spherical features are combined with the convolutional features that are mapped to the graph.

Object Pose Estimation

Instance-Specific Feature Propagation for Referring Segmentation

no code implementations • 26 Apr 2022 • Chang Liu, Xudong Jiang, Henghui Ding

In this work, we propose a novel framework that simultaneously detects the target-of-interest via feature propagation and generates a fine-grained segmentation mask.

Instance Segmentation Segmentation +1

Attention-based Dual-stream Vision Transformer for Radar Gait Recognition

no code implementations • 24 Nov 2021 • Shiliang Chen, Wentao He, Jianfeng Ren, Xudong Jiang

Radar gait recognition is robust to light variations and infringes less on privacy.

Gait Recognition

Human Activity Recognition Using 3D Orthogonally-projected EfficientNet on Radar Time-Range-Doppler Signature

no code implementations • 24 Nov 2021 • Zeyu Wang, Chenglin Yao, Jianfeng Ren, Xudong Jiang

In radar activity recognition, 2D signal representations such as spectrogram, cepstrum and cadence velocity diagram are often utilized, while range information is often neglected.

Human Activity Recognition

Two-stage Rule-induction Visual Reasoning on RPMs with an Application to Video Prediction

no code implementations • 24 Nov 2021 • Wentao He, Jianfeng Ren, Ruibin Bai, Xudong Jiang

Based on the two intrinsic natures of RPM problem, visual recognition and logical reasoning, we propose a Two-stage Rule-Induction Visual Reasoner (TRIVR), which consists of a perception module and a reasoning module, to tackle the challenges of real-world visual recognition and subsequent logical reasoning tasks, respectively.

Logical Reasoning Video Prediction +1

Vision-Language Transformer and Query Generation for Referring Segmentation

1 code implementation • ICCV 2021 • Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang

We introduce transformer and multi-head attention to build a network with an encoder-decoder attention mechanism architecture that "queries" the given image with the language expression.

Ranked #3 on Generalized Referring Expression Comprehension on gRefCOCO

Generalized Referring Expression Comprehension Generalized Referring Expression Segmentation +1

334

Complex common spatial patterns on time-frequency decomposed EEG for brain-computer interface

1 code implementation • Pattern Recognition 2021 • Vasilisa Mishuhina, Xudong Jiang

We propose a novel approach called time-frequency common spatial patterns (TFCSP) to enhance the robustness and accuracy of the electroencephalogram (EEG) signal classification.

Classification EEG +1

Single Image Reflection Removal With Absorption Effect

1 code implementation • CVPR 2021 • Qian Zheng, Boxin Shi, Jinnan Chen, Xudong Jiang, Ling-Yu Duan, Alex C. Kot

In this paper, we consider the absorption effect for the problem of single image reflection removal.

Reflection Removal

Panoramic Image Reflection Removal

no code implementations • CVPR 2021 • Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi

This paper studies the problem of panoramic image reflection removal, aiming at relieving the content ambiguity between reflection and transmission scenes.

Reflection Removal

Knowledge-aware Deep Framework for Collaborative Skin Lesion Segmentation and Melanoma Recognition

no code implementations • 7 Jun 2021 • Xiaohong Wang, Xudong Jiang, Henghui Ding, Yuqian Zhao, Jun Liu

In this paper, we propose a novel knowledge-aware deep framework that incorporates some clinical knowledge into collaborative learning of two important melanoma diagnosis tasks, i.e., skin lesion segmentation and melanoma recognition.

Clinical Knowledge Lesion Segmentation +3

Towards Enhancing Fine-grained Details for Image Matting

no code implementations • 22 Jan 2021 • Chang Liu, Henghui Ding, Xudong Jiang

In this paper, we argue that recovering these microscopic details relies on low-level but high-definition texture features.

Image Matting

Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing

no code implementations • ICCV 2021 • Henghui Ding, Hui Zhang, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang

In this work, we treat each respective region in an image as a whole, and capture the structure topology as well as the affinity among different regions.

Scene Parsing

Feature Distillation With Guided Adversarial Contrastive Learning

no code implementations • 21 Sep 2020 • Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot

In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features.

Adversarial Robustness Contrastive Learning

Temporal Distinct Representation Learning for Action Recognition

no code implementations • ECCV 2020 • Junwu Weng, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xudong Jiang, Junsong Yuan

Motivated by the previous success of Two-Dimensional Convolutional Neural Network (2D CNN) on image recognition, researchers endeavor to leverage it to characterize videos.

Action Recognition Representation Learning

Brain MRI-based 3D Convolutional Neural Networks for Classification of Schizophrenia and Controls

no code implementations • 14 Mar 2020 • Mengjiao Hu, Kang Sim, Juan Helen Zhou, Xudong Jiang, Cuntai Guan

Convolutional Neural Networks (CNNs) have been successfully applied to the classification of both natural images and medical images, but have not yet been applied to differentiating patients with schizophrenia from healthy controls.

BIG-bench Machine Learning General Classification

Bi-directional Dermoscopic Feature Learning and Multi-scale Consistent Decision Fusion for Skin Lesion Segmentation

no code implementations • 20 Feb 2020 • Xiaohong Wang, Xudong Jiang, Henghui Ding, Jun Liu

Accurate segmentation of skin lesions from dermoscopic images is a crucial part of computer-aided diagnosis of melanoma.

Lesion Segmentation Skin Lesion Segmentation

Object 6D Pose Estimation with Non-local Attention

no code implementations • 20 Feb 2020 • Jianhan Mei, Henghui Ding, Xudong Jiang

In this paper, we address the challenging task of estimating 6D object pose from a single RGB image.

6D Pose Estimation Object +2

Semantic Correlation Promoted Shape-Variant Context for Segmentation

1 code implementation • CVPR 2019 • Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region.

Ranked #13 on Semantic Segmentation on COCO-Stuff test

Denoising Segmentation +1

Boundary-Aware Feature Propagation for Scene Segmentation

1 code implementation • ICCV 2019 • Henghui Ding, Xudong Jiang, Ai Qun Liu, Nadia Magnenat Thalmann, Gang Wang

Furthermore, we propose a boundary-aware feature propagation (BFP) module to harvest and propagate the local features within their regions isolated by the learned boundaries in the UAG-structured image.

Ranked #38 on Semantic Segmentation on PASCAL Context

Scene Segmentation Segmentation

SPLINE-Net: Sparse Photometric Stereo through Lighting Interpolation and Normal Estimation Networks

no code implementations • ICCV 2019 • Qian Zheng, Yiming Jia, Boxin Shi, Xudong Jiang, Ling-Yu Duan, Alex C. Kot

This paper solves the Sparse Photometric stereo through Lighting Interpolation and Normal Estimation using a generative Network (SPLINE-Net).

Toward Achieving Robust Low-Level and High-Level Scene Parsing

1 code implementation • journal 2019 • Bing Shuai, Henghui Ding, Ting Liu, Gang Wang, Xudong Jiang

Furthermore, we introduce a “dense skip” architecture to retain a rich set of low-level information from the pre-trained CNN, which is essential to improve the low-level parsing performance.

Scene Parsing Scene Segmentation +2

Feature Boosting Network For 3D Pose Estimation

no code implementations • 15 Jan 2019 • Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot

Learning a set of features that are reliable and discriminatively representative of the pose of a hand (or body) part is difficult due to the ambiguities, texture and illumination variation, and self-occlusion in the real application of 3D pose estimation.

3D Hand Pose Estimation 3D Pose Estimation

DCI: Discriminative and Contrast Invertible Descriptor

no code implementations • 31 Dec 2018 • Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang

In this paper, we propose a Discriminative and Contrast Invertible (DCI) local feature descriptor.

Object Object Recognition +1

Interest Point Detection based on Adaptive Ternary Coding

no code implementations • 31 Dec 2018 • Zhenwei Miao, Kim-Hui Yap, Xudong Jiang

In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism.

Face Recognition Interest Point Detection +1

Deformable Pose Traversal Convolution for 3D Action and Gesture Recognition

no code implementations • ECCV 2018 • Junwu Weng, Mengyuan Liu, Xudong Jiang, Junsong Yuan

This deformable convolution can better utilize contextual joints for action and gesture recognition and is more robust to noisy joints.

Hand Gesture Recognition Hand-Gesture Recognition

Context Contrasted Feature and Gated Multi-Scale Aggregation for Scene Segmentation

1 code implementation • CVPR 2018 • Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

In this paper, we first propose a novel context contrasted local feature that not only leverages the informative context but also spotlights the local information in contrast to the context.

Ranked #16 on Semantic Segmentation on COCO-Stuff test

Scene Segmentation Segmentation

Feature Weighting and Regularization of Common Spatial Patterns in EEG-Based Motor Imagery BCI

1 code implementation • IEEE Signal Processing Letters 2018 • Vasilisa Mishuhina, Xudong Jiang

Electroencephalography signals have very low spatial resolution, and the signals captured by different electrodes overlap.

EEG feature selection +1
