Search Results for author: Xudong Jiang

Found 55 papers, 22 papers with code

Mitigating the Curse of Dimensionality for Certified Robustness via Dual Randomized Smoothing

no code implementations 15 Apr 2024 Song Xia, Yu Yi, Xudong Jiang, Henghui Ding

The proposed Dual Randomized Smoothing (DRS) down-samples the input image into two sub-images and smooths the two sub-images in lower dimensions.

Spin-UP: Spin Light for Natural Light Uncalibrated Photometric Stereo

1 code implementation 2 Apr 2024 Zongrui Li, Zhan Lu, Haojie Yan, Boxin Shi, Gang Pan, Qian Zheng, Xudong Jiang

Natural Light Uncalibrated Photometric Stereo (NaUPS) relieves the strict environment and light assumptions in classical Uncalibrated Photometric Stereo (UPS) methods.

Inverse Rendering

Skeleton-Guided Instance Separation for Fine-Grained Segmentation in Microscopy

no code implementations 18 Jan 2024 Jun Wang, Chengfeng Zhou, Zhaoyan Ming, Lina Wei, Xudong Jiang, Dahong Qian

One of the fundamental challenges in microscopy (MS) image analysis is instance segmentation (IS), particularly when segmenting cluster regions where multiple objects of varying sizes and shapes may be connected or even overlapped in arbitrary orientations.

Instance Segmentation Semantic Segmentation

DocMSU: A Comprehensive Benchmark for Document-level Multimodal Sarcasm Understanding

1 code implementation 26 Dec 2023 Hang Du, Guoshun Nan, Sicheng Zhang, Binzhu Xie, Junrui Xu, Hehe Fan, Qimei Cui, Xiaofeng Tao, Xudong Jiang

Multimodal Sarcasm Understanding (MSU) has a wide range of applications in the news field such as public opinion analysis and forgery detection.

Object Detection Sarcasm Detection +1

Pano-NeRF: Synthesizing High Dynamic Range Novel Views with Geometry from Sparse Low Dynamic Range Panoramic Images

no code implementations 26 Dec 2023 Zhan Lu, Qian Zheng, Boxin Shi, Xudong Jiang

However, in the case of inputting sparse Low Dynamic Range (LDR) panoramic images, NeRF often degrades with under-constrained geometry and is unable to reconstruct HDR radiance from LDR inputs.

HDR Reconstruction Lighting Estimation

InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models

1 code implementation 10 Dec 2023 Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan, Yap-Peng Tan, Weipeng Hu

While recent advancements have introduced control over factors such as object localization, posture, and image contours, a crucial gap remains in our ability to control the interactions between objects in the generated content.

Human-Object Interaction Generation Object

VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search

no code implementations 13 Nov 2023 Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, Henghui Ding

With the help of relational knowledge transfer, VGKT is capable of aligning semantic-group textual features with corresponding visual features without external tools and complex pairwise interaction.

Ranked #6 on Text based Person Retrieval on CUHK-PEDES (using extra training data)

Person Search Text based Person Retrieval +2

GREC: Generalized Referring Expression Comprehension

1 code implementation 30 Aug 2023 Shuting He, Henghui Ding, Chang Liu, Xudong Jiang

This dataset encompasses a range of expressions: those referring to multiple targets, expressions with no specific target, and the single-target expressions.

Generalized Referring Expression Comprehension Referring Expression +1

MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

1 code implementation ICCV 2023 Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Chen Change Loy

To investigate the feasibility of using motion expressions to ground and segment objects in videos, we propose a large-scale dataset called MeViS, which contains numerous motion expressions to indicate target objects in complex environments.

Motion Expressions Guided Video Segmentation Object +6

Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN

1 code implementation 31 May 2023 Yangfan Hu, Qian Zheng, Xudong Jiang, Gang Pan

However, due to the quantization error and accumulating error, it often requires lots of time steps (high inference latency) to achieve high performance, which negates SNN's advantages.

Image Classification object-detection +3

Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation

no code implementations 24 May 2023 Chang Liu, Henghui Ding, Yulun Zhang, Xudong Jiang

However, the generic attention mechanism in Transformer only uses the language input for attention weight calculation, which does not explicitly fuse language features in its output.

Image Segmentation Semantic Segmentation

DANI-Net: Uncalibrated Photometric Stereo by Differentiable Shadow Handling, Anisotropic Reflectance Modeling, and Neural Inverse Rendering

1 code implementation CVPR 2023 Zongrui Li, Qian Zheng, Boxin Shi, Gang Pan, Xudong Jiang

Although the ambiguity is alleviated on non-Lambertian objects, the problem is still difficult to solve for more general objects with complex shapes introducing irregular shadows and general materials with complex reflectance like anisotropic reflectance.

Inverse Rendering

Video Question Answering Using CLIP-Guided Visual-Text Attention

no code implementations 6 Mar 2023 Shuhong Ye, Weikai Kong, Chenglin Yao, Jianfeng Ren, Xudong Jiang

Specifically, we first extract video features using a TimeSformer and text features using a BERT from the target application domain, and utilize CLIP to extract a pair of visual-text features from the general-knowledge domain through the domain-specific learning.

General Knowledge Question Answering +1

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

1 code implementation ICCV 2023 Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai

However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied.

Object Segmentation +3

Decomposing 3D Neuroimaging into 2+1D Processing for Schizophrenia Recognition

no code implementations 21 Nov 2022 Mengjiao Hu, Xudong Jiang, Kang Sim, Juan Helen Zhou, Cuntai Guan

Deep learning has been successfully applied to recognizing both natural images and medical images.

Robust Reflection Removal with Flash-only Cues in the Wild

1 code implementation 5 Nov 2022 Chenyang Lei, Xudong Jiang, Qifeng Chen

We propose a simple yet effective reflection-free cue for robust reflection removal from a pair of flash and ambient (no-flash) images.

Reflection Removal

Self-Regularized Prototypical Network for Few-Shot Semantic Segmentation

no code implementations 30 Oct 2022 Henghui Ding, HUI ZHANG, Xudong Jiang

A direct yet effective prototype regularization on support set is proposed in SRPNet, in which the generated prototypes are evaluated and regularized on the support set itself.

Few-Shot Semantic Segmentation Segmentation +1

VLT: Vision-Language Transformer and Query Generation for Referring Segmentation

1 code implementation 28 Oct 2022 Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang

We propose a Vision-Language Transformer (VLT) framework for referring segmentation to facilitate deep interactions among multi-modal information and enhance the holistic understanding of vision-language features.

Referring Expression Segmentation Referring Video Object Segmentation

A Max-relevance-min-divergence Criterion for Data Discretization with Applications on Naive Bayes

no code implementations 20 Sep 2022 Shihe Wang, Jianfeng Ren, Ruibin Bai, Yuan YAO, Xudong Jiang

Thus, we propose a Max-Dependency-Min-Divergence (MDmD) criterion that maximizes both the discriminant information and generalization ability of the discretized data.

Attribute Classification

Boosting the Discriminant Power of Naive Bayes

no code implementations 20 Sep 2022 Shihe Wang, Jianfeng Ren, Xiaoyu Lian, Ruibin Bai, Xudong Jiang

In this paper, we propose a feature augmentation method employing a stack auto-encoder to reduce the noise in the data and boost the discriminant power of naive Bayes.

NeIF: Representing General Reflectance as Neural Intrinsics Fields for Uncalibrated Photometric Stereo

no code implementations 18 Aug 2022 Zongrui Li, Qian Zheng, Feishi Wang, Boxin Shi, Gang Pan, Xudong Jiang

Uncalibrated photometric stereo (UPS) is challenging due to the inherent ambiguity brought by unknown light.

Spatial Feature Mapping for 6DoF Object Pose Estimation

no code implementations 3 Jun 2022 Jianhan Mei, Xudong Jiang, Henghui Ding

To address the problem of rotation symmetry ambiguity for objects, a spherical convolution is utilized and the spherical features are combined with the convolutional features that are mapped to the graph.

Object Pose Estimation

Instance-Specific Feature Propagation for Referring Segmentation

no code implementations 26 Apr 2022 Chang Liu, Xudong Jiang, Henghui Ding

In this work, we propose a novel framework that simultaneously detects the target-of-interest via feature propagation and generates a fine-grained segmentation mask.

Instance Segmentation Segmentation +1

Human Activity Recognition Using 3D Orthogonally-projected EfficientNet on Radar Time-Range-Doppler Signature

no code implementations 24 Nov 2021 Zeyu Wang, Chenglin Yao, Jianfeng Ren, Xudong Jiang

In radar activity recognition, 2D signal representations such as spectrogram, cepstrum and cadence velocity diagram are often utilized, while range information is often neglected.

Human Activity Recognition

Two-stage Rule-induction Visual Reasoning on RPMs with an Application to Video Prediction

no code implementations 24 Nov 2021 Wentao He, Jianfeng Ren, Ruibin Bai, Xudong Jiang

Based on the two intrinsic natures of RPM problem, visual recognition and logical reasoning, we propose a Two-stage Rule-Induction Visual Reasoner (TRIVR), which consists of a perception module and a reasoning module, to tackle the challenges of real-world visual recognition and subsequent logical reasoning tasks, respectively.

Logical Reasoning Video Prediction +1

Complex common spatial patterns on time-frequency decomposed EEG for brain-computer interface

1 code implementation Pattern Recognition 2021 Vasilisa Mishuhina, Xudong Jiang

We propose a novel approach called time-frequency common spatial patterns (TFCSP) to enhance the robustness and accuracy of the electroencephalogram (EEG) signal classification.

Classification EEG +1

Panoramic Image Reflection Removal

no code implementations CVPR 2021 Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi

This paper studies the problem of panoramic image reflection removal, aiming at relieving the content ambiguity between reflection and transmission scenes.

Reflection Removal

Knowledge-aware Deep Framework for Collaborative Skin Lesion Segmentation and Melanoma Recognition

no code implementations 7 Jun 2021 XiaoHong Wang, Xudong Jiang, Henghui Ding, Yuqian Zhao, Jun Liu

In this paper, we propose a novel knowledge-aware deep framework that incorporates some clinical knowledge into collaborative learning of two important melanoma diagnosis tasks, i.e., skin lesion segmentation and melanoma recognition.

Clinical Knowledge Lesion Segmentation +3

Towards Enhancing Fine-grained Details for Image Matting

no code implementations 22 Jan 2021 Chang Liu, Henghui Ding, Xudong Jiang

In this paper, we argue that recovering these microscopic details relies on low-level but high-definition texture features.

Image Matting

Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing

no code implementations ICCV 2021 Henghui Ding, HUI ZHANG, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang

In this work, we treat each respective region in an image as a whole, and capture the structure topology as well as the affinity among different regions.

Scene Parsing

Feature Distillation With Guided Adversarial Contrastive Learning

no code implementations 21 Sep 2020 Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot

In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features.

Adversarial Robustness Contrastive Learning

Temporal Distinct Representation Learning for Action Recognition

no code implementations ECCV 2020 Junwu Weng, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xudong Jiang, Junsong Yuan

Motivated by the previous success of Two-Dimensional Convolutional Neural Network (2D CNN) on image recognition, researchers endeavor to leverage it to characterize videos.

Action Recognition Representation Learning

Brain MRI-based 3D Convolutional Neural Networks for Classification of Schizophrenia and Controls

no code implementations 14 Mar 2020 Mengjiao Hu, Kang Sim, Juan Helen Zhou, Xudong Jiang, Cuntai Guan

Convolutional Neural Networks (CNNs) have been successfully applied to the classification of both natural images and medical images, but have not yet been applied to differentiating patients with schizophrenia from healthy controls.

BIG-bench Machine Learning General Classification

Bi-directional Dermoscopic Feature Learning and Multi-scale Consistent Decision Fusion for Skin Lesion Segmentation

no code implementations 20 Feb 2020 Xiaohong Wang, Xudong Jiang, Henghui Ding, Jun Liu

Accurate segmentation of skin lesion from dermoscopic images is a crucial part of computer-aided diagnosis of melanoma.

Lesion Segmentation Skin Lesion Segmentation

Object 6D Pose Estimation with Non-local Attention

no code implementations 20 Feb 2020 Jianhan Mei, Henghui Ding, Xudong Jiang

In this paper, we address the challenging task of estimating 6D object pose from a single RGB image.

6D Pose Estimation Object +2

Semantic Correlation Promoted Shape-Variant Context for Segmentation

1 code implementation CVPR 2019 Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region.

Denoising Segmentation +1

Boundary-Aware Feature Propagation for Scene Segmentation

1 code implementation ICCV 2019 Henghui Ding, Xudong Jiang, Ai Qun Liu, Nadia Magnenat Thalmann, Gang Wang

Furthermore, we propose a boundary-aware feature propagation (BFP) module to harvest and propagate the local features within their regions isolated by the learned boundaries in the UAG-structured image.

Scene Segmentation Segmentation

SPLINE-Net: Sparse Photometric Stereo through Lighting Interpolation and Normal Estimation Networks

no code implementations ICCV 2019 Qian Zheng, Yiming Jia, Boxin Shi, Xudong Jiang, Ling-Yu Duan, Alex C. Kot

This paper solves the Sparse Photometric stereo through Lighting Interpolation and Normal Estimation using a generative Network (SPLINE-Net).

Toward Achieving Robust Low-Level and High-Level Scene Parsing

1 code implementation journal 2019 Bing Shuai, Henghui Ding, Ting Liu, Gang Wang, Xudong Jiang

Furthermore, we introduce a “dense skip” architecture to retain a rich set of low-level information from the pre-trained CNN, which is essential to improve the low-level parsing performance.

Scene Parsing Scene Segmentation +2

Feature Boosting Network For 3D Pose Estimation

no code implementations 15 Jan 2019 Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot

Learning a set of features that are reliable and discriminatively representative of the pose of a hand (or body) part is difficult due to the ambiguities, texture and illumination variation, and self-occlusion in the real application of 3D pose estimation.

3D Hand Pose Estimation 3D Pose Estimation

DCI: Discriminative and Contrast Invertible Descriptor

no code implementations 31 Dec 2018 Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang

In this paper, we propose a Discriminative and Contrast Invertible (DCI) local feature descriptor.

Object Object Recognition +1

Interest Point Detection based on Adaptive Ternary Coding

no code implementations 31 Dec 2018 Zhenwei Miao, Kim-Hui Yap, Xudong Jiang

In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism.

Face Recognition Interest Point Detection +1

Deformable Pose Traversal Convolution for 3D Action and Gesture Recognition

no code implementations ECCV 2018 Junwu Weng, Mengyuan Liu, Xudong Jiang, Junsong Yuan

This deformable convolution can better utilize contextual joints for action and gesture recognition and is more robust to noisy joints.

Hand Gesture Recognition Hand-Gesture Recognition

Context Contrasted Feature and Gated Multi-Scale Aggregation for Scene Segmentation

1 code implementation CVPR 2018 Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

In this paper, we first propose a novel context contrasted local feature that not only leverages the informative context but also spotlights the local information in contrast to the context.

Scene Segmentation Segmentation
