Search Results for author: Xudong Jiang

Found 43 papers, 15 papers with code

Fast-SNN: Fast Spiking Neural Network by Converting Quantized ANN

1 code implementation31 May 2023 Yangfan Hu, Qian Zheng, Xudong Jiang, Gang Pan

However, due to the quantization error and accumulating error, it often requires lots of time steps (high inference latency) to achieve high performance, which negates SNN's advantages.

Image Classification object-detection +3

Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation

no code implementations24 May 2023 Chang Liu, Henghui Ding, Yulun Zhang, Xudong Jiang

However, the generic attention mechanism in Transformer only uses the language input for attention weight calculation, which does not explicitly fuse language features in its output.

Image Segmentation Semantic Segmentation

DANI-Net: Uncalibrated Photometric Stereo by Differentiable Shadow Handling, Anisotropic Reflectance Modeling, and Neural Inverse Rendering

1 code implementation CVPR 2023 Zongrui Li, Qian Zheng, Boxin Shi, Gang Pan, Xudong Jiang

Although the ambiguity is alleviated on non-Lambertian objects, the problem is still difficult to solve for more general objects with complex shapes introducing irregular shadows and general materials with complex reflectance like anisotropic reflectance.

Inverse Rendering

Video Question Answering Using CLIP-Guided Visual-Text Attention

no code implementations6 Mar 2023 Shuhong Ye, Weikai Kong, Chenglin Yao, Jianfeng Ren, Xudong Jiang

Specifically, we first extract video features using a TimeSformer and text features using a BERT from the target application domain, and utilize CLIP to extract a pair of visual-text features from the general-knowledge domain through the domain-specific learning.

General Knowledge Question Answering +1

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

1 code implementation3 Feb 2023 Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai

However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied.

Semantic Segmentation Video Object Segmentation +1

Decomposing 3D Neuroimaging into 2+1D Processing for Schizophrenia Recognition

no code implementations21 Nov 2022 Mengjiao Hu, Xudong Jiang, Kang Sim, Juan Helen Zhou, Cuntai Guan

Deep learning has been successfully applied to recognizing both natural images and medical images.

Robust Reflection Removal with Flash-only Cues in the Wild

1 code implementation5 Nov 2022 Chenyang Lei, Xudong Jiang, Qifeng Chen

We propose a simple yet effective reflection-free cue for robust reflection removal from a pair of flash and ambient (no-flash) images.

Reflection Removal

Self-Regularized Prototypical Network for Few-Shot Semantic Segmentation

no code implementations30 Oct 2022 Henghui Ding, HUI ZHANG, Xudong Jiang

A direct yet effective prototype regularization on support set is proposed in SRPNet, in which the generated prototypes are evaluated and regularized on the support set itself.

Few-Shot Semantic Segmentation Semantic Segmentation

VLT: Vision-Language Transformer and Query Generation for Referring Segmentation

1 code implementation28 Oct 2022 Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang

We propose a Vision-Language Transformer (VLT) framework for referring segmentation to facilitate deep interactions among multi-modal information and enhance the holistic understanding to vision-language features.

 Ranked #1 on Referring Video Object Segmentation on Refer-YouTube-VOS (using extra training data)

Referring Expression Segmentation Referring Video Object Segmentation

Boosting the Discriminant Power of Naive Bayes

no code implementations20 Sep 2022 Shihe Wang, Jianfeng Ren, Xiaoyu Lian, Ruibin Bai, Xudong Jiang

In this paper, we propose a feature augmentation method employing a stack auto-encoder to reduce the noise in the data and boost the discriminant power of naive Bayes.

A Max-relevance-min-divergence Criterion for Data Discretization with Applications on Naive Bayes

no code implementations20 Sep 2022 Shihe Wang, Jianfeng Ren, Ruibin Bai, Yuan YAO, Xudong Jiang

Thus, we propose a Max-Dependency-Min-Divergence (MDmD) criterion that maximizes both the discriminant information and generalization ability of the discretized data.


NeIF: Representing General Reflectance as Neural Intrinsics Fields for Uncalibrated Photometric Stereo

no code implementations18 Aug 2022 Zongrui Li, Qian Zheng, Feishi Wang, Boxin Shi, Gang Pan, Xudong Jiang

Uncalibrated photometric stereo (UPS) is challenging due to the inherent ambiguity brought by unknown light.

Spatial Feature Mapping for 6DoF Object Pose Estimation

no code implementations3 Jun 2022 Jianhan Mei, Xudong Jiang, Henghui Ding

To address the problem of rotation symmetry ambiguity for objects, a spherical convolution is utilized and the spherical features are combined with the convolutional features that are mapped to the graph.

Pose Estimation

Instance-Specific Feature Propagation for Referring Segmentation

no code implementations26 Apr 2022 Chang Liu, Xudong Jiang, Henghui Ding

In this work, we propose a novel framework that simultaneously detects the target-of-interest via feature propagation and generates a fine-grained segmentation mask.

Instance Segmentation Semantic Segmentation

Two-stage Rule-induction Visual Reasoning on RPMs with an Application to Video Prediction

no code implementations24 Nov 2021 Wentao He, Jianfeng Ren, Ruibin Bai, Xudong Jiang

Based on the two intrinsic natures of RPM problem, visual recognition and logical reasoning, we propose a Two-stage Rule-Induction Visual Reasoner (TRIVR), which consists of a perception module and a reasoning module, to tackle the challenges of real-world visual recognition and subsequent logical reasoning tasks, respectively.

Logical Reasoning Video Prediction +1

Human Activity Recognition Using 3D Orthogonally-projected EfficientNet on Radar Time-Range-Doppler Signature

no code implementations24 Nov 2021 Zeyu Wang, Chenglin Yao, Jianfeng Ren, Xudong Jiang

In radar activity recognition, 2D signal representations such as spectrogram, cepstrum and cadence velocity diagram are often utilized, while range information is often neglected.

Human Activity Recognition

Vision-Language Transformer and Query Generation for Referring Segmentation

1 code implementation ICCV 2021 Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang

We introduce transformer and multi-head attention to build a network with an encoder-decoder attention mechanism architecture that "queries" the given image with the language expression.

Generalized Referring Expression Segmentation

Complex common spatial patterns on time-frequency decomposed EEG for brain-computer interface

1 code implementation Pattern Recognition 2021 Vasilisa Mishuhina, Xudong Jiang

We propose a novel approach called time-frequency common spatial patterns (TFCSP) to enhance the robustness and accuracy of the electroencephalogram (EEG) signal classification.

Classification Electroencephalogram (EEG)

Panoramic Image Reflection Removal

no code implementations CVPR 2021 Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi

This paper studies the problem of panoramic image reflection removal, aiming at reliving the content ambiguity between reflection and transmission scenes.

Reflection Removal

Knowledge-aware Deep Framework for Collaborative Skin Lesion Segmentation and Melanoma Recognition

no code implementations7 Jun 2021 XiaoHong Wang, Xudong Jiang, Henghui Ding, Yuqian Zhao, Jun Liu

In this paper, we propose a novel knowledge-aware deep framework that incorporates some clinical knowledge into collaborative learning of two important melanoma diagnosis tasks, i. e., skin lesion segmentation and melanoma recognition.

Clinical Knowledge Lesion Segmentation +2

Towards Enhancing Fine-grained Details for Image Matting

no code implementations22 Jan 2021 Chang Liu, Henghui Ding, Xudong Jiang

In this paper, we argue that recovering these microscopic details relies on low-level but high-definition texture features.

Image Matting

Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing

no code implementations ICCV 2021 Henghui Ding, HUI ZHANG, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang

In this work, we treat each respective region in an image as a whole, and capture the structure topology as well as the affinity among different regions.

Scene Parsing

Feature Distillation With Guided Adversarial Contrastive Learning

no code implementations21 Sep 2020 Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot

In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features.

Adversarial Robustness Contrastive Learning

Temporal Distinct Representation Learning for Action Recognition

no code implementations ECCV 2020 Junwu Weng, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xudong Jiang, Junsong Yuan

Motivated by the previous success of Two-Dimensional Convolutional Neural Network (2D CNN) on image recognition, researchers endeavor to leverage it to characterize videos.

Action Recognition Representation Learning

Brain MRI-based 3D Convolutional Neural Networks for Classification of Schizophrenia and Controls

no code implementations14 Mar 2020 Mengjiao Hu, Kang Sim, Juan Helen Zhou, Xudong Jiang, Cuntai Guan

Convolutional Neural Network (CNN) has been successfully applied on classification of both natural images and medical images but not yet been applied to differentiating patients with schizophrenia from healthy controls.

BIG-bench Machine Learning General Classification

Object 6D Pose Estimation with Non-local Attention

no code implementations20 Feb 2020 Jianhan Mei, Henghui Ding, Xudong Jiang

In this paper, we address the challenging task of estimating 6D object pose from a single RGB image.

6D Pose Estimation object-detection +1

Bi-directional Dermoscopic Feature Learning and Multi-scale Consistent Decision Fusion for Skin Lesion Segmentation

no code implementations20 Feb 2020 Xiaohong Wang, Xudong Jiang, Henghui Ding, Jun Liu

Accurate segmentation of skin lesion from dermoscopic images is a crucial part of computer-aided diagnosis of melanoma.

Lesion Segmentation Skin Lesion Segmentation

Semantic Correlation Promoted Shape-Variant Context for Segmentation

1 code implementation CVPR 2019 Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region.

Denoising Semantic Segmentation

Boundary-Aware Feature Propagation for Scene Segmentation

1 code implementation ICCV 2019 Henghui Ding, Xudong Jiang, Ai Qun Liu, Nadia Magnenat Thalmann, Gang Wang

Furthermore, we propose a boundary-aware feature propagation (BFP) module to harvest and propagate the local features within their regions isolated by the learned boundaries in the UAG-structured image.

Scene Segmentation

SPLINE-Net: Sparse Photometric Stereo through Lighting Interpolation and Normal Estimation Networks

no code implementations ICCV 2019 Qian Zheng, Yiming Jia, Boxin Shi, Xudong Jiang, Ling-Yu Duan, Alex C. Kot

This paper solves the Sparse Photometric stereo through Lighting Interpolation and Normal Estimation using a generative Network (SPLINE-Net).

Toward Achieving Robust Low-Level and High-Level Scene Parsing

1 code implementation journal 2019 Bing Shuai, Henghui Ding, Ting Liu, Gang Wang, Xudong Jiang

Furthermore, we introduce a “dense skip” architecture to retain a rich set of low-level information from the pre-trained CNN, which is essential to improve the low-level parsing performance.

Scene Parsing Scene Segmentation +1

Feature Boosting Network For 3D Pose Estimation

no code implementations15 Jan 2019 Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot

Learning a set of features that are reliable and discriminatively representative of the pose of a hand (or body) part is difficult due to the ambiguities, texture and illumination variation, and self-occlusion in the real application of 3D pose estimation.

3D Hand Pose Estimation 3D Pose Estimation

Interest Point Detection based on Adaptive Ternary Coding

no code implementations31 Dec 2018 Zhenwei Miao, Kim-Hui Yap, Xudong Jiang

In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism.

Face Recognition Interest Point Detection +1

DCI: Discriminative and Contrast Invertible Descriptor

no code implementations31 Dec 2018 Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang

In this paper, we proposed a Discriminative and Contrast Invertible (DCI) local feature descriptor.

Object Recognition Retrieval

Deformable Pose Traversal Convolution for 3D Action and Gesture Recognition

no code implementations ECCV 2018 Junwu Weng, Mengyuan Liu, Xudong Jiang, Junsong Yuan

This deformable convolution can better utilize contextual joints for action and gesture recognition and is more robust to noisy joints.

Hand Gesture Recognition Hand-Gesture Recognition

Context Contrasted Feature and Gated Multi-Scale Aggregation for Scene Segmentation

1 code implementation CVPR 2018 Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang

In this paper, we first propose a novel context contrasted local feature that not only leverages the informative context but also spotlights the local information in contrast to the context.

Scene Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.