no code implementations • ECCV 2020 • Henghui Ding, Scott Cohen, Brian Price, Xudong Jiang
We propose to employ phrase expressions as another interaction input to infer the attributes of target object.
1 code implementation • CVPR 2023 • Chang Liu, Henghui Ding, Xudong Jiang
Existing classic RES datasets and methods commonly support single-target expressions only, i. e., one expression refers to one target object.
Generalized Referring Expression Segmentation
Referring Expression
1 code implementation • 31 May 2023 • Yangfan Hu, Qian Zheng, Xudong Jiang, Gang Pan
However, due to the quantization error and accumulating error, it often requires lots of time steps (high inference latency) to achieve high performance, which negates SNN's advantages.
no code implementations • 25 May 2023 • Chenglin Yao, Jianfeng Ren, Ruibin Bai, Heshan Du, Jiang Liu, Xudong Jiang
Detecting 3D mask attacks to a face recognition system is challenging.
no code implementations • 24 May 2023 • Chang Liu, Henghui Ding, Yulun Zhang, Xudong Jiang
However, the generic attention mechanism in Transformer only uses the language input for attention weight calculation, which does not explicitly fuse language features in its output.
1 code implementation • 23 May 2023 • Shuting He, Xudong Jiang, Wei Jiang, Henghui Ding
In this work, we address the challenging task of few-shot and zero-shot 3D point cloud semantic segmentation.
1 code implementation • CVPR 2023 • Zongrui Li, Qian Zheng, Boxin Shi, Gang Pan, Xudong Jiang
Although the ambiguity is alleviated on non-Lambertian objects, the problem is still difficult to solve for more general objects with complex shapes introducing irregular shadows and general materials with complex reflectance like anisotropic reflectance.
no code implementations • 6 Mar 2023 • Shuhong Ye, Weikai Kong, Chenglin Yao, Jianfeng Ren, Xudong Jiang
Specifically, we first extract video features using a TimeSformer and text features using a BERT from the target application domain, and utilize CLIP to extract a pair of visual-text features from the general-knowledge domain through the domain-specific learning.
1 code implementation • 3 Feb 2023 • Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai
However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied.
no code implementations • 21 Nov 2022 • Mengjiao Hu, Xudong Jiang, Kang Sim, Juan Helen Zhou, Cuntai Guan
Deep learning has been successfully applied to recognizing both natural images and medical images.
1 code implementation • 5 Nov 2022 • Chenyang Lei, Xudong Jiang, Qifeng Chen
We propose a simple yet effective reflection-free cue for robust reflection removal from a pair of flash and ambient (no-flash) images.
no code implementations • 30 Oct 2022 • Henghui Ding, HUI ZHANG, Xudong Jiang
A direct yet effective prototype regularization on support set is proposed in SRPNet, in which the generated prototypes are evaluated and regularized on the support set itself.
1 code implementation • 28 Oct 2022 • Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang
We propose a Vision-Language Transformer (VLT) framework for referring segmentation to facilitate deep interactions among multi-modal information and enhance the holistic understanding to vision-language features.
Ranked #1 on
Referring Video Object Segmentation
on Refer-YouTube-VOS
(using extra training data)
Referring Expression Segmentation
Referring Video Object Segmentation
no code implementations • 20 Sep 2022 • Shihe Wang, Jianfeng Ren, Xiaoyu Lian, Ruibin Bai, Xudong Jiang
In this paper, we propose a feature augmentation method employing a stack auto-encoder to reduce the noise in the data and boost the discriminant power of naive Bayes.
no code implementations • 20 Sep 2022 • Shihe Wang, Jianfeng Ren, Ruibin Bai, Yuan YAO, Xudong Jiang
Thus, we propose a Max-Dependency-Min-Divergence (MDmD) criterion that maximizes both the discriminant information and generalization ability of the discretized data.
no code implementations • 18 Aug 2022 • Zongrui Li, Qian Zheng, Feishi Wang, Boxin Shi, Gang Pan, Xudong Jiang
Uncalibrated photometric stereo (UPS) is challenging due to the inherent ambiguity brought by unknown light.
no code implementations • 3 Jun 2022 • Jianhan Mei, Xudong Jiang, Henghui Ding
To address the problem of rotation symmetry ambiguity for objects, a spherical convolution is utilized and the spherical features are combined with the convolutional features that are mapped to the graph.
no code implementations • 26 Apr 2022 • Chang Liu, Xudong Jiang, Henghui Ding
In this work, we propose a novel framework that simultaneously detects the target-of-interest via feature propagation and generates a fine-grained segmentation mask.
no code implementations • 24 Nov 2021 • Shiliang Chen, Wentao He, Jianfeng Ren, Xudong Jiang
Radar gait recognition is robust to light variations and less infringement on privacy.
no code implementations • 24 Nov 2021 • Wentao He, Jianfeng Ren, Ruibin Bai, Xudong Jiang
Based on the two intrinsic natures of RPM problem, visual recognition and logical reasoning, we propose a Two-stage Rule-Induction Visual Reasoner (TRIVR), which consists of a perception module and a reasoning module, to tackle the challenges of real-world visual recognition and subsequent logical reasoning tasks, respectively.
no code implementations • 24 Nov 2021 • Zeyu Wang, Chenglin Yao, Jianfeng Ren, Xudong Jiang
In radar activity recognition, 2D signal representations such as spectrogram, cepstrum and cadence velocity diagram are often utilized, while range information is often neglected.
1 code implementation • ICCV 2021 • Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang
We introduce transformer and multi-head attention to build a network with an encoder-decoder attention mechanism architecture that "queries" the given image with the language expression.
Ranked #4 on
Referring Expression Segmentation
on RefCOCOg-test
1 code implementation • Pattern Recognition 2021 • Vasilisa Mishuhina, Xudong Jiang
We propose a novel approach called time-frequency common spatial patterns (TFCSP) to enhance the robustness and accuracy of the electroencephalogram (EEG) signal classification.
1 code implementation • CVPR 2021 • Qian Zheng, Boxin Shi, Jinnan Chen, Xudong Jiang, Ling-Yu Duan, Alex C. Kot
In this paper, we consider the absorption effect for the problem of single image reflection removal.
no code implementations • CVPR 2021 • Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi
This paper studies the problem of panoramic image reflection removal, aiming at reliving the content ambiguity between reflection and transmission scenes.
no code implementations • 7 Jun 2021 • XiaoHong Wang, Xudong Jiang, Henghui Ding, Yuqian Zhao, Jun Liu
In this paper, we propose a novel knowledge-aware deep framework that incorporates some clinical knowledge into collaborative learning of two important melanoma diagnosis tasks, i. e., skin lesion segmentation and melanoma recognition.
no code implementations • 22 Jan 2021 • Chang Liu, Henghui Ding, Xudong Jiang
In this paper, we argue that recovering these microscopic details relies on low-level but high-definition texture features.
no code implementations • ICCV 2021 • Henghui Ding, HUI ZHANG, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang
In this work, we treat each respective region in an image as a whole, and capture the structure topology as well as the affinity among different regions.
no code implementations • 21 Sep 2020 • Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot
In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features.
no code implementations • ECCV 2020 • Junwu Weng, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xudong Jiang, Junsong Yuan
Motivated by the previous success of Two-Dimensional Convolutional Neural Network (2D CNN) on image recognition, researchers endeavor to leverage it to characterize videos.
no code implementations • 14 Mar 2020 • Mengjiao Hu, Kang Sim, Juan Helen Zhou, Xudong Jiang, Cuntai Guan
Convolutional Neural Network (CNN) has been successfully applied on classification of both natural images and medical images but not yet been applied to differentiating patients with schizophrenia from healthy controls.
no code implementations • 20 Feb 2020 • Jianhan Mei, Henghui Ding, Xudong Jiang
In this paper, we address the challenging task of estimating 6D object pose from a single RGB image.
no code implementations • 20 Feb 2020 • Xiaohong Wang, Xudong Jiang, Henghui Ding, Jun Liu
Accurate segmentation of skin lesion from dermoscopic images is a crucial part of computer-aided diagnosis of melanoma.
1 code implementation • CVPR 2019 • Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang
In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region.
Ranked #16 on
Semantic Segmentation
on COCO-Stuff test
1 code implementation • ICCV 2019 • Henghui Ding, Xudong Jiang, Ai Qun Liu, Nadia Magnenat Thalmann, Gang Wang
Furthermore, we propose a boundary-aware feature propagation (BFP) module to harvest and propagate the local features within their regions isolated by the learned boundaries in the UAG-structured image.
Ranked #36 on
Semantic Segmentation
on Cityscapes test
no code implementations • ICCV 2019 • Qian Zheng, Yiming Jia, Boxin Shi, Xudong Jiang, Ling-Yu Duan, Alex C. Kot
This paper solves the Sparse Photometric stereo through Lighting Interpolation and Normal Estimation using a generative Network (SPLINE-Net).
1 code implementation • journal 2019 • Bing Shuai, Henghui Ding, Ting Liu, Gang Wang, Xudong Jiang
Furthermore, we introduce a “dense skip” architecture to retain a rich set of low-level information from the pre-trained CNN, which is essential to improve the low-level parsing performance.
no code implementations • 15 Jan 2019 • Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot
Learning a set of features that are reliable and discriminatively representative of the pose of a hand (or body) part is difficult due to the ambiguities, texture and illumination variation, and self-occlusion in the real application of 3D pose estimation.
no code implementations • 31 Dec 2018 • Zhenwei Miao, Kim-Hui Yap, Xudong Jiang
In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism.
no code implementations • 31 Dec 2018 • Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang
In this paper, we proposed a Discriminative and Contrast Invertible (DCI) local feature descriptor.
no code implementations • ECCV 2018 • Junwu Weng, Mengyuan Liu, Xudong Jiang, Junsong Yuan
This deformable convolution can better utilize contextual joints for action and gesture recognition and is more robust to noisy joints.
1 code implementation • CVPR 2018 • Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang
In this paper, we first propose a novel context contrasted local feature that not only leverages the informative context but also spotlights the local information in contrast to the context.
Ranked #19 on
Semantic Segmentation
on COCO-Stuff test
1 code implementation • IEEE Signal Processing Letters 2018 • Vasilisa Mishuhina, Xudong Jiang
Electroencephalography signals have very low spatial resolution and electrodes capture signals that are overlapping each other.