no code implementations • ECCV 2020 • Henghui Ding, Scott Cohen, Brian Price, Xudong Jiang
We propose to employ phrase expressions as another interaction input to infer the attributes of target object.
no code implementations • 2 May 2024 • Zhongzheng Qiao, Xuan Huy Pham, Savitha Ramasamy, Xudong Jiang, Erdal Kayacan, Andriy Sarabakha
In autonomous and mobile robotics, a principal challenge is resilient real-time environmental perception, particularly in situations characterized by unknown and dynamic elements, as exemplified in the context of autonomous drone racing.
1 code implementation • 15 Apr 2024 • Song Xia, Yu Yi, Xudong Jiang, Henghui Ding
The proposed Dual Randomized Smoothing (DRS) down-samples the input image into two sub-images and smooths the two sub-images in lower dimensions.
1 code implementation • 2 Apr 2024 • Zongrui Li, Zhan Lu, Haojie Yan, Boxin Shi, Gang Pan, Qian Zheng, Xudong Jiang
Natural Light Uncalibrated Photometric Stereo (NaUPS) relieves the strict environment and light assumptions in classical Uncalibrated Photometric Stereo (UPS) methods.
1 code implementation • 19 Feb 2024 • Zhongzheng Qiao, Quang Pham, Zhen Cao, Hoang H Le, P. N. Suganthan, Xudong Jiang, Ramasamy Savitha
Real-world environments are inherently non-stationary, frequently introducing new classes over time.
no code implementations • 18 Jan 2024 • Jun Wang, Chengfeng Zhou, Zhaoyan Ming, Lina Wei, Xudong Jiang, Dahong Qian
One of the fundamental challenges in microscopy (MS) image analysis is instance segmentation (IS), particularly when segmenting cluster regions where multiple objects of varying sizes and shapes may be connected or even overlapped in arbitrary orientations.
no code implementations • 26 Dec 2023 • Zhan Lu, Qian Zheng, Boxin Shi, Xudong Jiang
However, in the case of inputting sparse Low Dynamic Range (LDR) panoramic images, NeRF often degrades with under-constrained geometry and is unable to reconstruct HDR radiance from LDR inputs.
1 code implementation • 26 Dec 2023 • Hang Du, Guoshun Nan, Sicheng Zhang, Binzhu Xie, Junrui Xu, Hehe Fan, Qimei Cui, Xiaofeng Tao, Xudong Jiang
Multimodal Sarcasm Understanding (MSU) has a wide range of applications in the news field such as public opinion analysis and forgery detection.
1 code implementation • 10 Dec 2023 • Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan, Yap-Peng Tan, Weipeng Hu
While recent advancements have introduced control over factors such as object localization, posture, and image contours, a crucial gap remains in our ability to control the interactions between objects in the generated content.
1 code implementation • 16 Nov 2023 • Wentao He, Yuchen Yan, Jianfeng Ren, Ruibin Bai, Xudong Jiang
Deep neural networks have been applied to audio spectrograms for respiratory sound classification.
no code implementations • 13 Nov 2023 • Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, Henghui Ding
With the help of relational knowledge transfer, VGKT is capable of aligning semantic-group textual features with corresponding visual features without external tools and complex pairwise interaction.
Ranked #6 on Text based Person Retrieval on CUHK-PEDES (using extra training data)
1 code implementation • 30 Aug 2023 • Shuting He, Henghui Ding, Chang Liu, Xudong Jiang
This dataset encompasses a range of expressions: those referring to multiple targets, expressions with no specific target, and the single-target expressions.
Generalized Referring Expression Comprehension Referring Expression +1
1 code implementation • ICCV 2023 • Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Chen Change Loy
To investigate the feasibility of using motion expressions to ground and segment objects in videos, we propose a large-scale dataset called MeViS, which contains numerous motion expressions to indicate target objects in complex environments.
Ranked #2 on Referring Video Object Segmentation on MeViS
1 code implementation • 28 Jun 2023 • Jianzong Wu, Xiangtai Li, Shilin Xu, Haobo Yuan, Henghui Ding, Yibo Yang, Xia Li, Jiangning Zhang, Yunhai Tong, Xudong Jiang, Bernard Ghanem, DaCheng Tao
To our knowledge, this is the first comprehensive literature review of open vocabulary learning.
2 code implementations • CVPR 2023 • Chang Liu, Henghui Ding, Xudong Jiang
Existing classic RES datasets and methods commonly support single-target expressions only, i. e., one expression refers to one target object.
Generalized Referring Expression Segmentation Referring Expression +1
1 code implementation • 31 May 2023 • Yangfan Hu, Qian Zheng, Xudong Jiang, Gang Pan
However, due to the quantization error and accumulating error, it often requires lots of time steps (high inference latency) to achieve high performance, which negates SNN's advantages.
no code implementations • 25 May 2023 • Chenglin Yao, Jianfeng Ren, Ruibin Bai, Heshan Du, Jiang Liu, Xudong Jiang
Detecting 3D mask attacks to a face recognition system is challenging.
no code implementations • 24 May 2023 • Chang Liu, Henghui Ding, Yulun Zhang, Xudong Jiang
However, the generic attention mechanism in Transformer only uses the language input for attention weight calculation, which does not explicitly fuse language features in its output.
1 code implementation • 23 May 2023 • Shuting He, Xudong Jiang, Wei Jiang, Henghui Ding
In this work, we address the challenging task of few-shot and zero-shot 3D point cloud semantic segmentation.
1 code implementation • CVPR 2023 • Zongrui Li, Qian Zheng, Boxin Shi, Gang Pan, Xudong Jiang
Although the ambiguity is alleviated on non-Lambertian objects, the problem is still difficult to solve for more general objects with complex shapes introducing irregular shadows and general materials with complex reflectance like anisotropic reflectance.
no code implementations • 6 Mar 2023 • Shuhong Ye, Weikai Kong, Chenglin Yao, Jianfeng Ren, Xudong Jiang
Specifically, we first extract video features using a TimeSformer and text features using a BERT from the target application domain, and utilize CLIP to extract a pair of visual-text features from the general-knowledge domain through the domain-specific learning.
1 code implementation • ICCV 2023 • Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai
However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied.
no code implementations • 21 Nov 2022 • Mengjiao Hu, Xudong Jiang, Kang Sim, Juan Helen Zhou, Cuntai Guan
Deep learning has been successfully applied to recognizing both natural images and medical images.
1 code implementation • 5 Nov 2022 • Chenyang Lei, Xudong Jiang, Qifeng Chen
We propose a simple yet effective reflection-free cue for robust reflection removal from a pair of flash and ambient (no-flash) images.
no code implementations • 30 Oct 2022 • Henghui Ding, HUI ZHANG, Xudong Jiang
A direct yet effective prototype regularization on support set is proposed in SRPNet, in which the generated prototypes are evaluated and regularized on the support set itself.
1 code implementation • 28 Oct 2022 • Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang
We propose a Vision-Language Transformer (VLT) framework for referring segmentation to facilitate deep interactions among multi-modal information and enhance the holistic understanding to vision-language features.
Ranked #3 on Referring Video Object Segmentation on MeViS
Referring Expression Segmentation Referring Video Object Segmentation
no code implementations • 20 Sep 2022 • Shihe Wang, Jianfeng Ren, Ruibin Bai, Yuan YAO, Xudong Jiang
Thus, we propose a Max-Dependency-Min-Divergence (MDmD) criterion that maximizes both the discriminant information and generalization ability of the discretized data.
no code implementations • 20 Sep 2022 • Shihe Wang, Jianfeng Ren, Xiaoyu Lian, Ruibin Bai, Xudong Jiang
In this paper, we propose a feature augmentation method employing a stack auto-encoder to reduce the noise in the data and boost the discriminant power of naive Bayes.
no code implementations • 18 Aug 2022 • Zongrui Li, Qian Zheng, Feishi Wang, Boxin Shi, Gang Pan, Xudong Jiang
Uncalibrated photometric stereo (UPS) is challenging due to the inherent ambiguity brought by unknown light.
no code implementations • 3 Jun 2022 • Jianhan Mei, Xudong Jiang, Henghui Ding
To address the problem of rotation symmetry ambiguity for objects, a spherical convolution is utilized and the spherical features are combined with the convolutional features that are mapped to the graph.
no code implementations • 26 Apr 2022 • Chang Liu, Xudong Jiang, Henghui Ding
In this work, we propose a novel framework that simultaneously detects the target-of-interest via feature propagation and generates a fine-grained segmentation mask.
no code implementations • 24 Nov 2021 • Wentao He, Jianfeng Ren, Ruibin Bai, Xudong Jiang
Based on the two intrinsic natures of RPM problem, visual recognition and logical reasoning, we propose a Two-stage Rule-Induction Visual Reasoner (TRIVR), which consists of a perception module and a reasoning module, to tackle the challenges of real-world visual recognition and subsequent logical reasoning tasks, respectively.
no code implementations • 24 Nov 2021 • Zeyu Wang, Chenglin Yao, Jianfeng Ren, Xudong Jiang
In radar activity recognition, 2D signal representations such as spectrogram, cepstrum and cadence velocity diagram are often utilized, while range information is often neglected.
no code implementations • 24 Nov 2021 • Shiliang Chen, Wentao He, Jianfeng Ren, Xudong Jiang
Radar gait recognition is robust to light variations and less infringement on privacy.
1 code implementation • ICCV 2021 • Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang
We introduce transformer and multi-head attention to build a network with an encoder-decoder attention mechanism architecture that "queries" the given image with the language expression.
1 code implementation • Pattern Recognition 2021 • Vasilisa Mishuhina, Xudong Jiang
We propose a novel approach called time-frequency common spatial patterns (TFCSP) to enhance the robustness and accuracy of the electroencephalogram (EEG) signal classification.
no code implementations • CVPR 2021 • Yuchen Hong, Qian Zheng, Lingran Zhao, Xudong Jiang, Alex C. Kot, Boxin Shi
This paper studies the problem of panoramic image reflection removal, aiming at reliving the content ambiguity between reflection and transmission scenes.
1 code implementation • CVPR 2021 • Qian Zheng, Boxin Shi, Jinnan Chen, Xudong Jiang, Ling-Yu Duan, Alex C. Kot
In this paper, we consider the absorption effect for the problem of single image reflection removal.
no code implementations • 7 Jun 2021 • XiaoHong Wang, Xudong Jiang, Henghui Ding, Yuqian Zhao, Jun Liu
In this paper, we propose a novel knowledge-aware deep framework that incorporates some clinical knowledge into collaborative learning of two important melanoma diagnosis tasks, i. e., skin lesion segmentation and melanoma recognition.
no code implementations • 22 Jan 2021 • Chang Liu, Henghui Ding, Xudong Jiang
In this paper, we argue that recovering these microscopic details relies on low-level but high-definition texture features.
no code implementations • ICCV 2021 • Henghui Ding, HUI ZHANG, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang
In this work, we treat each respective region in an image as a whole, and capture the structure topology as well as the affinity among different regions.
no code implementations • 21 Sep 2020 • Tao Bai, Jinnan Chen, Jun Zhao, Bihan Wen, Xudong Jiang, Alex Kot
In this paper, we propose a novel approach called Guided Adversarial Contrastive Distillation (GACD), to effectively transfer adversarial robustness from teacher to student with features.
no code implementations • ECCV 2020 • Junwu Weng, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xudong Jiang, Junsong Yuan
Motivated by the previous success of Two-Dimensional Convolutional Neural Network (2D CNN) on image recognition, researchers endeavor to leverage it to characterize videos.
no code implementations • 14 Mar 2020 • Mengjiao Hu, Kang Sim, Juan Helen Zhou, Xudong Jiang, Cuntai Guan
Convolutional Neural Network (CNN) has been successfully applied on classification of both natural images and medical images but not yet been applied to differentiating patients with schizophrenia from healthy controls.
no code implementations • 20 Feb 2020 • Xiaohong Wang, Xudong Jiang, Henghui Ding, Jun Liu
Accurate segmentation of skin lesion from dermoscopic images is a crucial part of computer-aided diagnosis of melanoma.
no code implementations • 20 Feb 2020 • Jianhan Mei, Henghui Ding, Xudong Jiang
In this paper, we address the challenging task of estimating 6D object pose from a single RGB image.
1 code implementation • CVPR 2019 • Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang
In this way, the proposed network aggregates the context information of a pixel from its semantic-correlated region instead of a predefined fixed region.
Ranked #13 on Semantic Segmentation on COCO-Stuff test
1 code implementation • ICCV 2019 • Henghui Ding, Xudong Jiang, Ai Qun Liu, Nadia Magnenat Thalmann, Gang Wang
Furthermore, we propose a boundary-aware feature propagation (BFP) module to harvest and propagate the local features within their regions isolated by the learned boundaries in the UAG-structured image.
Ranked #38 on Semantic Segmentation on PASCAL Context
no code implementations • ICCV 2019 • Qian Zheng, Yiming Jia, Boxin Shi, Xudong Jiang, Ling-Yu Duan, Alex C. Kot
This paper solves the Sparse Photometric stereo through Lighting Interpolation and Normal Estimation using a generative Network (SPLINE-Net).
1 code implementation • journal 2019 • Bing Shuai, Henghui Ding, Ting Liu, Gang Wang, Xudong Jiang
Furthermore, we introduce a “dense skip” architecture to retain a rich set of low-level information from the pre-trained CNN, which is essential to improve the low-level parsing performance.
no code implementations • 15 Jan 2019 • Jun Liu, Henghui Ding, Amir Shahroudy, Ling-Yu Duan, Xudong Jiang, Gang Wang, Alex C. Kot
Learning a set of features that are reliable and discriminatively representative of the pose of a hand (or body) part is difficult due to the ambiguities, texture and illumination variation, and self-occlusion in the real application of 3D pose estimation.
no code implementations • 31 Dec 2018 • Zhenwei Miao, Kim-Hui Yap, Xudong Jiang
In this paper, an adaptive pixel ternary coding mechanism is proposed and a contrast invariant and noise resistant interest point detector is developed on the basis of this mechanism.
no code implementations • 31 Dec 2018 • Zhenwei Miao, Kim-Hui Yap, Xudong Jiang, Subbhuraam Sinduja, Zhenhua Wang
In this paper, we proposed a Discriminative and Contrast Invertible (DCI) local feature descriptor.
no code implementations • ECCV 2018 • Junwu Weng, Mengyuan Liu, Xudong Jiang, Junsong Yuan
This deformable convolution can better utilize contextual joints for action and gesture recognition and is more robust to noisy joints.
1 code implementation • CVPR 2018 • Henghui Ding, Xudong Jiang, Bing Shuai, Ai Qun Liu, Gang Wang
In this paper, we first propose a novel context contrasted local feature that not only leverages the informative context but also spotlights the local information in contrast to the context.
Ranked #16 on Semantic Segmentation on COCO-Stuff test
1 code implementation • IEEE Signal Processing Letters 2018 • Vasilisa Mishuhina, Xudong Jiang
Electroencephalography signals have very low spatial resolution and electrodes capture signals that are overlapping each other.