Search Results for author: Xiangjian He

Found 30 papers, 6 papers with code

TAVP: Task-Adaptive Visual Prompt for Cross-domain Few-shot Segmentation

no code implementations9 Sep 2024 Jiaqi Yang, Ye Huang, Xiangjian He, Linlin Shen, Guoping Qiu

While ensuring that the prior knowledge of SAM is not discarded, the new branch disentangles category and domain information through prototypes, guiding it in adapting the CD-FSS.

Cross-Domain Few-Shot Image Segmentation +2

NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation

1 code implementation21 Aug 2024 Zhenye Lou, Qing Xu, Zekun Jiang, Xiangjian He, Zhen Chen, Yi Wang, Chenxin Li, Maggie M. He, Wenting Duan

To alleviate the labor-intensive requirement of manual prompts, we introduce a Gaussian-Kernel Prompt Encoder (GKP-Encoder) to generate density maps driven by a single point, which guides segmentation predictions by mixing position prompts and semantic prompts.

Decoder Domain Generalization +4

ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation

1 code implementation19 Jul 2024 Qing Xu, Jiaxuan Li, Xiangjian He, Ziyu Liu, Zhen Chen, Wenting Duan, Chenxin Li, Maggie M. He, Fiseha B. Tesema, Wooi P. Cheah, Yi Wang, Rong Qu, Jonathan M. Garibaldi

Finally, we design the Query-Decoupled Modality Decoder (QDMD) that leverages a one-to-one strategy to provide an independent decoding channel for every modality.

Decoder Image Segmentation +5

Millimeter Wave Radar-based Human Activity Recognition for Healthcare Monitoring Robot

no code implementations3 May 2024 Zhanzhong Gu, Xiangjian He, Gengfa Fang, Chengpei Xu, Feng Xia, Wenjing Jia

Finally, we deploy our system on a movable robot-mounted edge computing platform, achieving flexible healthcare monitoring in real-world scenarios.

Edge-computing Human Activity Recognition

ESP-Zero: Unsupervised enhancement of zero-shot classification for Extremely Sparse Point cloud

no code implementations30 Apr 2024 Jiayi Han, Zidi Cao, Weibo Zheng, Xiangguo Zhou, Xiangjian He, Yuanfang Zhang, Daisen Wei

To the point cloud encoders to fit the extremely sparse point clouds without re-running the pre-training procedure which could be time-consuming and expensive, in this work, we propose an unsupervised model adaptation approach to enhance the point cloud encoder for the extremely sparse point clouds.

Zero-Shot Learning

MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection

no code implementations26 Apr 2024 Chengpei Xu, Wenjing Jia, Ruomei Wang, Xiaonan Luo, Xiangjian He

Bottom-up text detection methods play an important role in arbitrary-shape scene text detection but there are two restrictions preventing them from achieving their great potential, i. e., 1) the accumulation of false text segment detections, which affects subsequent processing, and 2) the difficulty of building reliable connections between text segments.

Scene Text Detection Text Detection

Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery

no code implementations23 Dec 2023 Jialu Zhang, Xiaoying Yang, Wentao He, Jianfeng Ren, Qian Zhang, Titian Zhao, Ruibin Bai, Xiangjian He, Jiang Liu

A set of rewards measuring the localization accuracy, the accuracy of predicted labels, and the scale consistency among nearby patches are designed in the agent to guide the scale optimization.

Object object-detection +1

Point Clouds Are Specialized Images: A Knowledge Transfer Approach for 3D Understanding

no code implementations28 Jul 2023 Jiachen Kang, Wenjing Jia, Xiangjian He, Kin Man Lam

Self-supervised representation learning (SSRL) has gained increasing attention in point cloud understanding, in addressing the challenges posed by 3D data scarcity and high annotation costs.

Representation Learning Transfer Learning

CARD: Semantic Segmentation with Efficient Class-Aware Regularized Decoder

1 code implementation11 Jan 2023 Ye Huang, Di Kang, Liang Chen, Wenjing Jia, Xiangjian He, Lixin Duan, Xuefei Zhe, Linchao Bao

Extensive experiments and ablation studies conducted on multiple benchmark datasets demonstrate that the proposed CAR can boost the accuracy of all baseline models by up to 2. 23% mIOU with superior generalization ability.

Decoder Representation Learning +2

Pixels, Regions, and Objects: Multiple Enhancement for Salient Object Detection

1 code implementation CVPR 2023 Yi Wang, Ruili Wang, Xin Fan, Tianzhu Wang, Xiangjian He

A multi-level hybrid loss is firstly designed to guide the network to learn pixel-level, region-level, and object-level features.

Decoder object-detection +2

Leveraging Systematic Knowledge of 2D Transformations

no code implementations2 Jun 2022 Jiachen Kang, Wenjing Jia, Xiangjian He

The existing deep learning models suffer from out-of-distribution (o. o. d.)

Image Classification

CAR: Class-aware Regularizations for Semantic Segmentation

1 code implementation arXiv:2203.07160 2022 Ye Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Xiangjian He, Linchao Bao

Recent segmentation methods, such as OCR and CPNet, utilizing "class level" information in addition to pixel features, have achieved notable success for boosting the accuracy of existing network modules.

Representation Learning Semantic Segmentation

Channelized Axial Attention for Semantic Segmentation -- Considering Channel Relation within Spatial Attention for Semantic Segmentation

1 code implementation19 Jan 2021 Ye Huang, Di Kang, Wenjing Jia, Xiangjian He, Liu Liu

Spatial and channel attentions, modelling the semantic interdependencies in spatial and channel dimensions respectively, have recently been widely used for semantic segmentation.

Relation Segmentation +1

EdgeLoc: An Edge-IoT Framework for Robust Indoor Localization Using Capsule Networks

no code implementations12 Sep 2020 Qianwen Ye, Xiaochen Fan, Gengfa Fang, Hongxia Bie, Chaocan Xiang, Xudong Song, Xiangjian He

First, the localization accuracy can be degraded by the random signal fluctuations, which would influence conventional localization algorithms that simply learn handcrafted features from raw fingerprint data.

Edge-computing Indoor Localization

Binarized Graph Neural Network

no code implementations19 Apr 2020 Hanchen Wang, Defu Lian, Ying Zhang, Lu Qin, Xiangjian He, Yiguang Lin, Xuemin Lin

Our proposed method can be seamlessly integrated into the existing GNN-based embedding approaches to binarize the model parameters and learn the compact embedding.

Graph Embedding Graph Neural Network

Security and Privacy in IoT Using Machine Learning and Blockchain: Threats & Countermeasures

no code implementations10 Feb 2020 Nazar Waheed, Xiangjian He, Muhammad Ikram, Saad Sajid Hashmi, Muhammad Usman

In this paper, we provide a summary of research efforts made in the past few years, starting from 2008 to 2019, addressing security and privacy issues using ML algorithms and BCtechniques in the IoT domain.

BIG-bench Machine Learning

PDANet: Pyramid Density-aware Attention Net for Accurate Crowd Counting

no code implementations16 Jan 2020 Saeed Amirgholipour, Xiangjian He, Wenjing Jia, Dadong Wang, Lei Liu

For this purpose, a classifier evaluates the density level of the input features and then passes them to the corresponding high and low crowded DAD modules.

Crowd Counting Decoder

See More Than Once -- Kernel-Sharing Atrous Convolution for Semantic Segmentation

no code implementations26 Aug 2019 Ye Huang, Qingqing Wang, Wenjing Jia, Xiangjian He

Experiments conducted on the benchmark PASCAL VOC 2012 dataset show that the proposed sharing strategy can not only boost a network s generalization and representation abilities but also reduce the model complexity significantly.

Semantic Segmentation

FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition

no code implementations20 Apr 2019 Qingqing Wang, Wenjing Jia, Xiangjian He, Yue Lu, Michael Blumenstein, Ye Huang

Scene text recognition has recently been widely treated as a sequence-to-sequence prediction problem, where traditional fully-connected-LSTM (FC-LSTM) has played a critical role.

Scene Text Recognition

DENet: A Universal Network for Counting Crowd with Varying Densities and Scales

no code implementations17 Apr 2019 Lei Liu, Jie Jiang, Wenjing Jia, Saeed Amirgholipour, Michelle Zeibots, Xiangjian He

Counting people or objects with significantly varying scales and densities has attracted much interest from the research community and yet it remains an open problem.

Decoder

Voiceprint recognition of Parkinson patients based on deep learning

no code implementations17 Dec 2018 Zhijing Xu, Juan Wang, Ying Zhang, Xiangjian He

In this paper, a method based on Deep Neural Network (DNN) recognition and classification combined with Mini-Batch Gradient Descent (MBGD) is proposed to distinguish PD patients from healthy people using voiceprint features.

General Classification

A-CCNN: adaptive ccnn for density estimation and crowd counting

no code implementations19 Apr 2018 Saeed Amirgholipour Kasmani, Xiangjian He, Wenjing Jia, Dadong Wang, Michelle Zeibots

Crowd counting, for estimating the number of people in a crowd using vision-based computer techniques, has attracted much interest in the research community.

Crowd Counting Density Estimation +1

Beyond Context: Exploring Semantic Similarity for Tiny Face Detection

no code implementations5 Mar 2018 Yue Xi, Jiangbin Zheng, Xiangjian He, Wenjing Jia, Hanhui Li

Tiny face detection aims to find faces with high degrees of variability in scale, resolution and occlusion in cluttered scenes.

Face Detection Metric Learning +2

A Structural Correlation Filter Combined with A Multi-task Gaussian Particle Filter for Visual Tracking

no code implementations3 Mar 2018 Manna Dai, Shuying Cheng, Xiangjian He, Dadong Wang

First, it can detect the tracked target in a large-scale search scope via weak KCF trackers and evaluate the reliability of weak trackers\rq decisions for a Gaussian particle filter to make a strong decision, and hence it can tackle fast motions, appearance variations, occlusions and re-detections.

Visual Tracking

Structured Inhomogeneous Density Map Learning for Crowd Counting

no code implementations20 Jan 2018 Hanhui Li, Xiangjian He, Hefeng Wu, Saeed Amirgholipour Kasmani, Ruomei Wang, Xiaonan Luo, Liang Lin

In this paper, we aim at tackling the problem of crowd counting in extremely high-density scenes, which contain hundreds, or even thousands of people.

Crowd Counting

Soft Locality Preserving Map (SLPM) for Facial Expression Recognition

no code implementations11 Jan 2018 Cigdem Turan, Kin-Man Lam, Xiangjian He

Furthermore, the proposed feature-generation method can improve the generalizability of the underlying manifolds for facial expression recognition.

Dimensionality Reduction Facial Expression Recognition +1

Face Parsing via a Fully-Convolutional Continuous CRF Neural Network

no code implementations12 Aug 2017 Lei Zhou, Zhi Liu, Xiangjian He

In this work, we address the face parsing task with a Fully-Convolutional continuous CRF Neural Network (FC-CNN) architecture.

Face Parsing

Diversity-Enhanced Condensation Algorithm and Its Application for Robust and Accurate Endoscope Three-Dimensional Motion Tracking

no code implementations CVPR 2014 Xiongbiao Luo, Ying Wan, Xiangjian He, Jie Yang, Kensaku MORI

The paper proposes a diversity-enhanced condensation algorithm to address the particle impoverishment problem which stochastic filtering usually suffers from.

Diversity

Cannot find the paper you are looking for? You can Submit a new open access paper.