no code implementations • 26 Nov 2024 • Guoan Xu, Jiaming Chen, Wenfeng Huang, Wenjing Jia, Guangwei Gao, Guo-Jun Qi
The Vision Transformer (ViT) has achieved notable success in computer vision, with its variants extensively validated across various downstream tasks, including semantic segmentation.
no code implementations • 27 Sep 2024 • Wenfeng Huang, Guoan Xu, Wenjing Jia, Stuart Perry, Guangwei Gao
Images captured in challenging environments--such as nighttime, foggy, rainy weather, and underwater--often suffer from significant degradation, resulting in a substantial loss of visual quality.
no code implementations • 11 Aug 2024 • Guoan Xu, Wenfeng Huang, Tao Wu, Ligeng Chen, Wenjing Jia, Guangwei Gao, Xiatian Zhu, Stuart Perry
Semantic segmentation involves assigning a specific category to each pixel in an image.
no code implementations • 10 Jul 2024 • Guoan Xu, Wenjing Jia, Tao Wu, Ligeng Chen, Guangwei Gao
In this paper, we introduce HAFormer, a model that combines the hierarchical features extraction ability of CNNs with the global dependency modeling capability of Transformers to tackle lightweight semantic segmentation challenges.
no code implementations • 3 May 2024 • Zhanzhong Gu, Xiangjian He, Gengfa Fang, Chengpei Xu, Feng Xia, Wenjing Jia
Finally, we deploy our system on a movable robot-mounted edge computing platform, achieving flexible healthcare monitoring in real-world scenarios.
no code implementations • 26 Apr 2024 • Chengpei Xu, Wenjing Jia, Ruomei Wang, Xiaonan Luo, Xiangjian He
Bottom-up text detection methods play an important role in arbitrary-shape scene text detection but there are two restrictions preventing them from achieving their great potential, i. e., 1) the accumulation of false text segment detections, which affects subsequent processing, and 2) the difficulty of building reliable connections between text segments.
no code implementations • 13 Apr 2024 • Chengpei Xu, Hao Fu, Long Ma, Wenjing Jia, Chengqi Zhang, Feng Xia, Xiaoyu Ai, Binghao Li, Wenjie Zhang
Localizing text in low-light environments is challenging due to visual degradations.
no code implementations • 10 Sep 2023 • Guoan Xu, Wenjing Jia, Tao Wu, Ligeng Chen
In contrast to the abundant research focusing on large-scale models, the progress in lightweight semantic segmentation appears to be advancing at a comparatively slower pace.
no code implementations • 28 Jul 2023 • Jiachen Kang, Wenjing Jia, Xiangjian He, Kin Man Lam
Self-supervised representation learning (SSRL) has gained increasing attention in point cloud understanding, in addressing the challenges posed by 3D data scarcity and high annotation costs.
1 code implementation • 11 Jan 2023 • Ye Huang, Di Kang, Liang Chen, Wenjing Jia, Xiangjian He, Lixin Duan, Xuefei Zhe, Linchao Bao
Extensive experiments and ablation studies conducted on multiple benchmark datasets demonstrate that the proposed CAR can boost the accuracy of all baseline models by up to 2. 23% mIOU with superior generalization ability.
no code implementations • 2 Jun 2022 • Jiachen Kang, Wenjing Jia, Xiangjian He
The existing deep learning models suffer from out-of-distribution (o. o. d.)
1 code implementation • arXiv:2203.07160 2022 • Ye Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Xiangjian He, Linchao Bao
Recent segmentation methods, such as OCR and CPNet, utilizing "class level" information in addition to pixel features, have achieved notable success for boosting the accuracy of existing network modules.
Ranked #8 on Semantic Segmentation on PASCAL Context
1 code implementation • 19 Jan 2021 • Ye Huang, Di Kang, Wenjing Jia, Xiangjian He, Liu Liu
Spatial and channel attentions, modelling the semantic interdependencies in spatial and channel dimensions respectively, have recently been widely used for semantic segmentation.
Ranked #6 on Semantic Segmentation on COCO-Stuff test
no code implementations • 16 Jan 2020 • Saeed Amirgholipour, Xiangjian He, Wenjing Jia, Dadong Wang, Lei Liu
For this purpose, a classifier evaluates the density level of the input features and then passes them to the corresponding high and low crowded DAD modules.
no code implementations • 26 Aug 2019 • Ye Huang, Qingqing Wang, Wenjing Jia, Xiangjian He
Experiments conducted on the benchmark PASCAL VOC 2012 dataset show that the proposed sharing strategy can not only boost a network s generalization and representation abilities but also reduce the model complexity significantly.
no code implementations • 20 Apr 2019 • Qingqing Wang, Wenjing Jia, Xiangjian He, Yue Lu, Michael Blumenstein, Ye Huang
Scene text recognition has recently been widely treated as a sequence-to-sequence prediction problem, where traditional fully-connected-LSTM (FC-LSTM) has played a critical role.
no code implementations • 17 Apr 2019 • Lei Liu, Jie Jiang, Wenjing Jia, Saeed Amirgholipour, Michelle Zeibots, Xiangjian He
Counting people or objects with significantly varying scales and densities has attracted much interest from the research community and yet it remains an open problem.
no code implementations • 19 Apr 2018 • Saeed Amirgholipour Kasmani, Xiangjian He, Wenjing Jia, Dadong Wang, Michelle Zeibots
Crowd counting, for estimating the number of people in a crowd using vision-based computer techniques, has attracted much interest in the research community.
no code implementations • 5 Mar 2018 • Yue Xi, Jiangbin Zheng, Xiangjian He, Wenjing Jia, Hanhui Li
Tiny face detection aims to find faces with high degrees of variability in scale, resolution and occlusion in cluttered scenes.
no code implementations • 25 Sep 2013 • Yao Li, Wenjing Jia, Chunhua Shen, Anton Van Den Hengel
In order to measure the characterness we develop three novel cues that are tailored for character detection, and a Bayesian method for their integration.