no code implementations • 25 Nov 2022 • Zhao Zhou, Xiangcheng Du, Yingbin Zheng, Cheng Jin
We present the Aggregated Text TRansformer(ATTR), which is designed to represent texts in scene images with a multi-scale self-attention mechanism.
no code implementations • 23 Jul 2022 • Xiangcheng Du, Zhao Zhou, Yingbin Zheng, Xingjiao Wu, Tianlong Ma, Cheng Jin
Scene text erasing seeks to erase text contents from scene images and current state-of-the-art text erasing models are trained on large-scale synthetic data.
no code implementations • 24 Jan 2022 • Xingjiao Wu, Luwei Xiao, Xiangcheng Du, Yingbin Zheng, Xin Li, Tianlong Ma, Liang He
Our framework is an unsupervised document layout analysis framework.
no code implementations • Information Sciences 2021 • Xingjiao Wu, Yingbin Zheng, Tianlong Ma, Hao Ye, Liang He
Layout analysis from a document image plays an important role in document content understanding and information extraction systems.
no code implementations • 25 Nov 2019 • Zhichao Fu, Yu Kong, Yingbin Zheng, Hao Ye, Wenxin Hu, Jing Yang, Liang He
The accuracy of OCR is usually affected by the quality of the input document image and different kinds of marred document images hamper the OCR results.
no code implementations • 4 Nov 2019 • Xiangcheng Du, Tianlong Ma, Yingbin Zheng, Hao Ye, Xingjiao Wu, Liang He
In this paper, we study text recognition framework by considering the long-term temporal dependencies in the encoder stage.
no code implementations • 4 Jul 2019 • Xingjiao Wu, Baohan Xu, Yingbin Zheng, Hao Ye, Jing Yang, Liang He
Crowd counting aims to count the number of instantaneous people in a crowded space, and many promising solutions have been proposed for single image crowd counting.
no code implementations • 4 Jul 2019 • Zhichao Fu, Tianlong Ma, Yingbin Zheng, Hao Ye, Jing Yang, Liang He
In this paper, we resort to human visual demands of sharp edges and propose a two-phase edge-aware deep network to improve deep image deblurring.
no code implementations • 23 Mar 2019 • Zhao Zhou, Hao Ye, Luhui Chen, Yingbin Zheng
Curve text or arbitrary shape text is very common in real-world scenarios.
1 code implementation • 6 Dec 2018 • Xingjiao Wu, Yingbin Zheng, Hao Ye, Wenxin Hu, Jing Yang, Liang He
Crowd counting, i. e., estimation number of the pedestrian in crowd images, is emerging as an important research problem with the public security applications.
no code implementations • 26 Jun 2018 • Li Wang, Weiyuan Shao, Yao Lu, Hao Ye, Jian Pu, Yingbin Zheng
Crowd counting is one of the core tasks in various surveillance applications.
no code implementations • 13 Apr 2018 • Haonan Qiu, Yingbin Zheng, Hao Ye, Yao Lu, Feng Wang, Liang He
The performances of existing action localization approaches remain unsatisfactory in precisely determining the beginning and the end of an action.
4 code implementations • 3 Mar 2017 • Jianqi Ma, Weiyuan Shao, Hao Ye, Li Wang, Hong Wang, Yingbin Zheng, xiangyang xue
This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images.
no code implementations • 1 Feb 2017 • Li Wang, Yao Lu, Hong Wang, Yingbin Zheng, Hao Ye, xiangyang xue
We perform fast vehicle detection from traffic surveillance cameras.