no code implementations • 27 Jun 2024 • Jiaxin Zhang, Wentao Yang, Songxuan Lai, Zecheng Xie, Lianwen Jin
Current multimodal large language models (MLLMs) face significant challenges in visual document understanding (VDU) tasks due to the high resolution, dense text, and complex layouts typical of document images.
no code implementations • 15 May 2023 • Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin
Document layout analysis is a crucial prerequisite for document understanding, including document retrieval and conversion.
no code implementations • CVPR 2023 • Yongshuai Huang, Ning Lu, Dapeng Chen, Yibo Li, Zecheng Xie, Shenggao Zhu, Liangcai Gao, Wei Peng
The ablation study also validates that the proposed coordinate sequence decoder and the visual-alignment loss are the keys to the success of our method.
1 code implementation • CVPR 2023 • Hiuyi Cheng, Peirong Zhang, Sihang Wu, Jiaxin Zhang, Qiyuan Zhu, Zecheng Xie, Jing Li, Kai Ding, Lianwen Jin
Document layout analysis is a crucial prerequisite for document understanding, including document retrieval and conversion.
2 code implementations • CVPR 2019 • Zecheng Xie, Yaoxiong Huang, Yuanzhi Zhu, Lianwen Jin, Yuliang Liu, Lele Xie
In this paper, we propose a novel method, aggregation cross-entropy (ACE), for sequence recognition from a brand new perspective.
1 code implementation • CVPR 2019 • Yuliang Liu, Lianwen Jin, Zecheng Xie, Canjie Luo, Shuaitao Zhang, Lele Xie
Evaluation protocols play key role in the developmental progress of text detection methods.
1 code implementation • 16 Nov 2018 • Lele Xie, Yuliang Liu, Lianwen Jin, Zecheng Xie
However, the detection performance is sensitive to the setting of the anchor boxes.
no code implementations • 9 Oct 2016 • Zecheng Xie, Zenghui Sun, Lianwen Jin, Hao Ni, Terry Lyons
Online handwritten Chinese text recognition (OHCTR) is a challenging problem as it involves a large-scale character set, ambiguous segmentation, and variable-length input sequences.
no code implementations • 18 Apr 2016 • Zecheng Xie, Zenghui Sun, Lianwen Jin, Ziyong Feng, Shuye Zhang
This paper proposes an end-to-end framework, namely fully convolutional recurrent network (FCRN) for handwritten Chinese text recognition (HCTR).
Handwriting Recognition Handwritten Chinese Text Recognition +1
no code implementations • 28 May 2015 • Weixin Yang, Lianwen Jin, Zecheng Xie, Ziyong Feng
Deep convolutional neural networks (DCNNs) have achieved great success in various computer vision and pattern recognition applications, including those for handwritten Chinese character recognition (HCCR).
no code implementations • 20 May 2015 • Weixin Yang, Lianwen Jin, DaCheng Tao, Zecheng Xie, Ziyong Feng
Inspired by the theory of Leitners learning box from the field of psychology, we propose DropSample, a new method for training deep convolutional neural networks (DCNNs), and apply it to large-scale online handwritten Chinese character recognition (HCCR).
1 code implementation • 19 May 2015 • Zhuoyao Zhong, Lianwen Jin, Zecheng Xie
We design a streamlined version of GoogLeNet [13], which was original proposed for image classification in recent years with very deep architecture, for HCCR (denoted as HCCR-GoogLeNet).
Image Classification Offline Handwritten Chinese Character Recognition