PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition

2 code implementations • 29 Jul 2022 • Dezhi Peng, Lianwen Jin, Yuliang Liu, Canjie Luo, Songxuan Lai

Utilizing the proposed weakly supervised learning framework, PageNet requires only transcripts to be annotated for real data; however, it can still output detection and recognition results at both the character and line levels, avoiding the labor and cost of labeling bounding boxes of characters and text lines.

Handwritten Chinese Text Recognition Line Detection +1

Knowing Where and What: Unified Word Block Pretraining for Document Understanding

1 code implementation • 28 Jul 2022 • Song Tao, Zijian Wang, Tiantian Fan, Canjie Luo, Can Huang

In this paper, we focus on the embedding learning of word blocks containing text and layout information, and propose UTel, a language model with Unified TExt and Layout pre-training.

Contrastive Learning document understanding +2

Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild

1 code implementation • 23 Jul 2022 • Jiaxin Zhang, Canjie Luo, Lianwen Jin, Fengjun Guo, Kai Ding

To address this issue, we propose a novel approach called Marior (Margin Removal and Iterative Content Rectification).

Optical Character Recognition (OCR)

SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text

no code implementations • 23 Feb 2022 • Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Zhe Li, Dezhi Peng

Specifically, we propose a style bank to parameterize the specific handwriting styles as latent vectors, which are input to a generator as style priors to achieve the corresponding handwritten styles.

Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter

1 code implementation • CVPR 2021 • Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo

Specifically, we integrate IFA into the two most prevailing text recognition streams (attention-based and CTC-based) and propose attention-guided dense prediction (ADP) and Extended CTC (ExCTC).

Optical Character Recognition (OCR) +1

Text Recognition in the Wild: A Survey

1 code implementation • 7 May 2020 • Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu, Canjie Luo, Tianwei Wang

This paper aims to (1) summarize the fundamental problems and the state-of-the-art associated with scene text recognition; (2) introduce new insights and ideas; (3) provide a comprehensive review of publicly available resources; (4) point out directions for future work.

Scene Text Recognition

Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition

3 code implementations • CVPR 2020 • Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Yongpan Wang

An agent network learns from the output of the recognition network and controls the fiducial points to generate more proper training samples for the recognition network.

Image Augmentation

Separating Content from Style Using Adversarial Learning for Recognizing Text in the Wild

no code implementations • 13 Jan 2020 • Canjie Luo, Qingxiang Lin, Yuliang Liu, Lianwen Jin, Chunhua Shen

Furthermore, to tackle the issue of lacking paired training samples, we design an interactive joint training scheme, which shares attention masks from the recognizer to the discriminator, and enables the discriminator to extract the features of each character for further adversarial training.

Style Transfer

Decoupled Attention Network for Text Recognition

4 code implementations • 21 Dec 2019 • Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Canjie Luo, Xiaoxue Chen, Yaqiang Wu, Qianying Wang, Mingxiang Cai

To remedy this issue, we propose a decoupled attention network (DAN), which decouples the alignment operation from using historical decoding results.

Handwritten Text Recognition Scene Text Recognition

Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection

1 code implementation • 20 Dec 2019 • Yuliang Liu, Tong He, Hao Chen, Xinyu Wang, Canjie Luo, Shuaitao Zhang, Chunhua Shen, Lianwen Jin

More importantly, based on OBD, we provide a detailed analysis of the impact of a collection of refinements, which may inspire others to build state-of-the-art text detectors.

Scene Text Detection Text Detection

ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)

1 code implementation • 16 Sep 2019 • Chee-Kheng Chng, Yuliang Liu, Yipeng Sun, Chun Chet Ng, Canjie Luo, Zihan Ni, ChuanMing Fang, Shuaitao Zhang, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

This paper reports the ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT) that consists of three major challenges: i) scene text detection, ii) scene text recognition, and iii) scene text spotting.

Scene Text Detection Scene Text Recognition +2

Adaptive Embedding Gate for Attention-Based Scene Text Recognition

no code implementations • 26 Aug 2019 • Xiaoxue Chen, Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Canjie Luo

Scene text recognition has attracted particular research interest because it is a very challenging problem and has various applications.

Scene Text Recognition

Feature Enhancement Network: A Refined Scene Text Detector

no code implementations • 12 Nov 2017 • Sheng Zhang, Yuliang Liu, Lianwen Jin, Canjie Luo

In this paper, we propose a refined scene text detector with a novel Feature Enhancement Network (FEN) for Region Proposal and Text Detection Refinement.

Object Detection +2

