Conditional Text Image Generation with Diffusion Models

no code implementations CVPR 2023 Yuanzhi Zhu, Zhaohai Li, Tianwei Wang, Mengchao He, Cong Yao

Current text recognition systems, including those for handwritten scripts and scene text, have relied heavily on image synthesis and augmentation, since it is difficult to realize real-world complexity and diversity through collecting and annotating enough real text images.

Domain Adaptation, Image Generation

Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter

1 code implementation CVPR 2021 Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo

Specifically, we integrate IFA into the two most prevailing text recognition streams (attention-based and CTC-based) and propose attention-guided dense prediction (ADP) and Extended CTC (ExCTC).

Optical Character Recognition (OCR)

Text Recognition in the Wild: A Survey

1 code implementation 7 May 2020 Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu, Canjie Luo, Tianwei Wang

This paper aims to (1) summarize the fundamental problems and the state-of-the-art associated with scene text recognition; (2) introduce new insights and ideas; (3) provide a comprehensive review of publicly available resources; (4) point out directions for future work.

Scene Text Recognition

Decoupled Attention Network for Text Recognition

4 code implementations 21 Dec 2019 Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Canjie Luo, Xiaoxue Chen, Yaqiang Wu, Qianying Wang, Mingxiang Cai

To remedy this issue, we propose a decoupled attention network (DAN), which decouples the alignment operation from using historical decoding results.

Handwritten Text Recognition, Scene Text Recognition

Adaptive Embedding Gate for Attention-Based Scene Text Recognition

no code implementations 26 Aug 2019 Xiaoxue Chen, Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Canjie Luo

Scene text recognition has attracted particular research interest because it is a very challenging problem and has various applications.

Scene Text Recognition

