Scene Text Recognition

121 papers with code • 15 benchmarks • 27 datasets

See Scene Text Detection for leaderboards in this task.

Libraries

Use these libraries to find Scene Text Recognition models and implementations

Most implemented papers

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

Bartzi/see 14 Dec 2017

Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task.

Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond

yueruchen/Pedestrian-Synthesis-GAN 5 Apr 2018

The results show that our framework can smoothly synthesize pedestrians on background images of variations and different levels of details.

On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention

Media-Smart/vedastr 10 Oct 2019

Scene text recognition (STR) is the task of recognizing character sequences in natural scenes.

A Feasible Framework for Arbitrary-Shaped Scene Text Recognition

zhang0jhon/AttentionOCR 10 Dec 2019

Deep learning based methods have achieved surprising progress in Scene Text Recognition (STR), one of classic problems in computer vision.

SCATTER: Selective Context Attentional Scene Text Recognizer

phantrdat/cvpr20-scatter-text-recognizer CVPR 2020

The first attention step re-weights visual features from a CNN backbone together with contextual features computed by a BiLSTM layer.

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

PaddlePaddle/PaddleOCR CVPR 2020

Scene text image contains two levels of contents: visual texture and semantic information.

Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset

hgamal11/evarest-dataset IEEE Access 2021

Therefore, we use our new dataset to evaluate the problem of Arabic scene text recognition from three perspectives: (1) using deep learning techniques and studying their suitability for Arabic scene text recognition, where we identify essential components required for the model to obtain good performance; (2) identifying Arabic text challenges that differ from Latin text and require special attention; (3) investigating a bilingual model that concurrently deals with Arabic and English words, since Arabic text is usually found along with other languages.

Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition

Jackandrome/DPAN ICMR 2021

To tackle this issue, in this paper, we propose a dual parallel attention network (DPAN), in which a newly designed parallel context attention module (PCAM) is cascaded with the original PPAM, using linguistic contextual information to compensate for the information inconsistency between queries and keys.

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

microsoft/unilm 21 Sep 2021

Text recognition is a long-standing research problem for document digitalization.