Scene Text Recognition

121 papers with code • 15 benchmarks • 27 datasets

See Scene Text Detection for leaderboards in this task.

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Text Recognition

Dataset	Best Model	Compare
ICDAR2013	CLIP4STR-L*	See all
SVT	DTrOCR	See all
ICDAR2015	DTrOCR	See all
CUTE80	CPPD	See all
SVTP	DTrOCR	See all
IIIT5k	DTrOCR	See all
ICDAR 2003	DTrOCR	See all
COCO-Text	CLIP4STR-L	See all
IC19-Art	CLIP4STR-L	See all
HOST	CLIP4STR-L	See all
WOST	CLIP4STR-L	See all
MSDA	MetaSelf-Learning	See all
Uber-Text	MGP-STR	See all
SVT-P	ABINet-LV+TPS++	See all
IC13	ABINet-LV+TPS++	See all

Show all 15 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Scene Text Recognition models and implementations

PaddlePaddle/PaddleOCR

14 papers

38,618

mindspore-lab/mindocr

7 papers

160

Media-Smart/vedastr

6 papers

531

alibabaresearch/advancedliteratemac…

5 papers

950

See all 8 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

Bartzi/see • • 14 Dec 2017

Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task.

Paper
Code

Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond

yueruchen/Pedestrian-Synthesis-GAN • • 5 Apr 2018

The results show that our framework can smoothly synthesize pedestrians on background images of variations and different levels of details.

Paper
Code

On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention

Media-Smart/vedastr • • 10 Oct 2019

Scene text recognition (STR) is the task of recognizing character sequences in natural scenes.

Paper
Code

A Feasible Framework for Arbitrary-Shaped Scene Text Recognition

zhang0jhon/AttentionOCR • • 10 Dec 2019

Deep learning based methods have achieved surprising progress in Scene Text Recognition (STR), one of classic problems in computer vision.

Paper
Code

SCATTER: Selective Context Attentional Scene Text Recognizer

phantrdat/cvpr20-scatter-text-recognizer • • CVPR 2020

The first attention step re-weights visual features from a CNN backbone together with contextual features computed by a BiLSTM layer.

Paper
Code

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

PaddlePaddle/PaddleOCR • • CVPR 2020

Scene text image contains two levels of contents: visual texture and semantic information.

Paper
Code

SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition

Pay20Y/SEED • • CVPR 2020

Scene text recognition is a hot research topic in computer vision.

Paper
Code

Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset

hgamal11/evarest-dataset • IEEE Access 2021

Therefore, we use our new dataset to evaluate the problem of Arabic scene text recognition from three perspectives: (1) using deep learning techniques and studying their suitability for Arabic scene text recognition, where we identify essential components required for the model to obtain good performance; (2) identifying Arabic text challenges that differ from Latin text and require special attention; (3) investigating a bilingual model that concurrently deals with Arabic and English words, since Arabic text is usually found along with other languages.

Paper
Code

Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition

Jackandrome/DPAN • ICMR 2021

To tackle this issue, in this paper, we propose a dual parallel attention network (DPAN), in which a newly designed parallel context attention module (PCAM) is cascaded with the original PPAM, using linguistic contextual information to compensate for the information inconsistency between queries and keys.

Paper
Code

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

microsoft/unilm • • 21 Sep 2021

Text recognition is a long-standing research problem for document digitalization.

Paper
Code

Scene Text Recognition

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result