Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition

2 Nov 2018  ·  Hui Li, Peng Wang, Chunhua Shen, Guyu Zhang

Recognizing irregular text in natural scene images is challenging due to the large variance in text appearance, such as curvature, orientation and distortion. Most existing approaches rely heavily on sophisticated model designs and/or extra fine-grained annotations, which, to some extent, increase the difficulty in algorithm implementation and data collection. In this work, we propose an easy-to-implement strong baseline for irregular scene text recognition, using off-the-shelf neural network components and only word-level annotations. It is composed of a $31$-layer ResNet, an LSTM-based encoder-decoder framework and a 2-dimensional attention module. Despite its simplicity, the proposed method is robust and achieves state-of-the-art performance on both regular and irregular scene text recognition benchmarks. Code is available at: https://tinyurl.com/ShowAttendRead
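To make the architecture described above more concrete, the following is a minimal PyTorch sketch of a 2-dimensional attention step over a convolutional feature map, conditioned on the decoder's hidden state. It is an illustrative approximation only: the class name, the 3x3 convolution over the feature map, and the attention dimension attn_size are assumptions made here for exposition, not the authors' released implementation (see the linked code for that).

import torch
import torch.nn as nn
import torch.nn.functional as F


class TwoDimAttention(nn.Module):
    # Hypothetical sketch of 2D attention over a CNN feature map,
    # conditioned on the decoder LSTM state; not the paper's exact code.
    def __init__(self, feat_channels, hidden_size, attn_size):
        super().__init__()
        self.conv_feat = nn.Conv2d(feat_channels, attn_size, kernel_size=3, padding=1)
        self.proj_hidden = nn.Linear(hidden_size, attn_size)
        self.score = nn.Conv2d(attn_size, 1, kernel_size=1)

    def forward(self, feat_map, hidden):
        # feat_map: (B, C, H, W) features from the CNN backbone
        # hidden:   (B, hidden_size) current decoder hidden state
        proj_f = self.conv_feat(feat_map)                    # (B, A, H, W)
        proj_h = self.proj_hidden(hidden)[:, :, None, None]  # (B, A, 1, 1)
        scores = self.score(torch.tanh(proj_f + proj_h))     # (B, 1, H, W)
        # Softmax over all H*W spatial positions.
        alpha = F.softmax(scores.flatten(2), dim=-1).view_as(scores)
        # Glimpse: attention-weighted sum of feature vectors.
        glimpse = (alpha * feat_map).flatten(2).sum(-1)      # (B, C)
        return glimpse, alpha

At each decoding step the glimpse would typically be combined with the previous output embedding and fed to the decoder LSTM to predict the next character; that wiring is omitted from this sketch.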

Results from the Paper


Task                    Dataset    Model  Metric    Value  Global Rank
Scene Text Recognition  ICDAR2013  SAR    Accuracy  91.0   #33
Scene Text Recognition  ICDAR2015  SAR    Accuracy  69.2   #26
Scene Text Recognition  SVT        SAR    Accuracy  84.5   #31