STAR-Net: A SpaTial Attention Residue Network for Scene Text Recognition

In this paper, we present a novel SpaTial Attention Residue Network (STAR-Net) for recognising scene text. Our STAR-Net is equipped with a spatial attention mechanism which employs a spatial transformer to remove distortions from text in natural images. This allows the subsequent feature extractor to focus on the rectified text region without being sidetracked by the distortions. Our STAR-Net also exploits residue convolutional blocks to build a very deep feature extractor, which is essential for extracting discriminative text features in this fine-grained recognition task. Combining the spatial attention mechanism with the residue convolutional blocks, our STAR-Net is the deepest end-to-end trainable neural network for scene text recognition. Experiments have been conducted on five public benchmark datasets. Experimental results show that our STAR-Net achieves performance comparable to state-of-the-art methods on scene text with little distortion, and outperforms these methods on scene text with considerable distortion.
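The abstract combines two components: a spatial transformer that rectifies distorted text before recognition, and residue (residual) blocks whose identity shortcuts make a very deep extractor trainable. The sketch below is a hypothetical, dependency-free illustration of both ideas, not the authors' implementation: an affine grid generator with bilinear sampling (the core of a spatial transformer) and a residual combination `x + F(x)`.

```python
# Hypothetical sketch of the two core ideas in STAR-Net (not the authors' code):
# (1) spatial transformer: generate a sampling grid from a 2x3 affine matrix
#     and bilinearly sample the input, which can undo affine text distortions;
# (2) residue block: output = x + F(x), the identity shortcut that makes
#     very deep feature extractors trainable.

def affine_grid(theta, height, width):
    """Map each output pixel to a source coordinate via a 2x3 affine matrix.
    Coordinates are normalised to [-1, 1], as in the spatial transformer."""
    grid = []
    for i in range(height):
        for j in range(width):
            y = -1.0 + 2.0 * i / (height - 1)
            x = -1.0 + 2.0 * j / (width - 1)
            xs = theta[0][0] * x + theta[0][1] * y + theta[0][2]
            ys = theta[1][0] * x + theta[1][1] * y + theta[1][2]
            grid.append((xs, ys))
    return grid

def bilinear_sample(img, xs, ys):
    """Bilinearly interpolate img at normalised coordinates (xs, ys)."""
    h, w = len(img), len(img[0])
    x = (xs + 1.0) * (w - 1) / 2.0          # back to pixel coordinates
    y = (ys + 1.0) * (h - 1) / 2.0
    x0 = max(min(int(x), w - 1), 0)
    y0 = max(min(int(y), h - 1), 0)
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    dx, dy = x - x0, y - y0
    return (img[y0][x0] * (1 - dx) * (1 - dy) + img[y0][x1] * dx * (1 - dy)
            + img[y1][x0] * (1 - dx) * dy + img[y1][x1] * dx * dy)

def spatial_transform(img, theta, out_h, out_w):
    """Rectify img by sampling it along the grid defined by theta."""
    grid = affine_grid(theta, out_h, out_w)
    return [[bilinear_sample(img, *grid[i * out_w + j]) for j in range(out_w)]
            for i in range(out_h)]

def residual_block(x, f):
    """Residue block on a 1-D feature vector: output = x + F(x)."""
    return [a + b for a, b in zip(x, f(x))]

# The identity transform should leave the image unchanged.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
img = [[0.0, 1.0], [2.0, 3.0]]
rectified = spatial_transform(img, identity, 2, 2)

# A residual block adds the transformed features back onto the input.
out = residual_block([1.0, 2.0], lambda v: [0.5 * a for a in v])
```

In the full model, `theta` would be predicted by a small localisation network from the input image, and the residual mapping `F` would be a stack of convolutional layers rather than the toy scaling used here.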


Results from the Paper


| Task                   | Dataset    | Model    | Metric   | Value | Global Rank |
|------------------------|------------|----------|----------|-------|-------------|
| Scene Text Recognition | ICDAR 2003 | STAR-Net | Accuracy | 89.9  | # 11        |
| Scene Text Recognition | ICDAR 2013 | STAR-Net | Accuracy | 89.1  | # 35        |
| Scene Text Recognition | SVT        | STAR-Net | Accuracy | 83.6  | # 34        |
