TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Scene Text Recognition	ICDAR 2003	Yet Another Text Recognizer	Accuracy	97.1	# 1
Scene Text Recognition	ICDAR2013	Yet Another Text Recognizer	Accuracy	96.8	# 18
Scene Text Recognition	ICDAR2015	Yet Another Text Recognizer	Accuracy	80.2	# 16
Scene Text Recognition	SVT	Yet Another Text Recognizer	Accuracy	94.7	# 14

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/why-you-should-try-the-real-data-for-the/scene-text-recognition-on-icdar-2003)](https://paperswithcode.com/sota/scene-text-recognition-on-icdar-2003?p=why-you-should-try-the-real-data-for-the)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/why-you-should-try-the-real-data-for-the/scene-text-recognition-on-svt)](https://paperswithcode.com/sota/scene-text-recognition-on-svt?p=why-you-should-try-the-real-data-for-the)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/why-you-should-try-the-real-data-for-the/scene-text-recognition-on-icdar2015)](https://paperswithcode.com/sota/scene-text-recognition-on-icdar2015?p=why-you-should-try-the-real-data-for-the)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/why-you-should-try-the-real-data-for-the/scene-text-recognition-on-icdar2013)](https://paperswithcode.com/sota/scene-text-recognition-on-icdar2013?p=why-you-should-try-the-real-data-for-the)`

Why You Should Try the Real Data for the Scene Text Recognition

29 Jul 2021 · Vladimir Loginov ·

Recent works in the text recognition area have pushed forward the recognition results to the new horizons. But for a long time a lack of large human-labeled natural text recognition datasets has been forcing researchers to use synthetic data for training text recognition models. Even though synthetic datasets are very large (MJSynth and SynthTest, two most famous synthetic datasets, have several million images each), their diversity could be insufficient, compared to natural datasets like ICDAR and others. Fortunately, the recently released text-recognition annotation for OpenImages V5 dataset has comparable with synthetic dataset number of instances and more diverse examples. We have used this annotation with a Text Recognition head architecture from the Yet Another Mask Text Spotter and got comparable to the SOTA results. On some datasets we have even outperformed previous SOTA models. In this paper we also introduce a text recognition model. The model's code is available.

PDF Abstract

Code

Add Remove Mark official

openvinotoolkit/training_extensions official

1,119

Tasks

Add Remove

Scene Text Recognition

Datasets

ICDAR 2013

ICDAR 2003 ICDAR 2015

SVT

TextOCR

Results from the Paper

Edit

Ranked #1 on Scene Text Recognition on ICDAR 2003

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Scene Text Recognition	ICDAR 2003	Yet Another Text Recognizer	Accuracy	97.1	# 1	Compare
Scene Text Recognition	ICDAR2013	Yet Another Text Recognizer	Accuracy	96.8	# 18	Compare
Scene Text Recognition	ICDAR2015	Yet Another Text Recognizer	Accuracy	80.2	# 16	Compare
Scene Text Recognition	SVT	Yet Another Text Recognizer	Accuracy	94.7	# 14	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Why You Should Try the Real Data for the Scene Text Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove