TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Scene Text Detection	ICDAR 2015	GNNets	F-Measure	88.52	# 11
Scene Text Detection	ICDAR 2015	GNNets	Precision	90.41	# 14
Scene Text Detection	ICDAR 2015	GNNets	Recall	86.71	# 12
Scene Text Detection	ICDAR 2017 MLT	GNNets	Precision	79.63	# 10
Scene Text Detection	ICDAR 2017 MLT	GNNets	Recall	70.06	# 6
Scene Text Detection	ICDAR 2017 MLT	GNNets	F-Measure	74.54	# 4
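
The F-Measure rows above can be cross-checked against the Precision and Recall rows. The sketch below assumes the benchmark reports F-Measure as the standard harmonic mean of precision and recall, which the listing does not state explicitly. The helper name `f_measure` and the `reported` dictionary are illustrative and are not part of the paper or its code.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (both given in percent)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F-Measure) copied from the table above.
reported = {
    "ICDAR 2015": (90.41, 86.71, 88.52),
    "ICDAR 2017 MLT": (79.63, 70.06, 74.54),
}

for dataset, (p, r, f_reported) in reported.items():
    f = f_measure(p, r)
    print(f"{dataset}: computed {f:.2f}, reported {f_reported:.2f}")
```

Under that assumption, the computed values agree with the reported F-Measures to two decimal places on both datasets.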

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/geometry-normalization-networks-for-accurate/scene-text-detection-on-icdar-2017-mlt-1)](https://paperswithcode.com/sota/scene-text-detection-on-icdar-2017-mlt-1?p=geometry-normalization-networks-for-accurate)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/geometry-normalization-networks-for-accurate/scene-text-detection-on-icdar-2015)](https://paperswithcode.com/sota/scene-text-detection-on-icdar-2015?p=geometry-normalization-networks-for-accurate)`

Geometry Normalization Networks for Accurate Scene Text Detection

ICCV 2019 · Youjiang Xu, Jiaqi Duan, Zhanghui Kuang, Xiaoyu Yue, Hongbin Sun, Yue Guan, Wayne Zhang

Large geometric variances (e.g., in orientation) are a key challenge in scene text detection. In this work, we first conduct experiments to investigate the capacity of networks to learn geometric variances when detecting scene text, and find that networks can handle only limited variances in text geometry. We then put forward a novel Geometry Normalization Module (GNM) with multiple branches, each composed of one Scale Normalization Unit and one Orientation Normalization Unit, which normalizes each text instance to a desired canonical geometry range through at least one branch. The GNM is general and can be readily plugged into existing convolutional neural network based text detectors to construct end-to-end Geometry Normalization Networks (GNNets). Moreover, we propose a geometry-aware training scheme that trains the GNNets effectively by sampling and augmenting text instances from a uniform geometry variance distribution. Finally, experiments on the popular ICDAR 2015 and ICDAR 2017 MLT benchmarks validate that our method outperforms all state-of-the-art approaches by a remarkable margin, obtaining one-forward test F-scores of 88.52 and 74.54, respectively.

Code

bigvideoresearch/GNNets (official)

Tasks

Scene Text Detection

Text Detection

Datasets

ICDAR 2013

ICDAR 2015

ICDAR 2017

Results from the Paper

Ranked #10 on Scene Text Detection on ICDAR 2017 MLT
