TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text Spotting	ICDAR 2015	PGNet	F-measure (%) - Strong Lexicon	83.3	# 11
Text Spotting	ICDAR 2015	PGNet	F-measure (%) - Weak Lexicon	78.3	# 11
Text Spotting	ICDAR 2015	PGNet	F-measure (%) - Generic Lexicon	63.5	# 17
Scene Text Detection	ICDAR 2015	PGNet-A	Accuracy	62.3	# 1
Scene Text Detection	ICDAR 2015	MCLAB_FCN	F-Measure	53.6	# 41
Scene Text Detection	ICDAR 2015	MCLAB_FCN	Precision	70.8	# 41
Scene Text Detection	ICDAR 2015	MCLAB_FCN	Recall	43.0	# 41

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/pgnet-real-time-arbitrarily-shaped-text/scene-text-detection-on-icdar-2015)](https://paperswithcode.com/sota/scene-text-detection-on-icdar-2015?p=pgnet-real-time-arbitrarily-shaped-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/pgnet-real-time-arbitrarily-shaped-text/text-spotting-on-icdar-2015)](https://paperswithcode.com/sota/text-spotting-on-icdar-2015?p=pgnet-real-time-arbitrarily-shaped-text)`

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

12 Apr 2021 · Pengfei Wang, Chengquan Zhang, Fei Qi, Shanshan Liu, Xiaoqiang Zhang, Pengyuan Lyu, Junyu Han, Jingtuo Liu, Errui Ding, Guangming Shi ·

The reading of arbitrarily-shaped text has received increasing research attention. However, existing text spotters are mostly built on two-stage frameworks or character-based methods, which suffer from either Non-Maximum Suppression (NMS), Region-of-Interest (RoI) operations, or character-level annotations. In this paper, to address the above problems, we propose a novel fully convolutional Point Gathering Network (PGNet) for reading arbitrarily-shaped text in real-time. The PGNet is a single-shot text spotter, where the pixel-level character classification map is learned with proposed PG-CTC loss avoiding the usage of character-level annotations. With a PG-CTC decoder, we gather high-level character classification vectors from two-dimensional space and decode them into text symbols without NMS and RoI operations involved, which guarantees high efficiency. Additionally, reasoning the relations between each character and its neighbors, a graph refinement module (GRM) is proposed to optimize the coarse recognition and improve the end-to-end performance. Experiments prove that the proposed method achieves competitive accuracy, meanwhile significantly improving the running speed. In particular, in Total-Text, it runs at 46.7 FPS, surpassing the previous spotters with a large margin.

PDF Abstract

Code

Add Remove Mark official

PaddlePaddle/PaddleOCR official

38,330

2024-MindSpore-1/Code3

Tasks

Add Remove

Optical Character Recognition (OCR)

Scene Text Detection

Text Spotting

Datasets

Total-Text ICDAR 2015

Results from the Paper

Edit

Ranked #1 on Scene Text Detection on ICDAR 2015 (Accuracy metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text Spotting	ICDAR 2015	PGNet	F-measure (%) - Strong Lexicon	83.3	# 11	Compare
			F-measure (%) - Weak Lexicon	78.3	# 11	Compare
			F-measure (%) - Generic Lexicon	63.5	# 17	Compare
Scene Text Detection	ICDAR 2015	PGNet-A	Accuracy	62.3	# 1	Compare
Scene Text Detection	ICDAR 2015	MCLAB_FCN	F-Measure	53.6	# 41	Compare
			Precision	70.8	# 41	Compare
			Recall	43.0	# 41	Compare

Methods

Add Remove

PGNet

Edit Social Preview

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove