TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Scene Text Detection	IC19-Art	TextFuseNet (ResNeXt-101)	H-Mean	78.6	# 3
Scene Text Detection	ICDAR 2013	TextFuseNet (ResNeXt-101)	F-Measure	94.61%	# 1
Scene Text Detection	ICDAR 2013	TextFuseNet (ResNeXt-101)	Precision	97.27	# 2
Scene Text Detection	ICDAR 2013	TextFuseNet (ResNeXt-101)	Recall	92.09	# 2
Scene Text Detection	ICDAR 2015	TextFuseNet (ResNeXt-101)	F-Measure	92.23	# 1
Scene Text Detection	ICDAR 2015	TextFuseNet (ResNeXt-101)	Precision	93.96	# 1
Scene Text Detection	ICDAR 2015	TextFuseNet (ResNeXt-101)	Recall	90.56	# 2
Scene Text Detection	SCUT-CTW1500	TextFuseNet (ResNeXt-101)	F-Measure	87.4	# 4
Scene Text Detection	SCUT-CTW1500	TextFuseNet (ResNeXt-101)	Precision	89.7	# 4
Scene Text Detection	SCUT-CTW1500	TextFuseNet (ResNeXt-101)	Recall	85.1	# 5
Scene Text Detection	Total-Text	TextFuseNet (ResNeXt-101)	F-Measure	87.5%	# 4
Scene Text Detection	Total-Text	TextFuseNet (ResNeXt-101)	Precision	89.2	# 10
Scene Text Detection	Total-Text	TextFuseNet (ResNeXt-101)	Recall	85.8	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/textfusenet-scene-text-detection-with-richer/scene-text-detection-on-icdar-2013)](https://paperswithcode.com/sota/scene-text-detection-on-icdar-2013?p=textfusenet-scene-text-detection-with-richer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/textfusenet-scene-text-detection-with-richer/scene-text-detection-on-icdar-2015)](https://paperswithcode.com/sota/scene-text-detection-on-icdar-2015?p=textfusenet-scene-text-detection-with-richer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/textfusenet-scene-text-detection-with-richer/scene-text-detection-on-ic19-art)](https://paperswithcode.com/sota/scene-text-detection-on-ic19-art?p=textfusenet-scene-text-detection-with-richer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/textfusenet-scene-text-detection-with-richer/scene-text-detection-on-scut-ctw1500)](https://paperswithcode.com/sota/scene-text-detection-on-scut-ctw1500?p=textfusenet-scene-text-detection-with-richer)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/textfusenet-scene-text-detection-with-richer/scene-text-detection-on-total-text)](https://paperswithcode.com/sota/scene-text-detection-on-total-text?p=textfusenet-scene-text-detection-with-richer)`

TextFuseNet: Scene Text Detection with Richer Fused Features

17 May 2020 · Jian Ye, Zhe Chen, Juhua Liu, Bo Du ·

Arbitrary shape text detection in natural scenes is an extremely challenging task. Unlike existing text detection approaches that only perceive texts based on limited feature representations, we propose a novel framework, namely TextFuseNet, to exploit the use of richer features fused for text detection. More specifically, we propose to perceive texts from three levels of feature representations, i.e., character-, word- and global-level, and then introduce a novel text representation fusion technique to help achieve robust arbitrary text detection. The multi-level feature representation can adequately describe texts by dissecting them into individual characters while still maintaining their general semantics. TextFuseNet then collects and merges the texts’ features from different levels using a multi-path fusion architecture which can effectively align and fuse different representations. In practice, our proposed TextFuseNet can learn a more adequate description of arbitrary shapes texts, suppressing false positives and producing more accurate detection results. Our proposed framework can also be trained with weak supervision for those datasets that lack character-level annotations. Experiments on several datasets show that the proposed TextFuseNet achieves state-of-the-art performance. Specifically, we achieve an F-measure of 94.3% on ICDAR2013, 92.1% on ICDAR2015, 87.1% on Total-Text and 86.6% on CTW-1500, respectively.

PDF Abstract

Code

Add Remove Mark official

ying09/TextFuseNet

463

mindspore-ai/models

334

2023-MindSpore-1/ms-code-217

kingcong/textfusenet

MindSpore-paper-code-2/code3

See all 6 implementations

Tasks

Add Remove

Scene Text Detection

Text Detection

Datasets

ICDAR 2013

Total-Text ICDAR 2015

SCUT-CTW1500

Results from the Paper

Add Remove

Ranked #1 on Scene Text Detection on ICDAR 2015

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Scene Text Detection	IC19-Art	TextFuseNet (ResNeXt-101)	H-Mean	78.6	# 3	Compare
Scene Text Detection	ICDAR 2013	TextFuseNet (ResNeXt-101)	F-Measure	94.61%	# 1	Compare
			Precision	97.27	# 2	Compare
			Recall	92.09	# 2	Compare
Scene Text Detection	ICDAR 2015	TextFuseNet (ResNeXt-101)	F-Measure	92.23	# 1	Compare
			Precision	93.96	# 1	Compare
			Recall	90.56	# 2	Compare
Scene Text Detection	SCUT-CTW1500	TextFuseNet (ResNeXt-101)	F-Measure	87.4	# 4	Compare
			Precision	89.7	# 4	Compare
			Recall	85.1	# 5	Compare
Scene Text Detection	Total-Text	TextFuseNet (ResNeXt-101)	F-Measure	87.5%	# 4	Compare
			Precision	89.2	# 10	Compare
			Recall	85.8	# 4	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

TextFuseNet: Scene Text Detection with Richer Fused Features

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove