TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Scene Text Detection	IC19-Art	SRFormer (ResNet-50)	H-Mean	79.3	# 2
Scene Text Detection	SCUT-CTW1500	SRFormer (ResNet-50)	F-Measure	89.6	# 2
Scene Text Detection	SCUT-CTW1500	SRFormer (ResNet-50)	Precision	91.6	# 2
Scene Text Detection	SCUT-CTW1500	SRFormer (ResNet-50)	Recall	87.7	# 2
Scene Text Detection	Total-Text	SRFormer (ResNet-50)	F-Measure	90.0	# 2
Scene Text Detection	Total-Text	SRFormer (ResNet-50)	Precision	92.2	# 2
Scene Text Detection	Total-Text	SRFormer (ResNet-50)	Recall	87.9	# 2
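
The F-Measure rows above can be cross-checked against the Precision and Recall rows, assuming F-Measure here is the usual harmonic mean of the two (the page does not state the formula). A minimal sketch; the function name `f_measure` is illustrative and not taken from the SRFormer code:

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both on the same scale)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F-Measure) copied from the table above.
reported = {
    "SCUT-CTW1500": (91.6, 87.7, 89.6),
    "Total-Text": (92.2, 87.9, 90.0),
}

for dataset, (p, r, f_reported) in reported.items():
    f = f_measure(p, r)
    print(f"{dataset}: computed {f:.1f}, reported {f_reported}")
```

Under that assumption, both computed values round to the reported F-Measure at one decimal place.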

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/srformer-empowering-regression-based-text/scene-text-detection-on-ic19-art)](https://paperswithcode.com/sota/scene-text-detection-on-ic19-art?p=srformer-empowering-regression-based-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/srformer-empowering-regression-based-text/scene-text-detection-on-scut-ctw1500)](https://paperswithcode.com/sota/scene-text-detection-on-scut-ctw1500?p=srformer-empowering-regression-based-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/srformer-empowering-regression-based-text/scene-text-detection-on-total-text)](https://paperswithcode.com/sota/scene-text-detection-on-total-text?p=srformer-empowering-regression-based-text)`

SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression

21 Aug 2023 · Qingwen Bu, Sungrae Park, Minsoo Khang, Yichuan Cheng

Existing techniques for text detection can be broadly classified into two primary groups: segmentation-based and regression-based methods. Segmentation models offer enhanced robustness to font variations but require intricate post-processing, leading to high computational overhead. Regression-based methods undertake instance-aware prediction but face limitations in robustness and data efficiency due to their reliance on high-level representations. In our academic pursuit, we propose SRFormer, a unified DETR-based model with amalgamated Segmentation and Regression, aiming at the synergistic harnessing of the inherent robustness in segmentation representations, along with the straightforward post-processing of instance-level regression. Our empirical analysis indicates that favorable segmentation predictions can be obtained at the initial decoder layers. In light of this, we constrain the incorporation of segmentation branches to the first few decoder layers and employ progressive regression refinement in subsequent layers, achieving performance gains while minimizing computational load from the mask. Furthermore, we propose a Mask-informed Query Enhancement module. We take the segmentation result as a natural soft-ROI to pool and extract robust pixel representations, which are then employed to enhance and diversify instance queries. Extensive experimentation across multiple benchmarks has yielded compelling findings, highlighting our method's exceptional robustness, superior training and data efficiency, as well as its state-of-the-art performance. Our code is available at https://github.com/retsuh-bqw/SRFormer-Text-Det.
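
The soft-ROI idea in the abstract (using a predicted mask to pool pixel features, then using the result to enhance an instance query) can be illustrated with a small sketch. This is not the authors' implementation: the function names `soft_roi_pool` and `enhance_query`, the plain nested-list tensors, and the residual-addition fusion are all assumptions made for illustration; the abstract does not specify how the pooled features are combined with the queries.

```python
from typing import List

Vector = List[float]


def soft_roi_pool(features: List[List[Vector]], mask: List[List[float]]) -> Vector:
    """Mask-weighted average of per-pixel feature vectors.

    features: H x W grid of C-dimensional vectors.
    mask:     H x W soft mask with values in [0, 1], acting as the soft ROI.
    """
    channels = len(features[0][0])
    pooled = [0.0] * channels
    total_weight = 0.0
    for feat_row, mask_row in zip(features, mask):
        for feat, weight in zip(feat_row, mask_row):
            total_weight += weight
            for c in range(channels):
                pooled[c] += weight * feat[c]
    if total_weight == 0:
        return pooled  # empty mask: nothing to pool
    return [value / total_weight for value in pooled]


def enhance_query(query: Vector, pooled: Vector) -> Vector:
    """Hypothetical fusion step: element-wise residual addition (an assumption)."""
    return [q + p for q, p in zip(query, pooled)]


# Toy 2 x 2 feature map with 2 channels; the mask selects the diagonal pixels.
features = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[5.0, 6.0], [7.0, 8.0]],
]
mask = [
    [1.0, 0.0],
    [0.0, 1.0],
]

pooled = soft_roi_pool(features, mask)       # [4.0, 5.0]
query = enhance_query([0.5, 0.5], pooled)    # [4.5, 5.5]
print(pooled, query)
```

Because the mask is soft, pixels with low predicted text probability still contribute, only with proportionally smaller weight, which is what distinguishes this from pooling over a hard box or binarized region.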

Code

retsuh-bqw/SRFormer-Text-Det official

opendrivelab/elm

Tasks

regression

Scene Text Detection

Segmentation

Text Detection

Datasets

Total-Text

SCUT-CTW1500

Results from the Paper

Ranked #2 on Scene Text Detection on IC19-Art
