Scene Text Detection

91 papers with code • 9 benchmarks • 15 datasets

Scene Text Detection is a computer vision task that involves automatically identifying and localizing text within natural images or videos. The goal of scene text detection is to develop algorithms that can robustly detect and and label text with bounding boxes in uncontrolled and complex environments, such as street signs, billboards, or license plates.

Source: ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Text Detection

Dataset	Best Model	Compare
ICDAR 2015	TextFuseNet (ResNeXt-101)	See all
Total-Text	MixNet	See all
MSRA-TD500	MixNet	See all
SCUT-CTW1500	MixNet	See all
ICDAR 2013	TextFuseNet (ResNeXt-101)	See all
ICDAR 2017 MLT	PMTD*	See all
COCO-Text	Corner-based Region Proposals	See all
IC19-Art	MixNet	See all
IC19-ReCTs	BDN	See all

Libraries

Use these libraries to find Scene Text Detection models and implementations

PaddlePaddle/PaddleOCR

9 papers

38,458

mindspore-lab/mindocr

7 papers

157

open-mmlab/mmocr

6 papers

4,068

vitae-transformer/vitae-transformer…

4 papers

See all 8 libraries.

Datasets

Subtasks

Latest papers

Most implemented Social Latest No code

Recurrent Generic Contour-based Instance Segmentation with Progressive Learning

fh2019ustc/polysnake • • 21 Jan 2023

It maintains a single estimate of the contour that is progressively deformed toward the object boundary.

21 Jan 2023

Paper
Code

CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection

xiizhao/cbn.pytorch • • 5 Dec 2022

Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion.

05 Dec 2022

Paper
Code

DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting

vitae-transformer/deepsolo • • CVPR 2023

In this paper, we present DeepSolo, a simple DETR-like baseline that lets a single Decoder with Explicit Points Solo for text detection and recognition simultaneously.

225

19 Nov 2022

Paper
Code

Arbitrary Shape Text Detection via Segmentation with Probability Maps

gxym/textpms • • 26 Aug 2022

To be concrete, we adopt a Sigmoid Alpha Function (SAF) to transfer the distances between boundaries and their inside pixels to a probability map.

26 Aug 2022

Paper
Code

DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer

vitae-transformer/deepsolo • • 10 Jul 2022

However, these methods built upon detection transformer framework might achieve sub-optimal training efficiency and performance due to coarse positional query modeling. In addition, the point label form exploited in previous works implies the reading order of humans, which impedes the detection robustness from our observation.

225

10 Jul 2022

Paper
Code

TraffSign: Multilingual Traffic Signboard Text Detection and Recognition for Urdu and English

aatiibutt/TraffSign • • Document Analysis Systems 2022

To this end, we present Deep Learning Laboratory’s Traffic Signboards Dataset (DLL-TraffSiD) to develop multi-lingual text detection and recognition methods for traffic signboards.

18 May 2022

Paper
Code

Vision-Language Pre-Training for Boosting Scene Text Detectors

alibabaresearch/advancedliteratemachinery • • CVPR 2022

In this paper, we specifically adapt vision-language joint learning for scene text detection, a task that intrinsically involves cross-modal interaction between the two modalities: vision and language, since text is the written form of language.

926

29 Apr 2022

Paper
Code

Towards End-to-End Unified Scene Text Detection and Layout Analysis

tensorflow/models • • CVPR 2022

In this paper, we bring them together and introduce the task of unified scene text detection and layout analysis.

76,589

28 Mar 2022

Paper
Code

SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition

mxin262/swintextspotter • • CVPR 2022

End-to-end scene text spotting has attracted great attention in recent years due to the success of excavating the intrinsic synergy of the scene text detection and recognition.

256

19 Mar 2022

Paper
Code

Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion

PaddlePaddle/PaddleOCR • • 21 Feb 2022

By incorporating the proposed DB and ASF with the segmentation network, our proposed scene text detector consistently achieves state-of-the-art results, in terms of both detection accuracy and speed, on five standard benchmarks.

38,458

21 Feb 2022

Paper
Code

Scene Text Detection

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result