Scene Text Detection

91 papers with code • 9 benchmarks • 15 datasets

Scene Text Detection is a computer vision task that involves automatically identifying and localizing text within natural images or videos. The goal is to develop algorithms that robustly detect text and label it with bounding boxes in uncontrolled, complex environments, on objects such as street signs, billboards, or license plates.

Source: ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection

Recurrent Generic Contour-based Instance Segmentation with Progressive Learning

fh2019ustc/polysnake 21 Jan 2023

It maintains a single estimate of the contour that is progressively deformed toward the object boundary.

57
21 Jan 2023

CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection

xiizhao/cbn.pytorch 5 Dec 2022

Recently, segmentation-based methods are quite popular in scene text detection, which mainly contain two steps: text kernel segmentation and expansion.

10
05 Dec 2022
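
The two steps named in the description above (text kernel segmentation, then expansion) can be illustrated with a small sketch. This is not CBNet's method; it is a generic breadth-first expansion of the kind used by segmentation-based detectors, and the function name `expand_kernels` and the toy inputs are assumptions made for illustration. Each labeled kernel grows outward until it fills the surrounding text mask, and a pixel reachable from two kernels goes to whichever reaches it first.

```python
from collections import deque

def expand_kernels(kernel_labels, text_mask):
    """Grow labeled text kernels outward until they fill the text mask.

    kernel_labels: 2D list of ints, 0 = unlabeled, >0 = kernel instance id.
    text_mask:     2D list of 0/1, 1 = pixel predicted as text.
    Hypothetical helper; ties go to the kernel that arrives first.
    """
    height, width = len(text_mask), len(text_mask[0])
    labels = [row[:] for row in kernel_labels]  # copy, keep the input intact
    queue = deque(
        (r, c) for r in range(height) for c in range(width) if labels[r][c] > 0
    )
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < height and 0 <= nc < width
            if inside and text_mask[nr][nc] and labels[nr][nc] == 0:
                labels[nr][nc] = labels[r][c]  # inherit the kernel's id
                queue.append((nr, nc))
    return labels

# Toy example: two kernels inside one connected text mask.
kernels = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 0, 0, 0, 2, 0],
    [0, 0, 0, 0, 0, 0, 0],
]
mask = [
    [1, 1, 1, 0, 1, 1, 1],
    [1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 1, 1],
]
expanded = expand_kernels(kernels, mask)
```

Because the kernels are shrunken versions of the text regions, two adjacent words that touch in the full mask still receive separate instance ids after expansion.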

DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting

vitae-transformer/deepsolo CVPR 2023

In this paper, we present DeepSolo, a simple DETR-like baseline that lets a single Decoder with Explicit Points Solo for text detection and recognition simultaneously.

225
19 Nov 2022

Arbitrary Shape Text Detection via Segmentation with Probability Maps

gxym/textpms 26 Aug 2022

To be concrete, we adopt a Sigmoid Alpha Function (SAF) to transfer the distances between boundaries and their inside pixels to a probability map.

94
26 Aug 2022

DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer

vitae-transformer/deepsolo 10 Jul 2022

However, these methods built upon detection transformer framework might achieve sub-optimal training efficiency and performance due to coarse positional query modeling. In addition, the point label form exploited in previous works implies the reading order of humans, which impedes the detection robustness from our observation.

225
10 Jul 2022

TraffSign: Multilingual Traffic Signboard Text Detection and Recognition for Urdu and English

aatiibutt/TraffSign Document Analysis Systems 2022

To this end, we present Deep Learning Laboratory’s Traffic Signboards Dataset (DLL-TraffSiD) to develop multi-lingual text detection and recognition methods for traffic signboards.

1
18 May 2022

Vision-Language Pre-Training for Boosting Scene Text Detectors

alibabaresearch/advancedliteratemachinery CVPR 2022

In this paper, we specifically adapt vision-language joint learning for scene text detection, a task that intrinsically involves cross-modal interaction between the two modalities: vision and language, since text is the written form of language.

926
29 Apr 2022

Towards End-to-End Unified Scene Text Detection and Layout Analysis

tensorflow/models CVPR 2022

In this paper, we bring them together and introduce the task of unified scene text detection and layout analysis.

76,589
28 Mar 2022

SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition

mxin262/swintextspotter CVPR 2022

End-to-end scene text spotting has attracted great attention in recent years due to the success of excavating the intrinsic synergy of the scene text detection and recognition.

256
19 Mar 2022

Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion

PaddlePaddle/PaddleOCR 21 Feb 2022

By incorporating the proposed DB and ASF with the segmentation network, our proposed scene text detector consistently achieves state-of-the-art results, in terms of both detection accuracy and speed, on five standard benchmarks.

38,458
21 Feb 2022
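
As a rough illustration of the DB step mentioned above, differentiable binarization replaces a hard threshold with a steep sigmoid of the difference between a probability map P and a learned threshold map T, B = 1 / (1 + exp(-k (P - T))), with k set to 50 in the DB papers. The sketch below applies that formula to plain nested lists; the function name and the toy maps are assumptions for illustration, and the segmentation network and ASF module are not modeled.

```python
import math

def differentiable_binarization(prob_map, thresh_map, k=50.0):
    """Approximate binary map: 1 / (1 + exp(-k * (P - T))) per pixel.

    prob_map and thresh_map are same-shaped 2D lists with values in [0, 1].
    A large k makes the output close to a hard threshold while staying
    differentiable with respect to both maps.
    """
    return [
        [1.0 / (1.0 + math.exp(-k * (p - t))) for p, t in zip(p_row, t_row)]
        for p_row, t_row in zip(prob_map, thresh_map)
    ]

# Toy maps (illustrative values, not taken from any benchmark).
prob = [[0.9, 0.2], [0.5, 0.7]]
thresh = [[0.3, 0.3], [0.5, 0.6]]
approx_binary = differentiable_binarization(prob, thresh)
```

Pixels whose probability sits well above the local threshold come out near 1, those well below come out near 0, and a pixel exactly at its threshold maps to 0.5.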