Scene Text Detection

91 papers with code • 9 benchmarks • 15 datasets

Scene Text Detection is a computer vision task that involves automatically identifying and localizing text within natural images or videos. The goal of scene text detection is to develop algorithms that can robustly detect and and label text with bounding boxes in uncontrolled and complex environments, such as street signs, billboards, or license plates.

Source: ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Text Detection

Dataset	Best Model	Compare
ICDAR 2015	TextFuseNet (ResNeXt-101)	See all
Total-Text	MixNet	See all
MSRA-TD500	MixNet	See all
SCUT-CTW1500	MixNet	See all
ICDAR 2013	TextFuseNet (ResNeXt-101)	See all
ICDAR 2017 MLT	PMTD*	See all
COCO-Text	Corner-based Region Proposals	See all
IC19-Art	MixNet	See all
IC19-ReCTs	BDN	See all

Libraries

Use these libraries to find Scene Text Detection models and implementations

PaddlePaddle/PaddleOCR

9 papers

38,632

mindspore-lab/mindocr

7 papers

160

open-mmlab/mmocr

6 papers

4,086

JaidedAI/EasyOCR

3 papers

22,022

See all 8 libraries.

Datasets

Subtasks

Latest papers with no code

Most implemented Social Latest No code

Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models

no code yet • 28 Nov 2023

We contend that one main limitation of existing generation methods is the insufficient integration of foreground text with the background.

Paper
Add Code

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning

no code yet • 14 Aug 2023

Different from existing methods which integrate multiple-granularity features or multiple outputs, we resort to the perspective of representation learning in which auxiliary tasks are utilized to enable the encoder to jointly learn robust features with the main task of per-pixel classification during optimization.

Paper
Add Code

Separate Scene Text Detector for Unseen Scripts is Not All You Need

no code yet • 29 Jul 2023

It raises a critical question: Is there a need for separate training for new scripts?

Paper
Add Code

Adaptive Segmentation Network for Scene Text Detection

no code yet • 27 Jul 2023

Besides, we design a Global-information Enhanced Feature Pyramid Network (GE-FPN) for capturing text instances with macro size and extreme aspect ratios.

Paper
Add Code

CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer

no code yet • 25 Jul 2023

Contour based scene text detection methods have rapidly developed recently, but still suffer from inaccurate frontend contour initialization, multi-stage error accumulation, or deficient local information aggregation.

Paper
Add Code

TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision

no code yet • 6 Jun 2023

End-to-end text spotting is a vital computer vision task that aims to integrate scene text detection and recognition into a unified framework.

Paper
Add Code

Deformable Kernel Expansion Model for Efficient Arbitrary-shaped Scene Text Detection

no code yet • 28 Mar 2023

DKE employs a segmentation module to segment the shrunken text region as the text kernel, then expands the text kernel contour to obtain text boundary by regressing the vertex-wise offsets.

Paper
Add Code

Domain Adaptive Scene Text Detection via Subcategorization

no code yet • 1 Dec 2022

We study domain adaptive scene text detection, a largely neglected yet very meaningful task that aims for optimal transfer of labelled scene text images while handling unlabelled images in various new domains.

Paper
Add Code

Aggregated Text Transformer for Scene Text Detection

no code yet • 25 Nov 2022

We present the Aggregated Text TRansformer(ATTR), which is designed to represent texts in scene images with a multi-scale self-attention mechanism.

Paper
Add Code

Text Growing on Leaf

no code yet • 7 Sep 2022

Then, lateral and thin veins are generated along the main vein growth direction in polar coordinates.

Paper
Add Code

Scene Text Detection

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result