Irregular scene text recognition has attracted much attention from the research community, mainly due to the complex shapes of text in natural scenes.
Extensive experiments on standard benchmarks demonstrate that our end-to-end model achieves a new state of the art for both regular and irregular scene text recognition, while requiring about one-sixth the inference time of attention-based methods.
Scene Text Recognition is a challenging problem because of irregular styles and various distortions.
Driven by deep learning and the large volume of data, scene text recognition has evolved rapidly in recent years.
Convolutional Recurrent Neural Networks (CRNNs) excel at scene text recognition.
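CRNNs typically couple a convolutional feature extractor and recurrent layers with a CTC output layer. As a minimal sketch of the decoding step only (the alphabet and the blank index here are illustrative assumptions, not taken from any specific paper), greedy CTC decoding collapses repeated per-frame predictions and removes blanks:

```python
def ctc_greedy_decode(frame_ids, blank=0):
    """CTC best-path decoding: collapse consecutive repeats, then drop blanks."""
    out = []
    prev = None
    for s in frame_ids:
        if s != prev and s != blank:
            out.append(s)
        prev = s
    return out

# Per-frame argmax IDs over an assumed alphabet {0: blank, 1: 'c', 2: 'a', 3: 't'}
frames = [1, 1, 0, 2, 2, 0, 0, 3]
alphabet = {1: "c", 2: "a", 3: "t"}
print("".join(alphabet[i] for i in ctc_greedy_decode(frames)))  # -> cat
```

The blank symbol is what lets CTC emit the same character twice in a row: `[1, 0, 1]` decodes to two occurrences of symbol 1, whereas `[1, 1]` decodes to one.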
In this paper, we study a text recognition framework that accounts for long-term temporal dependencies in the encoder stage.
Attention-based scene text recognizers have achieved great success by leveraging more compact intermediate representations to learn 1D or 2D attention within an RNN-based encoder-decoder architecture.
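The core of such a decoder is an attention step: at each output character, the decoder state scores every encoder time step, the scores are normalized with a softmax, and the weighted sum of encoder states forms a context vector. A minimal sketch of 1D dot-product attention (the shapes and the dot-product scoring function are illustrative assumptions; many recognizers use learned additive scoring instead):

```python
import numpy as np

def attend(decoder_state, encoder_states):
    """1D dot-product attention.
    decoder_state: (d,) current decoder hidden state
    encoder_states: (T, d) encoder outputs, one row per time step
    Returns the (d,) context vector and the (T,) attention weights."""
    scores = encoder_states @ decoder_state       # (T,) alignment scores
    weights = np.exp(scores - scores.max())       # numerically stable softmax
    weights /= weights.sum()
    context = weights @ encoder_states            # (d,) weighted sum of states
    return context, weights
```

The context vector is then fed, together with the previous character embedding, into the RNN cell that predicts the next character; 2D variants score a grid of encoder features instead of a sequence.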
While each claims to have pushed the boundary of the technology, a holistic and fair comparison has been largely missing from the field due to inconsistent choices of training and evaluation datasets.
Reading text in the wild is a very challenging task due to the diversity of text instances and the complexity of natural scenes.
Nonetheless, most previous methods may not work well on low-resolution text, which is common in natural scene images.