Scene Text Recognition

121 papers with code • 15 benchmarks • 27 datasets

See Scene Text Detection for leaderboards in this task.

Latest papers with no code

JSTR: Judgment Improves Scene Text Recognition

no code yet • 9 Apr 2024

In this paper, we present a method for enhancing the accuracy of scene text recognition tasks by judging whether the image and text match each other.

Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss

no code yet • 12 Mar 2024

In this paper, we propose a novel open-vocabulary text recognition framework, Pseudo-OCR, to recognize OOV words.

IndicSTR12: A Dataset for Indic Scene Text Recognition

no code yet • 12 Mar 2024

Several benchmark datasets and substantial work on deep learning models are available for Latin languages to meet this need.

Efficiently Leveraging Linguistic Priors for Scene Text Spotting

no code yet • 27 Feb 2024

This paper proposes a method that leverages linguistic knowledge from a large text corpus to replace the traditional one-hot encoding used in auto-regressive scene text spotting and recognition models.

Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition

no code yet • 24 Feb 2024

Scene text recognition (STR) is a challenging task that requires large-scale annotated data for training.

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

no code yet • 12 Feb 2024

We introduce Lumos, the first end-to-end multimodal question-answering system with text understanding capabilities.

Instruction-Guided Scene Text Recognition

no code yet • 31 Jan 2024

Multi-modal models have shown appealing performance in visual tasks recently, as instruction-guided training has evoked the ability to understand fine-grained visual content.

CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition

no code yet • 18 Jan 2024

However, the guidance of visual cues is ignored in the process of semantic mining, which limits the performance of the algorithm in recognizing irregular scene text.

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition

no code yet • 19 Dec 2023

Scene text recognition has attracted growing attention due to its diverse applications.

STR-Cert: Robustness Certification for Deep Text Recognition on Deep Learning Pipelines and Vision Transformers

no code yet • 28 Nov 2023

Robustness certification, which aims to formally certify the predictions of neural networks against adversarial inputs, has become an important tool for safety-critical applications.