Scene Text Recognition

121 papers with code • 15 benchmarks • 27 datasets

See Scene Text Detection for leaderboards in this task.

Libraries

Use these libraries to find Scene Text Recognition models and implementations

Latest papers with no code

Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution

no code yet • 22 Nov 2023

Scene Text Image Super-Resolution (STISR) aims to enhance the resolution and legibility of text within low-resolution (LR) images, consequently elevating recognition accuracy in Scene Text Recognition (STR).

Reading Between the Lanes: Text VideoQA on the Road

no code yet • 8 Jul 2023

Text and signs around roads provide crucial information for drivers, vital for safe navigation and situational awareness.

DiffusionSTR: Diffusion Model for Scene Text Recognition

no code yet • 29 Jun 2023

This paper presents Diffusion Model for Scene Text Recognition (DiffusionSTR), an end-to-end text recognition framework using diffusion models for recognizing text in the wild.

Weakly Supervised Scene Text Generation for Low-resource Languages

no code yet • 25 Jun 2023

A large number of annotated training images is crucial for training successful scene text recognition models.

Masked and Permuted Implicit Context Learning for Scene Text Recognition

no code yet • 25 May 2023

We utilize the training procedure of PLM, and to integrate MLM, we incorporate word length information into the decoding process and replace the undetermined characters with mask tokens.

Scene Text Recognition with Image-Text Matching-guided Dictionary

no code yet • 8 May 2023

Inspired by ITC, the SITM network combines the visual features and the text features of all candidates to identify the candidate with the minimum distance in the feature space.

Improving Scene Text Recognition for Character-Level Long-Tailed Distribution

no code yet • 31 Mar 2023

However, STR models show a large performance degradation on languages with a numerous number of characters (e. g., Chinese and Korean), especially on characters that rarely appear due to the long-tailed distribution of characters in such languages.

Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model

no code yet • 13 Mar 2023

Despite the success of deep neural network (DNN) on sequential data (i. e., scene text and speech) recognition, it suffers from the over-confidence problem mainly due to overfitting in training with the cross-entropy loss, which may make the decision-making less reliable.

Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition

no code yet • 7 Mar 2023

Capturing images is a key part of automation for high-level tasks such as scene text recognition.

Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition

no code yet • 28 Feb 2023

While vision transformers have been highly successful in improving the performance in image-based tasks, not much work has been reported on applying transformers to multilingual scene text recognition due to the complexities in the visual appearance of multilingual texts.