Scene Text Telescope: Text-Focused Scene Image Super-Resolution

CVPR 2021 · Jingye Chen, Bin Li, Xiangyang Xue

Image super-resolution, which is often regarded as a preprocessing step for scene text recognition, aims to recover realistic features from a low-resolution text image. The task remains challenging due to large variations in text shapes, fonts, backgrounds, etc. However, most existing methods employ generic super-resolution frameworks to handle scene text images while ignoring text-specific properties such as text-level layouts and character-level details. In this paper, we establish a text-focused super-resolution framework, called Scene Text Telescope (STT). In terms of text-level layouts, we propose a Transformer-Based Super-Resolution Network (TBSRN) containing a Self-Attention Module to extract sequential information, which is robust to text in arbitrary orientations. In terms of character-level details, we propose a Position-Aware Module and a Content-Aware Module to highlight the position and the content of each character. Observing that some characters look indistinguishable in low-resolution conditions, we use a weighted cross-entropy loss to tackle this problem. We conduct extensive experiments, including text recognition with pre-trained recognizers and image quality evaluation, on TextZoom and several scene text recognition benchmarks to assess the super-resolution images. The experimental results show that STT generates text-focused super-resolution images and outperforms existing methods in terms of recognition accuracy.
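
The weighted cross-entropy idea mentioned in the abstract can be illustrated with a minimal sketch. The function name, the three-letter alphabet, and the weight values below are assumptions chosen for illustration; they are not taken from the paper or from the FudanVI/FudanOCR code, and the paper's actual weighting scheme may differ.

```python
import math

def weighted_cross_entropy(logits, target, class_weights):
    """Weighted cross-entropy for a single character position.

    logits:        raw scores, one per character class
    target:        index of the ground-truth class
    class_weights: per-class weights (hypothetical values, see below)
    """
    # Numerically stable log-softmax: subtract the max before exponentiating.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    log_prob = logits[target] - log_sum
    # Scale the usual negative log-likelihood by the weight of the true class.
    return -class_weights[target] * log_prob

# Hypothetical alphabet: assume 'a' and 'e' are easily confused at low
# resolution, so they receive a larger weight than 'x'.
alphabet = ["a", "e", "x"]
weights = [2.0, 2.0, 1.0]

logits = [1.0, 1.0, 0.0]  # the model cannot separate 'a' from 'e'
loss_weighted = weighted_cross_entropy(logits, 0, weights)
loss_plain = weighted_cross_entropy(logits, 0, [1.0, 1.0, 1.0])
```

With these assumed weights, a mistake on a confusable character contributes twice as much to the loss as it would under plain cross-entropy, which pushes training to focus on those characters.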

Code

FudanVI/FudanOCR official

309

Tasks

Image Super-Resolution

Optical Character Recognition (OCR)

Position

Scene Text Recognition

Super-Resolution

Datasets

TextZoom

Results from the Paper

Ranked #3 on Optical Character Recognition (OCR) on Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Optical Character Recognition (OCR)	Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study	TransOCR	Accuracy (%)	72.8	# 3
