Optical Character Recognition (OCR)

316 papers with code • 5 benchmarks • 42 datasets

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Benchmarks

Add a Result

These leaderboards are used to track progress in Optical Character Recognition (OCR)

Dataset	Best Model	Compare
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study	DTrOCR	See all
FSNS - Test	AttentionOCR_Inception-resnet-v2_Location	See all
I2L-140K	I2L-NOPOOL	See all
SUT	Tesseract	See all
im2latex-100k	I2L-STRIPS	See all

Libraries

Use these libraries to find Optical Character Recognition (OCR) models and implementations

PaddlePaddle/PaddleOCR

18 papers

38,910

open-mmlab/mmocr

6 papers

4,099

alibabaresearch/advancedliteratemac…

5 papers

998

Media-Smart/vedastr

5 papers

531

See all 10 libraries.

Datasets

Subtasks

Irregular Text Recognition

Handwritten Chinese Text Recognition

Offline Handwritten Chinese Character Recognition

Word Spotting In Handwritten Documents

Handwritten Digit Image Synthesis

Grapheme Detection

Most implemented papers

Most implemented Social Latest No code

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

PaddlePaddle/PaddleOCR • • 21 Jul 2015

In this paper, we investigate the problem of scene text recognition, which is among the most important and challenging tasks in image-based sequence recognition.

Paper
Code

EAST: An Efficient and Accurate Scene Text Detector

PaddlePaddle/PaddleOCR • • CVPR 2017

Previous approaches for scene text detection have already achieved promising performances across various benchmarks.

Paper
Code

Shape Robust Text Detection with Progressive Scale Expansion Network

PaddlePaddle/PaddleOCR • • CVPR 2019

Due to the fact that there are large geometrical margins among the minimal scale kernels, our method is effective to split the close text instances, making it easier to use segmentation-based methods to detect arbitrary-shaped text instances.

Paper
Code

Real-time Scene Text Detection with Differentiable Binarization

MhLiao/DB • • 20 Nov 2019

Recently, segmentation-based methods are quite popular in scene text detection, as the segmentation results can more accurately describe scene text of various shapes such as curve text.

Paper
Code

Image-to-Markup Generation with Coarse-to-Fine Attention

harvardnlp/im2markup • • ICML 2017

We present a neural encoder-decoder model to convert images into presentational markup based on a scalable coarse-to-fine attention mechanism.

Paper
Code

PP-OCR: A Practical Ultra Lightweight OCR System

PaddlePaddle/PaddleOCR • • 21 Sep 2020

Meanwhile, several pre-trained models for the Chinese and English recognition are released, including a text detector (97K images are used), a direction classifier (600K images are used) as well as a text recognizer (17. 9M images are used).

Paper
Code

Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition

PaddlePaddle/PaddleOCR • • 2 Nov 2018

Recognizing irregular text in natural scene images is challenging due to the large variance in text appearance, such as curvature, orientation and distortion.

Paper
Code

A Multi-Object Rectified Attention Network for Scene Text Recognition

Canjie-Luo/MORAN_v2 • • 10 Jan 2019

It decreases the difficulty of recognition and enables the attention-based sequence recognition network to more easily read irregular text.

Paper
Code

Image-based table recognition: data, model, and evaluation

ibm-aur-nlp/PubTabNet • ECCV 2020

In addition, we propose a new Tree-Edit-Distance-based Similarity (TEDS) metric for table recognition, which more appropriately captures multi-hop cell misalignment and OCR errors than the pre-established metric.

Paper
Code

Robust Scene Text Recognition with Automatic Rectification

PaddlePaddle/PaddleOCR • • CVPR 2016

We show that the model is able to recognize several types of irregular text, including perspective text and curved text.

Paper
Code

Optical Character Recognition (OCR)

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result