HTR

37 papers with code • 2 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Datasets


Most implemented papers

OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold

IntuitionMachines/OrigamiNet CVPR 2020

On IAM we even surpass single line methods that use accurate localization information during training.

Full Page Handwriting Recognition via Image to Sequence Extraction

kingyiusuen/image-to-latex 11 Mar 2021

We present a Neural Network based Handwritten Text Recognition (HTR) model architecture that can be trained to recognize full pages of handwritten or printed text without image segmentation.

Boosting Handwriting Text Recognition in Small Databases with Transfer Learning

josarajar/HTRTF 4 Apr 2018

We first investigate, for a reduced and fixed number of training samples, 350 lines, how the learning from a large database, the IAM, can be transferred to the learning of the CLC of a reduced database, Washington.

ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation

amzn/convolutional-handwriting-gan CVPR 2020

This is especially true for handwritten text recognition (HTR), where each author has a unique style, unlike printed text, where the variation is smaller by design.

Data Generation for Post-OCR correction of Cyrillic handwriting

dbrainio/cyrillichandwritingpoc 27 Nov 2023

We apply a Handwritten Text Recognition (HTR) model to this dataset to identify OCR errors, forming the basis for our POC model training.

Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition

mehreenmehreen/muharaf 13 Jun 2024

We present the Manuscripts of Handwritten Arabic~(Muharaf) dataset, which is a machine learning dataset consisting of more than 1, 600 historic handwritten page images transcribed by experts in archival Arabic.

Character-Based Handwritten Text Transcription with Attention Networks

jvpoulos/Attention-OCR 11 Dec 2017

When the sequence alignment is one-to-one, softmax attention is able to learn a more precise alignment at each step of the decoding, whereas the alignment generated by sigmoid attention is much less precise.

Detecting Tweets Mentioning Drug Name and Adverse Drug Reaction with Hierarchical Tweet Representation and Multi-Head Self-Attention

wuch15/SMM4H_THU_NGN WS 2018

This paper describes our system for the first and third shared tasks of the third Social Media Mining for Health Applications (SMM4H) workshop, which aims to detect the tweets mentioning drug names and adverse drug reactions.

Handwriting Recognition of Historical Documents with few labeled data

0x454447415244/HandwritingRecognitionSystem 10 Nov 2018

In this work, we demonstrate how to train an HTR system with few labeled data.

A Few-shot Learning Approach for Historical Ciphered Manuscript Recognition

dali92002/htrbymatching 26 Sep 2020

Encoded (or ciphered) manuscripts are a special type of historical documents that contain encrypted text.