TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Handwriting Recognition	KOHTD	Flor	CER	6.52	# 1
Handwriting Recognition	KOHTD	Bluche	CER	8.36	# 4
Handwriting Recognition	KOHTD	Puigcerver	CER	8.01	# 2
Handwriting Recognition	KOHTD	Abdallah	CER	8.22	# 3
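
The CER values above are character error rates, where lower is better. As a rough illustration of how such a figure is conventionally obtained, the sketch below computes a corpus-level CER as the total Levenshtein edit distance divided by the total number of reference characters. It assumes the reported values are percentages and that no text normalization is applied; neither is stated on this page. The function names `levenshtein` and `cer` are hypothetical and are not taken from the KOHTD code.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Edit distance (insertions, deletions, substitutions) between two strings."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion from ref
                            curr[j - 1] + 1,      # insertion into ref
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(references, hypotheses) -> float:
    """Corpus-level CER in percent (assumed convention, not confirmed by the source)."""
    total_edits = sum(levenshtein(r, h) for r, h in zip(references, hypotheses))
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    return 100.0 * total_edits / total_chars
```

For example, `cer(["сәлем"], ["салем"])` counts one substituted character out of five. Averaging per-sample CER instead of pooling edits over the corpus is another common convention and can give a different number.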

KOHTD: Kazakh Offline Handwritten Text Dataset

22 Sep 2021 · Nazgul Toiganbayeva, Mahmoud Kasem, Galymzhan Abdimanap, Kairat Bostanbekov, Abdelrahman Abdallah, Anel Alimova, Daniyar Nurseitov

Despite the transition to digital information exchange, many documents, such as invoices, tax forms, memos, questionnaires, historical data, and answers to exam questions, still require handwritten input. There is therefore a need for Handwritten Text Recognition (HTR), the automatic transcription of handwritten records by computer. Handwriting recognition is challenging because of the virtually infinite number of ways a person can write the same message. Research on Kazakh handwritten text recognition requires a comprehensive dataset of Kazakh handwritten texts, yet no such dataset has been available. In this paper, we propose an extensive Kazakh Offline Handwritten Text Dataset (KOHTD), which contains 3,000 handwritten exam papers, more than 140,335 segmented images, and approximately 922,010 symbols. It can serve researchers working on handwriting recognition tasks with deep learning and machine learning. We applied a variety of popular text recognition methods for word and line recognition in our studies, including CTC-based and attention-based methods. The findings demonstrate KOHTD's diversity. We also propose a Genetic Algorithm (GA) for line and word segmentation based on random enumeration of a parameter. The dataset and GA code are available at https://github.com/abdoelsayed2016/KOHTD.

Code

abdoelsayed2016/KOHTD official

Tasks

Handwriting Recognition

Handwritten Text Recognition

HTR

Datasets

Introduced in the Paper:

KOHTD

Results from the Paper

Ranked #1 on Handwriting Recognition on KOHTD

Methods

BiLSTM • Convolution • GA • LSTM • Max Pooling • PyTorch DDP • R-CNN • Sigmoid Activation • SVM • Tanh Activation
