DIGITal (Digitally Generated Numerals)

Introduced by Fateh et al. in Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning

Description

The Digitally Generated Numerals (DIGITal) dataset consists of 100,000 image pairs representing the digits 0 to 9. Each pair contains a low-quality and a high-quality version, at a resolution of 128x128 pixels.
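Because every sample is a low/high-quality pair, a loader mainly has to match the two versions of each image. The following is a minimal sketch of that matching step, not an official loader. The directory layout (`low/` and `high/` subfolders under a common root, with identical relative paths), the `.png` extension, and the function name `pair_images` are all assumptions made for illustration; the source does not describe how the files are organized.

```python
from pathlib import Path
from typing import List, Tuple


def pair_images(root: Path) -> List[Tuple[Path, Path]]:
    """Return (low_quality, high_quality) path pairs found under root.

    Assumed (hypothetical) layout: root/low/<...>/name.png has its
    counterpart at root/high/<...>/name.png.
    """
    low_dir = root / "low"
    high_dir = root / "high"
    pairs = []
    for low_path in sorted(low_dir.rglob("*.png")):
        # Look for the file with the same relative path on the high-quality side.
        high_path = high_dir / low_path.relative_to(low_dir)
        if high_path.is_file():
            pairs.append((low_path, high_path))
    return pairs
```

Files without a counterpart are skipped rather than raising an error, which keeps the sketch tolerant of partial downloads; a stricter loader might report them instead.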

Languages

The dataset is generated from fonts in the following languages:

  • Arabic
  • Chinese (Simplified)
  • Farsi and Urdu
  • Gurmukhi
  • Gujarati
  • Tibetan
  • ARDIS (Sweden)
  • ISI Bangla
  • Bangla Lekha
  • Kannada
  • English

Image Resolution

  • Resolution: 128x128 pixels

Image Pairs

  • Each digit (0 to 9) has 1000 associated images.

Total Images

  • Total images: 100,000
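Taken on their own, 10 digits with 1000 images each give 10,000 images, not 100,000. The two figures agree if the per-digit count applies separately to each script group; that reading is an assumption, not something the source states. The short check below computes how many such groups the stated total implies. The function name `implied_groups` is hypothetical.

```python
NUM_DIGITS = 10            # digits 0 to 9
IMAGES_PER_DIGIT = 1000    # as stated under "Image Pairs"
STATED_TOTAL = 100_000     # as stated under "Total Images"


def implied_groups(total: int, digits: int, per_digit: int) -> int:
    """Number of groups needed for digits * per_digit * groups == total."""
    groups, remainder = divmod(total, digits * per_digit)
    if remainder:
        raise ValueError("total is not a whole multiple of digits * per_digit")
    return groups


print(implied_groups(STATED_TOTAL, NUM_DIGITS, IMAGES_PER_DIGIT))  # prints 10
```

The result, 10, matches the number of labeled groups in the Samples section (where ISI Bangla and BanglaLekha share one label), which supports the per-group reading without confirming it.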

Samples

| Image 1 | Image 2 | Image 3 | Image 4 | Image 5 |
|:------:|:------:|:------:|:------:|:------:|
| Arabic | ARDIS | Chinese | Farsi & Urdu | Gujarati |

| Image 5 | Image 6 | Image 7 | Image 8 | Image 5 |
|:------:|:------:|:------:|:------:|:------:|
| ISI Bangla & BanglaLekha | Kannada | Tibetan | English | Gurmukhi |

The paired low- and high-quality images make the dataset suitable for digit recognition, font analysis, and super-resolution tasks in computer vision and machine learning.
