DDI-100 (Distorted Document Images)

Introduced by Zharikov et al. in DDI-100: Dataset for Text Detection and Recognition

The DDI-100 dataset is a synthetic dataset for text detection and recognition based on 7000 real unique document pages and consists of more than 100000 augmented images. The ground truth comprises text and stamp masks, text and characters bounding boxes with relevant annotations.

Source: DDI-100

Papers


Paper Code Results Date Stars

Dataset Loaders


Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages