…We also provided the bounding box annotations (YOLO format) for the segmentation of words/lines and the ground truth annotations for full-text, along with the segmented images and their positions. The BN-HTRd dataset can be adopted as a basis for various handwriting classification tasks such as end-to-end document recognition, word-spotting, word/line segmentation, and so on.
2 PAPERS • 2 BENCHMARKS
Digital Peter is a dataset of Peter the Great's manuscripts annotated for segmentation and text recognition.
3 PAPERS • 1 BENCHMARK
…The localization of each field is not available in such a way that this dataset encourages research on segmentation-free systems for information extraction.