SESYD Dataset (Systems Evaluation SYnthetic Documents)

SESYD ("Systems Evaluation SYnthetic Documents") is a database of synthetic documents with ground truth. It targets two main research problems in the document image analysis field: (i) symbol recognition and spotting in line-drawing images (floor plans and electrical diagrams), and (ii) character segmentation and recognition in geographical maps. The database comprises eleven collections for performance evaluation, containing 284k images, 190k symbols, and 284k characters (k denotes thousand). Published in 2010, SESYD is a key database in the document image analysis field and has been cited in about one hundred research papers.

Please cite the following paper [1] if you use this database.

[1] M. Delalandre, E. Valveny, T. Pridmore and D. Karatzas. Generation of Synthetic Documents for Performance Evaluation of Symbol Recognition & Spotting Systems. International Journal on Document Analysis and Recognition (IJDAR), 13(3):187-207, 2010. http://mathieu.delalandre.free.fr/publications/IJDAR2010.pdf

License


  • MIT