no code implementations • LREC 2022 • Michael Arrigo, Stephanie Strassel, Nolan King, Thao Tran, Lisa Mason
CAMIO (Corpus of Annotated Multilingual Images for OCR) is a new corpus created by Linguistic Data Consortium to serve as a resource to support the development and evaluation of optical character recognition (OCR) and related technologies for 35 languages across 24 unique scripts.
Optical Character Recognition Optical Character Recognition (OCR)