no code implementations • LREC 2022 • Michael Arrigo, Stephanie Strassel, Nolan King, Thao Tran, Lisa Mason
CAMIO (Corpus of Annotated Multilingual Images for OCR) is a new corpus created by Linguistic Data Consortium to serve as a resource to support the development and evaluation of optical character recognition (OCR) and related technologies for 35 languages across 24 unique scripts.
Optical Character Recognition Optical Character Recognition (OCR)
no code implementations • 15 Oct 2024 • Reno Kriz, Kate Sanders, David Etter, Kenton Murray, Cameron Carpenter, Kelly Van Ochten, Hannah Recknor, Jimena Guallar-Blasco, Alexander Martin, Ronald Colaianni, Nolan King, Eugene Yang, Benjamin Van Durme
Efficiently retrieving and synthesizing information from large-scale multimodal collections has become a critical challenge.