Search Results for author: Štěpán Šimsa

Found 3 papers, 1 papers with code

DocILE Benchmark for Document Information Localization and Extraction

1 code implementation11 Feb 2023 Štěpán Šimsa, Milan Šulc, Michal Uřičář, Yash Patel, Ahmed Hamdi, Matěj Kocián, Matyáš Skalický, Jiří Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas

This paper introduces the DocILE benchmark with the largest dataset of business documents for the tasks of Key Information Localization and Extraction and Line Item Recognition.

Key Information Extraction Unsupervised Pre-training

DocILE 2023 Teaser: Document Information Localization and Extraction

no code implementations29 Jan 2023 Štěpán Šimsa, Milan Šulc, Matyáš Skalický, Yash Patel, Ahmed Hamdi

The DocILE 2023 competition, hosted as a lab at the CLEF 2023 conference and as an ICDAR 2023 competition, will run the first major benchmark for the tasks of Key Information Localization and Extraction (KILE) and Line Item Recognition (LIR) from business documents.

Information Retrieval Retrieval

Business Document Information Extraction: Towards Practical Benchmarks

no code implementations20 Jun 2022 Matyáš Skalický, Štěpán Šimsa, Michal Uřičář, Milan Šulc

Information extraction from semi-structured documents is crucial for frictionless business-to-business (B2B) communication.

Cannot find the paper you are looking for? You can Submit a new open access paper.