Search Results for author: Maksym Lysak

Found 9 papers, 3 papers with code

Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion

no code implementations27 Jan 2025 Nikolaos Livathinos, Christoph Auer, Maksym Lysak, Ahmed Nassar, Michele Dolfi, Panos Vagenas, Cesar Berrospi Ramis, Matteo Omenetti, Kasper Dinkla, Yusik Kim, Shubham Gupta, Rafael Teixeira de Lima, Valery Weber, Lucas Morin, Ingmar Meijer, Viktor Kuropiatnyk, Peter W. J. Staar

We introduce Docling, an easy-to-use, self-contained, MIT-licensed, open-source toolkit for document conversion, that can parse several types of popular document formats into a unified, richly structured representation.

ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents

no code implementations24 May 2023 Christoph Auer, Ahmed Nassar, Maksym Lysak, Michele Dolfi, Nikolaos Livathinos, Peter Staar

The results demonstrate substantial progress towards achieving robust and highly generalizing methods for document layout understanding.

Data Augmentation

Optimized Table Tokenization for Table Structure Recognition

no code implementations5 May 2023 Maksym Lysak, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer, Peter Staar

The benefits of OTSL are that it reduces the number of tokens to 5 (HTML needs 28+) and shortens the sequence length to half of HTML on average.

TableFormer: Table Structure Understanding with Transformers

3 code implementations CVPR 2022 Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar

In this way, we can obtain the content of the table-cells from programmatic PDF's directly from the PDF source and avoid the training of the custom OCR decoders.

Decoder object-detection +2

Robust PDF Document Conversion Using Recurrent Neural Networks

no code implementations18 Feb 2021 Nikolaos Livathinos, Cesar Berrospi, Maksym Lysak, Viktor Kuropiatnyk, Ahmed Nassar, Andre Carvalho, Michele Dolfi, Christoph Auer, Kasper Dinkla, Peter Staar

In this paper, we present a novel approach to document structure recovery in PDF using recurrent neural networks to process the low-level PDF data representation directly, instead of relying on a visual re-interpretation of the rendered PDF page, as has been proposed in previous literature.

Feature Engineering Information Retrieval +1

Cannot find the paper you are looking for? You can Submit a new open access paper.