Search Results for author: Emanuele Vivoli

Found 5 papers, 2 papers with code

Multimodal Transformer for Comics Text-Cloze

no code implementations • 6 Mar 2024 • Emanuele Vivoli, Joan Lafuente Baeza, Ernest Valveny Llobet, Dimosthenis Karatzas

This work explores a closure task in comics, a medium where visual and textual elements are intricately intertwined.

Language Modelling • Large Language Model • +1

Error assessment of microwave holography inversion for shallow buried objects

no code implementations • 27 Mar 2023 • Emanuele Vivoli, Luca Bossi, Marco Bertini, Pierluigi Falorni, Lorenzo Capineri

Holographic imaging is a technique that uses microwave energy to create a three-dimensional image of an object or scene.

CTE: A Dataset for Contextualized Table Extraction

1 code implementation • 2 Feb 2023 • Andrea Gemelli, Emanuele Vivoli, Simone Marinai

We define the task of Contextualized Table Extraction (CTE), which aims to extract and define the structure of tables considering the textual context of the document.

Document Layout Analysis • Table Detection • +1

MUST-VQA: MUltilingual Scene-text VQA

no code implementations • 14 Sep 2022 • Emanuele Vivoli, Ali Furkan Biten, Andres Mafla, Dimosthenis Karatzas, Lluis Gomez

In this paper, we present a framework for Multilingual Scene Text Visual Question Answering that deals with new languages in a zero-shot fashion.

Question Answering • Visual Question Answering

Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents

1 code implementation • 23 Aug 2022 • Andrea Gemelli, Emanuele Vivoli, Simone Marinai

Tables are widely used in several types of documents since they can bring important information in a structured way.

Optical Character Recognition (OCR) • Table Extraction
