Search Results for author: Rafał Powalski

Found 3 papers, 3 papers with code

Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer

1 code implementation • 18 Feb 2021 • Rafał Powalski, Łukasz Borchmann, Dawid Jurkiewicz, Tomasz Dwojak, Michał Pietruszka, Gabriela Pałka

We address the challenging problem of Natural Language Comprehension beyond plain-text documents by introducing the TILT neural network architecture which simultaneously learns layout information, visual features, and textual semantics.

Ranked #7 on Visual Question Answering (VQA) on InfographicVQA (using extra training data)

Document Image Classification, Document Understanding +1
