Document AI

7 papers with code • 1 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?


Use these libraries to find Document AI models and implementations


Most implemented papers

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

microsoft/unilm 31 Dec 2019

In this paper, we propose the \textbf{LayoutLM} to jointly model interactions between text and layout information across scanned document images, which is beneficial for a great number of real-world document image understanding tasks such as information extraction from scanned documents.

DiT: Self-supervised Pre-training for Document Image Transformer

microsoft/unilm 4 Mar 2022

We leverage DiT as the backbone network in a variety of vision-based Document AI tasks, including document image classification, document layout analysis, table detection as well as text detection for OCR.

LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

microsoft/unilm 18 Apr 2022

In this paper, we propose \textbf{LayoutLMv3} to pre-train multimodal Transformers for Document AI with unified text and image masking.

Unifying Vision, Text, and Layout for Universal Document Processing

microsoft/udop 5 Dec 2022

UDOP leverages the spatial correlation between textual content and document image to model image, text, and layout modalities with one uniform representation.

Document Intelligence Metrics for Visually Rich Document Evaluation

metricsdi/dimetrics 23 May 2022

The processing of Visually-Rich Documents (VRDs) is highly important in information extraction tasks associated with Document Intelligence.

DoSA : A System to Accelerate Annotations on Business Documents with Human-in-the-Loop

neeleshkshukla/dosa 9 Nov 2022

An initial document-specific model can be trained and its inference can be used as feedback for generating more automated annotations.