Semantic entity labeling

12 papers with code • 2 benchmarks • 2 datasets

  • One of Form Understanding task (Word grouping, Semantic entity labeling, Entity linking)
  • Classifying entities into one of four pre-defined categories: question, answer, header and, other.

cited from

G. Jaume, H. K. Ekenel, J. Thiran "FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents," 2019

Libraries

Use these libraries to find Semantic entity labeling models and implementations

Most implemented papers

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

microsoft/unilm ACL 2021

Pre-training of text and layout has proved effective in a variety of visually-rich document understanding tasks due to its effective model architecture and the advantage of large-scale unlabeled scanned/digital-born documents.

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

jpwang/lilt ACL 2022

LiLT can be pre-trained on the structured documents of a single language and then directly fine-tuned on other languages with the corresponding off-the-shelf monolingual/multilingual pre-trained textual models.

LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

microsoft/unilm 18 Apr 2022

In this paper, we propose \textbf{LayoutLMv3} to pre-train multimodal Transformers for Document AI with unified text and image masking.

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding

PaddlePaddle/PaddleNLP 12 Oct 2022

Recent years have witnessed the rise and success of pre-training techniques in visually-rich document understanding.

Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction

chongzhangfdu/tpp 17 Oct 2023

However, BIO-tagging scheme relies on the correct order of model inputs, which is not guaranteed in real-world NER on scanned VrDs where text are recognized and arranged by OCR systems.

Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks

andreagemelli/doc2graph 23 Aug 2022

Geometric Deep Learning has recently attracted significant interest in a wide range of machine learning fields, including document analysis.

XDoc: Unified Pre-training for Cross-Format Document Understanding

microsoft/unilm 6 Oct 2022

The surge of pre-training has witnessed the rapid development of document understanding recently.

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

PaddlePaddle/VIMER 1 Mar 2023

Compared to the masked multi-modal modeling methods for document image understanding that rely on both the image and text modalities, StrucTexTv2 models image-only input and potentially deals with more application scenarios free from OCR pre-processing.

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

alibabaresearch/advancedliteratemachinery CVPR 2023

Additionally, novel relation heads, which are pre-trained by the geometric pre-training tasks and fine-tuned for RE, are elaborately designed to enrich and enhance the feature representation.

PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction

ZeningLin/PEneo 7 Jan 2024

However, simply concatenating SER and RE serially can lead to severe error propagation, and it fails to handle cases like multi-line entities in real scenarios.