Semantic entity labeling

9 papers with code • 1 benchmarks • 1 datasets

One of Form Understanding task (Word grouping, Semantic entity labeling, Entity linking)
Classifying entities into one of four pre-defined categories: question, answer, header and, other.

cited from

G. Jaume, H. K. Ekenel, J. Thiran "FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents," 2019

Benchmarks

Add a Result

These leaderboards are used to track progress in Semantic entity labeling

Trend	Dataset	Best Model	Paper	Code	Compare
	FUNSD	LayoutMask (large)			See all

Libraries

Use these libraries to find Semantic entity labeling models and implementations

huggingface/transformers

3 papers

124,889

Datasets

FUNSD

Most implemented papers

Most implemented Social Latest No code

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

microsoft/unilm • • ACL 2021

Pre-training of text and layout has proved effective in a variety of visually-rich document understanding tasks due to its effective model architecture and the advantage of large-scale unlabeled scanned/digital-born documents.

Paper
Code

LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

jpwang/lilt • • ACL 2022

LiLT can be pre-trained on the structured documents of a single language and then directly fine-tuned on other languages with the corresponding off-the-shelf monolingual/multilingual pre-trained textual models.

Paper
Code

LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking

microsoft/unilm • • 18 Apr 2022

In this paper, we propose \textbf{LayoutLMv3} to pre-train multimodal Transformers for Document AI with unified text and image masking.

Paper
Code

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding

PaddlePaddle/PaddleNLP • • 12 Oct 2022

Recent years have witnessed the rise and success of pre-training techniques in visually-rich document understanding.

Paper
Code

Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks

andreagemelli/doc2graph • • 23 Aug 2022

Geometric Deep Learning has recently attracted significant interest in a wide range of machine learning fields, including document analysis.

Paper
Code

XDoc: Unified Pre-training for Cross-Format Document Understanding

microsoft/unilm • • 6 Oct 2022

The surge of pre-training has witnessed the rapid development of document understanding recently.

Paper
Code

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

PaddlePaddle/VIMER • • 1 Mar 2023

Compared to the masked multi-modal modeling methods for document image understanding that rely on both the image and text modalities, StrucTexTv2 models image-only input and potentially deals with more application scenarios free from OCR pre-processing.

Paper
Code

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

alibabaresearch/advancedliteratemachinery • • CVPR 2023

Additionally, novel relation heads, which are pre-trained by the geometric pre-training tasks and fine-tuned for RE, are elaborately designed to enrich and enhance the feature representation.

Paper
Code

Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction

chongzhangfdu/tpp • 17 Oct 2023

However, BIO-tagging scheme relies on the correct order of model inputs, which is not guaranteed in real-world NER on scanned VrDs where text are recognized and arranged by OCR systems.

Paper
Code

Semantic entity labeling

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result