Document Layout Analysis
36 papers with code • 4 benchmarks • 9 datasets
"Document Layout Analysis is performed to determine physical structure of a document, that is, to determine document components. These document components can consist of single connected components-regions [...] of pixels that are adjacent to form single regions [...] , or group of text lines. A text line is a group of characters, symbols, and words that are adjacent, “relatively close” to each other and through which a straight line can be drawn (usually with horizontal or vertical orientation)." L. O'Gorman, "The document spectrum for page layout analysis," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 15, no. 11, pp. 1162-1173, Nov. 1993.
Image credit: PubLayNet: largest dataset ever for document layout analysis
Libraries
Use these libraries to find Document Layout Analysis models and implementationsLatest papers with no code
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models
To address this, we are the first to introduce a robustness benchmark for DLA models, which includes 450K document images of three datasets.
AutoIE: An Automated Framework for Information Extraction from Scientific Literature
In the rapidly evolving field of scientific research, efficiently extracting key information from the burgeoning volume of scientific papers remains a formidable challenge.
U-DIADS-Bib: a full and few-shot pixel-precise dataset for document layout analysis of ancient manuscripts
Document Layout Analysis, which is the task of identifying different semantic regions inside of a document page, is a subject of great interest for both computer scientists and humanities scholars as it represents a fundamental step towards further analysis tasks for the former and a powerful tool to improve and facilitate the study of the documents for the latter.
Object Recognition from Scientific Document based on Compartment Refinement Framework
The lack of a comprehensive definition of the internal structure and elements of the documents indirectly impacts the accuracy of text classification and object recognition tasks.
Bengali Document Layout Analysis -- A YOLOV8 Based Ensembling Approach
This paper focuses on enhancing Bengali Document Layout Analysis (DLA) using the YOLOv8 model and innovative post-processing techniques.
Document Layout Analysis on BaDLAD Dataset: A Comprehensive MViTv2 Based Approach
In the rapidly evolving digital era, the analysis of document layouts plays a pivotal role in automated information extraction and interpretation.
Bengali Document Layout Analysis with Detectron2
Document Layout Analysis (DLA) involves segmenting documents into meaningful units like text boxes, paragraphs, images, and tables.
The YOLO model that still excels in document layout analysis
Document layout analysis can help people better understand and use the information in a document.
Performance Enhancement Leveraging Mask-RCNN on Bengali Document Layout Analysis
We trained a special model called Mask R-CNN to help with this understanding.
Framework and Model Analysis on Bengali Document Layout Analysis Dataset: BaDLAD
We looked at lots of different Bengali documents in our study.