Table Extraction

9 papers with code • 0 benchmarks • 0 datasets

Table extraction involves detecting and recognizing a table's logical structure and content from its unstructured presentation within a document

Most implemented papers

Deep learning for table detection and structure recognition: A survey

abdoelsayed2016/table-detection-structure-recognition 15 Nov 2022

The goals of this survey are to provide a profound comprehension of the major developments in the field of Table Detection, offer insight into the different methodologies, and provide a systematic taxonomy of the different approaches.

PubTables-1M: Towards comprehensive table extraction from unstructured documents

microsoft/table-transformer CVPR 2022

We demonstrate that these improvements lead to a significant increase in training performance and a more reliable estimate of model performance at evaluation for table structure recognition.

GFTE: Graph-based Financial Table Extraction

Irene323/GFTE 17 Mar 2020

Tabular data is a crucial form of information expression, which can organize data in a standard structure for easy information retrieval and comparison.

Flexible Table Recognition and Semantic Interpretation System

mnamysl/table-interpretation 25 May 2021

Moreover, to incorporate the extraction of semantic information, we develop a graph-based table interpretation method.

ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and Dissertations

SampannaKahu/ScanBank 23 Jun 2021

To the best of our knowledge, ScanBank is the first manually annotated dataset for figure and table extraction for scanned ETDs.

DiSCoMaT: Distantly Supervised Composition Extraction from Tables in Materials Science Articles

m3rg-iitd/discomat 3 Jul 2022

A crucial component in the curation of KB for a scientific domain (e. g., materials science, foods & nutrition, fuels) is information extraction from tables in the domain's published research articles.

Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents

ailab-unifi/gnn-tableextraction 23 Aug 2022

Tables are widely used in several types of documents since they can bring important information in a structured way.

CTE: A Dataset for Contextualized Table Extraction

ailab-unifi/cte-dataset 2 Feb 2023

We define the task of Contextualized Table Extraction (CTE), which aims to extract and define the structure of tables considering the textual context of the document.

Schema-Driven Information Extraction from Heterogeneous Tables

bflashcp3f/schema-to-json 23 May 2023

We use this collection of annotated tables to evaluate the ability of open-source and API-based language models to extract information from tables covering diverse domains and data formats.