GitTables

Introduced by Hulsebos et al. in GitTables: A Large-Scale Corpus of Relational Tables

GitTables is a corpus of relational tables extracted from CSV files on GitHub; it currently contains 1.7M tables. Table columns in GitTables are annotated with more than 2K distinct semantic types from Schema.org and DBpedia. The column annotations comprise semantic types, hierarchical relations, range types, and descriptions.
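
To make the shape of a column annotation concrete, the following is a minimal Python sketch of how the four annotation components might be represented and attached to the columns of a CSV table. The class name, field names, lookup table, and sample values are all assumptions made for illustration; they are not the GitTables file schema, and the exact-name lookup is a simplification rather than the annotation method used by the authors.

```python
import csv
import io
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ColumnAnnotation:
    """Hypothetical record for one annotated column (field names are assumptions)."""
    column: str                 # column name as it appears in the CSV header
    semantic_type: str          # semantic type label
    ontology: str               # "Schema.org" or "DBpedia"
    parent_type: Optional[str]  # hierarchical relation, if any
    range_type: Optional[str]   # expected value type, if any
    description: str            # free-text description of the type


def read_header(csv_text: str) -> List[str]:
    """Return the column names from the first row of a CSV string."""
    reader = csv.reader(io.StringIO(csv_text))
    return next(reader)


def annotate(header: List[str],
             lookup: Dict[str, ColumnAnnotation]) -> Dict[str, ColumnAnnotation]:
    """Attach an annotation to each column whose name appears in the lookup."""
    return {col: lookup[col] for col in header if col in lookup}


# Illustrative values only; not copied from the GitTables release.
LOOKUP = {
    "birth_date": ColumnAnnotation(
        column="birth_date",
        semantic_type="birthDate",
        ontology="Schema.org",
        parent_type=None,
        range_type="Date",
        description="Date of birth.",
    ),
}

SAMPLE_CSV = "id,birth_date\n1,1990-04-12\n"
annotations = annotate(read_header(SAMPLE_CSV), LOOKUP)
```

In this sketch, columns without a matching entry (here `id`) are simply left unannotated.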
