Tabular Data Generation

29 papers with code • 6 benchmarks • 6 datasets

Generation of the tabular data using generative models

Libraries

Use these libraries to find Tabular Data Generation models and implementations
2 papers
15

Most implemented papers

Modeling Tabular data using Conditional GAN

DAI-Lab/CTGAN NeurIPS 2019

Tabular data usually contains a mix of discrete and continuous columns.

Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees

atong01/conditional-flow-matching 18 Sep 2023

Through empirical evaluation across the benchmark, we demonstrate that our approach outperforms deep-learning generation methods in data generation tasks and remains competitive in data imputation.

Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation

feedzai/bank-account-fraud 24 Nov 2022

The suite was generated by applying state-of-the-art tabular data generation techniques on an anonymized, real-world bank account opening fraud detection dataset.

Reimagining Synthetic Tabular Data Generation through Data-Centric AI: A Comprehensive Benchmark

hlasse/data-centric-synthetic-data NeurIPS 2023

In an empirical study, we evaluate the performance of five state-of-the-art models for tabular data generation on eleven distinct tabular datasets.

Tabular GANs for uneven distribution

Diyago/GAN-for-tabular-data 1 Oct 2020

GANs are well known for success in the realistic image generation.

TabFairGAN: Fair Tabular Data Generation with Generative Adversarial Networks

amirarsalan90/TabFairGAN 2 Sep 2021

In the unconstrained case, i. e. when the model is only trained in the first phase and is only meant to generate accurate data following the same joint probability distribution of the real data, the results show that the model beats state-of-the-art GANs proposed in the literature to produce synthetic tabular data.

DATGAN: Integrating expert knowledge into deep learning for synthetic tabular data

glederrey/datgan 7 Mar 2022

We show that the best versions of the DATGAN outperform state-of-the-art generative models on multiple case studies.

ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasets

kristian10007/convgen 20 Jun 2022

Moreover, we discuss how our model can be used for synthetic tabular data generation in general, even outside the scope of data imbalance and thus, improves the overall applicability of convex space learning.

TabSynDex: A Universal Metric for Robust Evaluation of Synthetic Tabular Data

vikram2000b/tabsyndex 12 Jul 2022

We present several baseline models for comparative analysis of the proposed evaluation metric with existing generative models.

Language Models are Realistic Tabular Data Generators

kathrinse/be_great 12 Oct 2022

Tabular data is among the oldest and most ubiquitous forms of data.