1 code implementation • 15 Apr 2024 • Nian Ran, Bahrul Ilmi Nasution, Claire Little, Richard Allmendinger, Mark Elliot
However, there are unique challenges in tabular data compared to images, eg tabular data may contain both continuous and discrete variables and conditional sampling, and, critically, the data should possess high utility and low disclosure risk (the risk of re-identifying a population unit or learning something new about them), providing an opportunity for multi-objective (MO) optimization.
1 code implementation • 2 Jul 2022 • Claire Little, Mark Elliot, Richard Allmendinger
The paper presents a framework to measure the utility and disclosure risk of synthetic data by comparing it to samples of the original data of varying sample fractions, thereby identifying the sample fraction which has equivalent utility and risk to the synthetic data.
no code implementations • 3 Dec 2021 • Claire Little, Mark Elliot, Richard Allmendinger, Sahel Shariati Samani
Generative Adversarial Networks (GANs) are gaining increasing attention as a means for synthesising data.
no code implementations • 21 Jul 2021 • Muhammad Aslam Jarwar, Adriane Chapman, Mark Elliot, Fatemeh Raji
Based on this use case, we identify how provenance information could be utilized within the ADF framework, and identify a currently un-met requirement which is the modeling of \textit{data environments}.
1 code implementation • NeurIPS 2018 • Sebastian Flennerhag, Hujun Yin, John Keane, Mark Elliot
Standard neural network architectures are non-linear only by virtue of a simple element-wise activation function, making them both brittle and excessively large.