Search Results for author: Peter Christen

Found 8 papers, 2 papers with code

A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based Matching Algorithms

1 code implementation • 3 Jul 2023 • George Papadakis, Nishadi Kirielle, Peter Christen, Themis Palpanas

Entity resolution (ER) is the process of identifying records that refer to the same entities within one or across multiple databases.

Entity Resolution

Privacy in Practice: Private COVID-19 Detection in X-Ray Images (Extended Version)

1 code implementation • 21 Nov 2022 • Lucas Lange, Maja Schneider, Peter Christen, Erhard Rahm

The introduced differential privacy (DP) should help limit leakage threats posed by membership inference attacks (MIAs), and our practical analysis is the first to test this hypothesis on the COVID-19 classification task.

Knowledge Distillation, Membership Inference Attack

Big Data is not the New Oil: Common Misconceptions about Population Data

no code implementations • 20 Dec 2021 • Peter Christen, Rainer Schnell

Remarkably many of these misconceptions are due to the social nature of data collections and are therefore missed by purely technical accounts of data processing.

Decision Making, Misconceptions +1

Large Scale Record Linkage in the Presence of Missing Data

no code implementations • 19 Apr 2021 • Thilina Ranbaduge, Peter Christen, Rainer Schnell

We evaluate the linkage quality and scalability of our approach using large real-world databases, showing that it can achieve high linkage quality even when the databases being linked contain substantial amounts of missing values and errors.

Attribute, Data Integration +1

F*: An Interpretable Transformation of the F-measure

no code implementations • 31 Jul 2020 • David J. Hand, Peter Christen, Nishadi Kirielle

The F-measure, also known as the F1-score, is widely used to assess the performance of classification algorithms.

Temporal graph-based clustering for historical record linkage

no code implementations • 6 Jul 2018 • Charini Nanayakkara, Peter Christen, Thilina Ranbaduge

Research in the social sciences is increasingly based on large and complex data collections, where individual data sets from different domains are linked and integrated to allow advanced analytics.

Clustering

A Decision Tree Approach to Predicting Recidivism in Domestic Violence

no code implementations • 27 Mar 2018 • Senuri Wijenayake, Timothy Graham, Peter Christen

Previous work on domestic violence (DV) recidivism has employed different classification techniques, including decision tree (DT) induction and logistic regression, where the main focus was on achieving high prediction accuracy.

Decision Making, Feature Selection +1

Application of Advanced Record Linkage Techniques for Complex Population Reconstruction

no code implementations • 13 Dec 2016 • Peter Christen

Record linkage is the process of identifying records that refer to the same entities from several databases.
