no code implementations • 20 Feb 2024 • Sascha Xu, Nils Philipp Walter, Janis Kalofolias, Jilles Vreeken
Finding and describing sub-populations that are exceptional regarding a target property has important applications in many scientific disciplines, from identifying disadvantaged demographic groups in census data to finding conductive molecules within gold nanoparticles.
no code implementations • 8 Feb 2024 • Sascha Xu, Joscha Cüppers, Jilles Vreeken
SHAP is a popular approach to explain black-box models by revealing the importance of individual features.
no code implementations • 22 Dec 2023 • Marco Bjarne Schuster, Boris Wiegand, Jilles Vreeken
Although event logs are a powerful source to gain insight about the behavior of the underlying business process, existing work primarily focuses on finding patterns in the activity sequences of an event log, while ignoring event attribute data.
no code implementations • 7 Dec 2023 • Nils Philipp Walter, Jonas Fischer, Jilles Vreeken
Discovering patterns in data that best describe the differences between classes allows to hypothesize and reason about class-specific mechanisms.
no code implementations • 18 Nov 2023 • Michael A. Hedderich, Jonas Fischer, Dietrich Klakow, Jilles Vreeken
Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors, but also gives a way to act and improve the classifier.
1 code implementation • 3 Nov 2023 • Chen Shani, Jilles Vreeken, Dafna Shahaf
Concepts play a pivotal role in various human cognitive functions, including learning, reasoning and communication.
no code implementations • 14 Jul 2023 • Sebastian Dalleiger, Jilles Vreeken
Addressing the interpretability problem of NMF on Boolean data, Boolean Matrix Factorization (BMF) uses Boolean algebra to decompose the input into low-rank Boolean factor matrices.
no code implementations • 31 Jan 2023 • Jonas Fischer, Rebekka Burkholz, Jilles Vreeken
We show, however, that these methods fail to reconstruct local properties, such as relative differences in densities (Fig.
2 code implementations • 16 Jun 2022 • Corinna Coupette, Jilles Vreeken, Bastian Rieck
We introduce Hyperbard, a dataset of diverse relational data representations derived from Shakespeare's plays.
no code implementations • 16 Dec 2021 • Corinna Coupette, Sebastian Dalleiger, Jilles Vreeken
How does neural connectivity in autistic children differ from neural connectivity in healthy children or autistic youths?
2 code implementations • 18 Oct 2021 • Michael Hedderich, Jonas Fischer, Dietrich Klakow, Jilles Vreeken
Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors, but also gives a way to act and improve the classifier.
1 code implementation • 7 Oct 2021 • Michael Kamp, Jonas Fischer, Jilles Vreeken
Federated learning allows multiple parties to collaboratively train a joint model without sharing local data.
no code implementations • 29 Sep 2021 • Michael Kamp, Jonas Fischer, Jilles Vreeken
Federated learning allows multiple parties to collaboratively train a joint model without sharing local data.
no code implementations • 29 May 2021 • Corinna Coupette, Jilles Vreeken
We formalize this problem as a model selection task using the Minimum Description Length principle, capturing the similarity of the input graphs in a common model and the differences between them in transformations to individual models.
no code implementations • 2 Mar 2021 • Edith Heiter, Jonas Fischer, Jilles Vreeken
Low-dimensional embedding techniques such as tSNE and UMAP allow visualizing high-dimensional data and therewith facilitate the discovery of interesting structure.
no code implementations • 1 Jan 2021 • Jonas Fischer, Anna Oláh, Jilles Vreeken
In particular, we consider activation values of a network for given data, and propose to mine noise-robust rules of the form $X \rightarrow Y$ , where $X$ and $Y$ are sets of neurons in different layers.
no code implementations • 3 Dec 2020 • Arghya Dutta, Jilles Vreeken, Luca M. Ghiringhelli, Tristan Bereau
Beyond the widely recognized correlation with hydrophobicity, we additionally consider the functional relationship between passive permeation and acidity.
Chemical Physics Soft Condensed Matter
no code implementations • 6 Sep 2020 • Kailash Budhathoki, Mario Boley, Jilles Vreeken
Moreover, to discover reliable causal rules from a sample, we propose a conservative and consistent estimator of the causal effect, and derive an efficient and exact algorithm that maximises the estimator.
1 code implementation • 23 Mar 2020 • Caleb Belth, Xinyi Zheng, Jilles Vreeken, Danai Koutra
We apply our rules to three large KGs (NELL, DBpedia, and Yago), and tasks such as compression, various types of error detection, and identification of incomplete information.
1 code implementation • 30 Aug 2019 • Panagiotis Mandros, Mario Boley, Jilles Vreeken
This raises many questions, such as how to reliably and interpretably measure correlation between a multivariate set of attributes, how to do so without having to make assumptions on distribution of the data or the type of correlation, and, how to efficiently discover the top-most reliably correlated attribute sets from data.
no code implementations • 25 Apr 2019 • Michael Mampaey, Jilles Vreeken, Nikolaj Tatti
As we use the Maximum Entropy principle to obtain unbiased probabilistic models, and only include those itemsets that are most informative with regard to the current model, the summaries we construct are guaranteed to be both descriptive and non-redundant.
no code implementations • 12 Mar 2019 • Alexander Marx, Jilles Vreeken
Testing for conditional independence is a core aspect of constraint-based causal discovery.
no code implementations • 18 Feb 2019 • Nikolaj Tatti, Jilles Vreeken
Our approach provides a means to study and tell differences between results of different exploratory data mining methods.
no code implementations • 7 Feb 2019 • Nikolaj Tatti, Jilles Vreeken
An ideal outcome of pattern mining is a small set of informative patterns, containing no redundancy or noise, that identifies the key structure of the data at hand.
no code implementations • 21 Jan 2019 • David Kaltenpoth, Jilles Vreeken
We propose to do so using the Minimum Description Length (MDL) principle.
1 code implementation • 14 Sep 2018 • Panagiotis Mandros, Mario Boley, Jilles Vreeken
The reliable fraction of information is an attractive score for quantifying (functional) dependencies in high-dimensional data.
no code implementations • 20 Aug 2018 • Alexander Marx, Jilles Vreeken
We consider the problem of inferring the directed, causal graph from observational data, assuming no hidden confounders.
no code implementations • 26 Sep 2017 • Alexander Marx, Jilles Vreeken
We infer $X$ causes $Y$ in case it is shorter to describe $Y$ as a function of $X$ than the inverse direction.
no code implementations • 22 Sep 2017 • Janis Kalofolias, Mario Boley, Jilles Vreeken
That is, these sub-populations are exceptional with regard to the global distribution.
no code implementations • 25 May 2017 • Panagiotis Mandros, Mario Boley, Jilles Vreeken
As we want to be agnostic on the form of the dependence, we adopt an information-theoretic approach, and construct a reliable, bias correcting score that can be efficiently computed.
no code implementations • 22 Feb 2017 • Kailash Budhathoki, Jilles Vreeken
The algorithmic Markov condition states that the most likely causal direction between two random variables X and Y can be identified as that direction with the lowest Kolmogorov complexity.
no code implementations • 21 Feb 2017 • Alexander Marx, Jilles Vreeken
Given data over the joint distribution of two random variables $X$ and $Y$, we consider the problem of inferring the most likely causal direction between $X$ and $Y$.
no code implementations • 27 Jan 2017 • Apratim Bhattacharyya, Jilles Vreeken
Discovering the key structure of a database is one of the main goals of data mining.
no code implementations • 26 Jan 2017 • Mario Boley, Bryan R. Goldsmith, Luca M. Ghiringhelli, Jilles Vreeken
Existing algorithms for subgroup discovery with numerical targets do not optimize the error or target variable dispersion of the groups they find.
no code implementations • 22 Dec 2015 • Roel Bertens, Jilles Vreeken, Arno Siebes
Our world is filled with both beautiful and brainy people, but how often does a Nobel Prize winner also wins a beauty pageant?
no code implementations • 22 Dec 2015 • Roel Bertens, Jilles Vreeken, Arno Siebes
We study how to obtain concise descriptions of discrete multivariate sequential data.
no code implementations • 28 Oct 2015 • Hoang-Vu Nguyen, Jilles Vreeken
For practical use, such a measure should be universal in the sense that it captures correlation in subspaces of any dimensionality and allows to meaningfully compare correlation scores across different subspaces, regardless how many dimensions they have and what specific statistical properties their dimensions possess.
no code implementations • 28 Oct 2015 • Hoang-Vu Nguyen, Jilles Vreeken
We aim to analyze the relation between two random vectors that may potentially have both different number of attributes as well as realizations, and which may even not have a joint distribution.
no code implementations • 28 Oct 2015 • Hoang-Vu Nguyen, Jilles Vreeken
Change detection in multivariate time series has applications in many domains, including health care and network monitoring.
no code implementations • 28 Oct 2015 • Hoang-Vu Nguyen, Jilles Vreeken
For nominal attributes, this is relatively straightforward, as we can consider individual attribute values as binary features.