Search Results for author: Jilles Vreeken

Found 40 papers, 7 papers with code

Learning Exceptional Subgroups by End-to-End Maximizing KL-divergence

no code implementations20 Feb 2024 Sascha Xu, Nils Philipp Walter, Janis Kalofolias, Jilles Vreeken

Finding and describing sub-populations that are exceptional regarding a target property has important applications in many scientific disciplines, from identifying disadvantaged demographic groups in census data to finding conductive molecules within gold nanoparticles.

Succinct Interaction-Aware Explanations

no code implementations8 Feb 2024 Sascha Xu, Joscha Cüppers, Jilles Vreeken

SHAP is a popular approach to explain black-box models by revealing the importance of individual features.

Data is Moody: Discovering Data Modification Rules from Process Event Logs

no code implementations22 Dec 2023 Marco Bjarne Schuster, Boris Wiegand, Jilles Vreeken

Although event logs are a powerful source to gain insight about the behavior of the underlying business process, existing work primarily focuses on finding patterns in the activity sequences of an event log, while ignoring event attribute data.

Attribute Subgroup Discovery

Finding Interpretable Class-Specific Patterns through Efficient Neural Search

no code implementations7 Dec 2023 Nils Philipp Walter, Jonas Fischer, Jilles Vreeken

Discovering patterns in data that best describe the differences between classes allows to hypothesize and reason about class-specific mechanisms.

Understanding and Mitigating Classification Errors Through Interpretable Token Patterns

no code implementations18 Nov 2023 Michael A. Hedderich, Jonas Fischer, Dietrich Klakow, Jilles Vreeken

Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors, but also gives a way to act and improve the classifier.

Classification NER +1

Towards Concept-Aware Large Language Models

1 code implementation3 Nov 2023 Chen Shani, Jilles Vreeken, Dafna Shahaf

Concepts play a pivotal role in various human cognitive functions, including learning, reasoning and communication.

Efficiently Factorizing Boolean Matrices using Proximal Gradient Descent

no code implementations14 Jul 2023 Sebastian Dalleiger, Jilles Vreeken

Addressing the interpretability problem of NMF on Boolean data, Boolean Matrix Factorization (BMF) uses Boolean algebra to decompose the input into low-rank Boolean factor matrices.

Combinatorial Optimization

Preserving local densities in low-dimensional embeddings

no code implementations31 Jan 2023 Jonas Fischer, Rebekka Burkholz, Jilles Vreeken

We show, however, that these methods fail to reconstruct local properties, such as relative differences in densities (Fig.

All the World's a (Hyper)Graph: A Data Drama

2 code implementations16 Jun 2022 Corinna Coupette, Jilles Vreeken, Bastian Rieck

We introduce Hyperbard, a dataset of diverse relational data representations derived from Shakespeare's plays.

Graph Learning Graph Mining

Differentially Describing Groups of Graphs

no code implementations16 Dec 2021 Corinna Coupette, Sebastian Dalleiger, Jilles Vreeken

How does neural connectivity in autistic children differ from neural connectivity in healthy children or autistic youths?

Label-Descriptive Patterns and Their Application to Characterizing Classification Errors

2 code implementations18 Oct 2021 Michael Hedderich, Jonas Fischer, Dietrich Klakow, Jilles Vreeken

Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors, but also gives a way to act and improve the classifier.

Descriptive named-entity-recognition +4

Federated Learning from Small Datasets

1 code implementation7 Oct 2021 Michael Kamp, Jonas Fischer, Jilles Vreeken

Federated learning allows multiple parties to collaboratively train a joint model without sharing local data.

Federated Learning

Picking Daisies in Private: Federated Learning from Small Datasets

no code implementations29 Sep 2021 Michael Kamp, Jonas Fischer, Jilles Vreeken

Federated learning allows multiple parties to collaboratively train a joint model without sharing local data.

Federated Learning

Graph Similarity Description: How Are These Graphs Similar?

no code implementations29 May 2021 Corinna Coupette, Jilles Vreeken

We formalize this problem as a model selection task using the Minimum Description Length principle, capturing the similarity of the input graphs in a common model and the differences between them in transformations to individual models.

Graph Similarity Model Selection

Factoring out prior knowledge from low-dimensional embeddings

no code implementations2 Mar 2021 Edith Heiter, Jonas Fischer, Jilles Vreeken

Low-dimensional embedding techniques such as tSNE and UMAP allow visualizing high-dimensional data and therewith facilitate the discovery of interesting structure.

What's in the Box? Exploring the Inner Life of Neural Networks with Robust Rules

no code implementations1 Jan 2021 Jonas Fischer, Anna Oláh, Jilles Vreeken

In particular, we consider activation values of a network for given data, and propose to mine noise-robust rules of the form $X \rightarrow Y$ , where $X$ and $Y$ are sets of neurons in different layers.

Data-driven equation for drug-membrane permeability across drugs and membranes

no code implementations3 Dec 2020 Arghya Dutta, Jilles Vreeken, Luca M. Ghiringhelli, Tristan Bereau

Beyond the widely recognized correlation with hydrophobicity, we additionally consider the functional relationship between passive permeation and acidity.

Chemical Physics Soft Condensed Matter

Discovering Reliable Causal Rules

no code implementations6 Sep 2020 Kailash Budhathoki, Mario Boley, Jilles Vreeken

Moreover, to discover reliable causal rules from a sample, we propose a conservative and consistent estimator of the causal effect, and derive an efficient and exact algorithm that maximises the estimator.

What is Normal, What is Strange, and What is Missing in a Knowledge Graph: Unified Characterization via Inductive Summarization

1 code implementation23 Mar 2020 Caleb Belth, Xinyi Zheng, Jilles Vreeken, Danai Koutra

We apply our rules to three large KGs (NELL, DBpedia, and Yago), and tasks such as compression, various types of error detection, and identification of incomplete information.

Knowledge Graphs Question Answering

Discovering Reliable Correlations in Categorical Data

1 code implementation30 Aug 2019 Panagiotis Mandros, Mario Boley, Jilles Vreeken

This raises many questions, such as how to reliably and interpretably measure correlation between a multivariate set of attributes, how to do so without having to make assumptions on distribution of the data or the type of correlation, and, how to efficiently discover the top-most reliably correlated attribute sets from data.

Attribute

Summarizing Data Succinctly with the Most Informative Itemsets

no code implementations25 Apr 2019 Michael Mampaey, Jilles Vreeken, Nikolaj Tatti

As we use the Maximum Entropy principle to obtain unbiased probabilistic models, and only include those itemsets that are most informative with regard to the current model, the summaries we construct are guaranteed to be both descriptive and non-redundant.

Descriptive

Testing Conditional Independence on Discrete Data using Stochastic Complexity

no code implementations12 Mar 2019 Alexander Marx, Jilles Vreeken

Testing for conditional independence is a core aspect of constraint-based causal discovery.

Causal Discovery

Comparing Apples and Oranges: Measuring Differences between Exploratory Data Mining Results

no code implementations18 Feb 2019 Nikolaj Tatti, Jilles Vreeken

Our approach provides a means to study and tell differences between results of different exploratory data mining methods.

The Long and the Short of It: Summarising Event Sequences with Serial Episodes

no code implementations7 Feb 2019 Nikolaj Tatti, Jilles Vreeken

An ideal outcome of pattern mining is a small set of informative patterns, containing no redundancy or noise, that identifies the key structure of the data at hand.

We Are Not Your Real Parents: Telling Causal from Confounded using MDL

no code implementations21 Jan 2019 David Kaltenpoth, Jilles Vreeken

We propose to do so using the Minimum Description Length (MDL) principle.

Discovering Reliable Dependencies from Data: Hardness and Improved Algorithms

1 code implementation14 Sep 2018 Panagiotis Mandros, Mario Boley, Jilles Vreeken

The reliable fraction of information is an attractive score for quantifying (functional) dependencies in high-dimensional data.

Causal Discovery by Telling Apart Parents and Children

no code implementations20 Aug 2018 Alexander Marx, Jilles Vreeken

We consider the problem of inferring the directed, causal graph from observational data, assuming no hidden confounders.

Causal Discovery

Telling Cause from Effect using MDL-based Local and Global Regression

no code implementations26 Sep 2017 Alexander Marx, Jilles Vreeken

We infer $X$ causes $Y$ in case it is shorter to describe $Y$ as a function of $X$ than the inverse direction.

regression

Discovering Reliable Approximate Functional Dependencies

no code implementations25 May 2017 Panagiotis Mandros, Mario Boley, Jilles Vreeken

As we want to be agnostic on the form of the dependence, we adopt an information-theoretic approach, and construct a reliable, bias correcting score that can be efficiently computed.

Attribute

Causal Inference by Stochastic Complexity

no code implementations22 Feb 2017 Kailash Budhathoki, Jilles Vreeken

The algorithmic Markov condition states that the most likely causal direction between two random variables X and Y can be identified as that direction with the lowest Kolmogorov complexity.

Causal Inference

Causal Inference on Multivariate and Mixed-Type Data

no code implementations21 Feb 2017 Alexander Marx, Jilles Vreeken

Given data over the joint distribution of two random variables $X$ and $Y$, we consider the problem of inferring the most likely causal direction between $X$ and $Y$.

Causal Inference Vocal Bursts Type Prediction

Efficiently Summarising Event Sequences with Rich Interleaving Patterns

no code implementations27 Jan 2017 Apratim Bhattacharyya, Jilles Vreeken

Discovering the key structure of a database is one of the main goals of data mining.

Identifying Consistent Statements about Numerical Data with Dispersion-Corrected Subgroup Discovery

no code implementations26 Jan 2017 Mario Boley, Bryan R. Goldsmith, Luca M. Ghiringhelli, Jilles Vreeken

Existing algorithms for subgroup discovery with numerical targets do not optimize the error or target variable dispersion of the groups they find.

Subgroup Discovery

Beauty and Brains: Detecting Anomalous Pattern Co-Occurrences

no code implementations22 Dec 2015 Roel Bertens, Jilles Vreeken, Arno Siebes

Our world is filled with both beautiful and brainy people, but how often does a Nobel Prize winner also wins a beauty pageant?

Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns

no code implementations22 Dec 2015 Roel Bertens, Jilles Vreeken, Arno Siebes

We study how to obtain concise descriptions of discrete multivariate sequential data.

Universal Dependency Analysis

no code implementations28 Oct 2015 Hoang-Vu Nguyen, Jilles Vreeken

For practical use, such a measure should be universal in the sense that it captures correlation in subspaces of any dimensionality and allows to meaningfully compare correlation scores across different subspaces, regardless how many dimensions they have and what specific statistical properties their dimensions possess.

Canonical Divergence Analysis

no code implementations28 Oct 2015 Hoang-Vu Nguyen, Jilles Vreeken

We aim to analyze the relation between two random vectors that may potentially have both different number of attributes as well as realizations, and which may even not have a joint distribution.

Linear-time Detection of Non-linear Changes in Massively High Dimensional Time Series

no code implementations28 Oct 2015 Hoang-Vu Nguyen, Jilles Vreeken

Change detection in multivariate time series has applications in many domains, including health care and network monitoring.

Change Detection Time Series +1

Flexibly Mining Better Subgroups

no code implementations28 Oct 2015 Hoang-Vu Nguyen, Jilles Vreeken

For nominal attributes, this is relatively straightforward, as we can consider individual attribute values as binary features.

Attribute Subgroup Discovery

Cannot find the paper you are looking for? You can Submit a new open access paper.