Search Results for author: Jilles Vreeken

Found 40 papers, 7 papers with code

Learning Exceptional Subgroups by End-to-End Maximizing KL-divergence

no code implementations • 20 Feb 2024 • Sascha Xu, Nils Philipp Walter, Janis Kalofolias, Jilles Vreeken

Finding and describing sub-populations that are exceptional regarding a target property has important applications in many scientific disciplines, from identifying disadvantaged demographic groups in census data to finding conductive molecules within gold nanoparticles.

Paper
Add Code

Succinct Interaction-Aware Explanations

no code implementations • 8 Feb 2024 • Sascha Xu, Joscha Cüppers, Jilles Vreeken

SHAP is a popular approach to explain black-box models by revealing the importance of individual features.

Paper
Add Code

Data is Moody: Discovering Data Modification Rules from Process Event Logs

no code implementations • 22 Dec 2023 • Marco Bjarne Schuster, Boris Wiegand, Jilles Vreeken

Although event logs are a powerful source to gain insight about the behavior of the underlying business process, existing work primarily focuses on finding patterns in the activity sequences of an event log, while ignoring event attribute data.

Attribute Subgroup Discovery

Paper
Add Code

Finding Interpretable Class-Specific Patterns through Efficient Neural Search

no code implementations • 7 Dec 2023 • Nils Philipp Walter, Jonas Fischer, Jilles Vreeken

Discovering patterns in data that best describe the differences between classes allows to hypothesize and reason about class-specific mechanisms.

Paper
Add Code

Understanding and Mitigating Classification Errors Through Interpretable Token Patterns

no code implementations • 18 Nov 2023 • Michael A. Hedderich, Jonas Fischer, Dietrich Klakow, Jilles Vreeken

Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors, but also gives a way to act and improve the classifier.

Classification NER +1

Paper
Add Code

Towards Concept-Aware Large Language Models

1 code implementation • 3 Nov 2023 • Chen Shani, Jilles Vreeken, Dafna Shahaf

Concepts play a pivotal role in various human cognitive functions, including learning, reasoning and communication.

Paper
Code

Efficiently Factorizing Boolean Matrices using Proximal Gradient Descent

no code implementations • 14 Jul 2023 • Sebastian Dalleiger, Jilles Vreeken

Addressing the interpretability problem of NMF on Boolean data, Boolean Matrix Factorization (BMF) uses Boolean algebra to decompose the input into low-rank Boolean factor matrices.

Combinatorial Optimization

Paper
Add Code

Preserving local densities in low-dimensional embeddings

no code implementations • 31 Jan 2023 • Jonas Fischer, Rebekka Burkholz, Jilles Vreeken

We show, however, that these methods fail to reconstruct local properties, such as relative differences in densities (Fig.

Paper
Add Code

All the World's a (Hyper)Graph: A Data Drama

2 code implementations • 16 Jun 2022 • Corinna Coupette, Jilles Vreeken, Bastian Rieck

We introduce Hyperbard, a dataset of diverse relational data representations derived from Shakespeare's plays.

Graph Learning Graph Mining

Paper
Code

Differentially Describing Groups of Graphs

no code implementations • 16 Dec 2021 • Corinna Coupette, Sebastian Dalleiger, Jilles Vreeken

How does neural connectivity in autistic children differ from neural connectivity in healthy children or autistic youths?

Paper
Add Code

Label-Descriptive Patterns and Their Application to Characterizing Classification Errors

2 code implementations • 18 Oct 2021 • Michael Hedderich, Jonas Fischer, Dietrich Klakow, Jilles Vreeken

Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors, but also gives a way to act and improve the classifier.

Descriptive named-entity-recognition +4

Paper
Code

Federated Learning from Small Datasets

1 code implementation • 7 Oct 2021 • Michael Kamp, Jonas Fischer, Jilles Vreeken

Federated learning allows multiple parties to collaboratively train a joint model without sharing local data.

Federated Learning

Paper
Code

Picking Daisies in Private: Federated Learning from Small Datasets

no code implementations • 29 Sep 2021 • Michael Kamp, Jonas Fischer, Jilles Vreeken

Federated learning allows multiple parties to collaboratively train a joint model without sharing local data.

Federated Learning

Paper
Add Code

Graph Similarity Description: How Are These Graphs Similar?

no code implementations • 29 May 2021 • Corinna Coupette, Jilles Vreeken

We formalize this problem as a model selection task using the Minimum Description Length principle, capturing the similarity of the input graphs in a common model and the differences between them in transformations to individual models.

Graph Similarity Model Selection

Paper
Add Code

Factoring out prior knowledge from low-dimensional embeddings

no code implementations • 2 Mar 2021 • Edith Heiter, Jonas Fischer, Jilles Vreeken

Low-dimensional embedding techniques such as tSNE and UMAP allow visualizing high-dimensional data and therewith facilitate the discovery of interesting structure.

Paper
Add Code

What's in the Box? Exploring the Inner Life of Neural Networks with Robust Rules

no code implementations • 1 Jan 2021 • Jonas Fischer, Anna Oláh, Jilles Vreeken

In particular, we consider activation values of a network for given data, and propose to mine noise-robust rules of the form $X \rightarrow Y$ , where $X$ and $Y$ are sets of neurons in different layers.

Paper
Add Code

Data-driven equation for drug-membrane permeability across drugs and membranes

no code implementations • 3 Dec 2020 • Arghya Dutta, Jilles Vreeken, Luca M. Ghiringhelli, Tristan Bereau

Beyond the widely recognized correlation with hydrophobicity, we additionally consider the functional relationship between passive permeation and acidity.

Chemical Physics Soft Condensed Matter

Paper
Add Code

Discovering Reliable Causal Rules

no code implementations • 6 Sep 2020 • Kailash Budhathoki, Mario Boley, Jilles Vreeken

Moreover, to discover reliable causal rules from a sample, we propose a conservative and consistent estimator of the causal effect, and derive an efficient and exact algorithm that maximises the estimator.

Paper
Add Code

What is Normal, What is Strange, and What is Missing in a Knowledge Graph: Unified Characterization via Inductive Summarization

1 code implementation • 23 Mar 2020 • Caleb Belth, Xinyi Zheng, Jilles Vreeken, Danai Koutra

We apply our rules to three large KGs (NELL, DBpedia, and Yago), and tasks such as compression, various types of error detection, and identification of incomplete information.

Knowledge Graphs Question Answering

Paper
Code

Discovering Reliable Correlations in Categorical Data

1 code implementation • 30 Aug 2019 • Panagiotis Mandros, Mario Boley, Jilles Vreeken

This raises many questions, such as how to reliably and interpretably measure correlation between a multivariate set of attributes, how to do so without having to make assumptions on distribution of the data or the type of correlation, and, how to efficiently discover the top-most reliably correlated attribute sets from data.

Attribute

Paper
Code

Summarizing Data Succinctly with the Most Informative Itemsets

no code implementations • 25 Apr 2019 • Michael Mampaey, Jilles Vreeken, Nikolaj Tatti

As we use the Maximum Entropy principle to obtain unbiased probabilistic models, and only include those itemsets that are most informative with regard to the current model, the summaries we construct are guaranteed to be both descriptive and non-redundant.

Descriptive

Paper
Add Code

Testing Conditional Independence on Discrete Data using Stochastic Complexity

no code implementations • 12 Mar 2019 • Alexander Marx, Jilles Vreeken

Testing for conditional independence is a core aspect of constraint-based causal discovery.

Causal Discovery

Paper
Add Code

Comparing Apples and Oranges: Measuring Differences between Exploratory Data Mining Results

no code implementations • 18 Feb 2019 • Nikolaj Tatti, Jilles Vreeken

Our approach provides a means to study and tell differences between results of different exploratory data mining methods.

Paper
Add Code

The Long and the Short of It: Summarising Event Sequences with Serial Episodes

no code implementations • 7 Feb 2019 • Nikolaj Tatti, Jilles Vreeken

An ideal outcome of pattern mining is a small set of informative patterns, containing no redundancy or noise, that identifies the key structure of the data at hand.

Paper
Add Code

We Are Not Your Real Parents: Telling Causal from Confounded using MDL

no code implementations • 21 Jan 2019 • David Kaltenpoth, Jilles Vreeken

We propose to do so using the Minimum Description Length (MDL) principle.

Paper
Add Code

Discovering Reliable Dependencies from Data: Hardness and Improved Algorithms

1 code implementation • 14 Sep 2018 • Panagiotis Mandros, Mario Boley, Jilles Vreeken

The reliable fraction of information is an attractive score for quantifying (functional) dependencies in high-dimensional data.

Paper
Code

Causal Discovery by Telling Apart Parents and Children

no code implementations • 20 Aug 2018 • Alexander Marx, Jilles Vreeken

We consider the problem of inferring the directed, causal graph from observational data, assuming no hidden confounders.

Causal Discovery

Paper
Add Code

Telling Cause from Effect using MDL-based Local and Global Regression

no code implementations • 26 Sep 2017 • Alexander Marx, Jilles Vreeken

We infer $X$ causes $Y$ in case it is shorter to describe $Y$ as a function of $X$ than the inverse direction.

regression

Paper
Add Code

Efficiently Discovering Locally Exceptional yet Globally Representative Subgroups

no code implementations • 22 Sep 2017 • Janis Kalofolias, Mario Boley, Jilles Vreeken

That is, these sub-populations are exceptional with regard to the global distribution.

Subgroup Discovery

Paper
Add Code

Discovering Reliable Approximate Functional Dependencies

no code implementations • 25 May 2017 • Panagiotis Mandros, Mario Boley, Jilles Vreeken

As we want to be agnostic on the form of the dependence, we adopt an information-theoretic approach, and construct a reliable, bias correcting score that can be efficiently computed.

Attribute

Paper
Add Code

Causal Inference by Stochastic Complexity

no code implementations • 22 Feb 2017 • Kailash Budhathoki, Jilles Vreeken

The algorithmic Markov condition states that the most likely causal direction between two random variables X and Y can be identified as that direction with the lowest Kolmogorov complexity.

Causal Inference

Paper
Add Code

Causal Inference on Multivariate and Mixed-Type Data

no code implementations • 21 Feb 2017 • Alexander Marx, Jilles Vreeken

Given data over the joint distribution of two random variables $X$ and $Y$, we consider the problem of inferring the most likely causal direction between $X$ and $Y$.

Causal Inference Vocal Bursts Type Prediction

Paper
Add Code

Efficiently Summarising Event Sequences with Rich Interleaving Patterns

no code implementations • 27 Jan 2017 • Apratim Bhattacharyya, Jilles Vreeken

Discovering the key structure of a database is one of the main goals of data mining.

Paper
Add Code

Identifying Consistent Statements about Numerical Data with Dispersion-Corrected Subgroup Discovery

no code implementations • 26 Jan 2017 • Mario Boley, Bryan R. Goldsmith, Luca M. Ghiringhelli, Jilles Vreeken

Existing algorithms for subgroup discovery with numerical targets do not optimize the error or target variable dispersion of the groups they find.

Subgroup Discovery

Paper
Add Code

Beauty and Brains: Detecting Anomalous Pattern Co-Occurrences

no code implementations • 22 Dec 2015 • Roel Bertens, Jilles Vreeken, Arno Siebes

Our world is filled with both beautiful and brainy people, but how often does a Nobel Prize winner also wins a beauty pageant?

Paper
Add Code

Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns

no code implementations • 22 Dec 2015 • Roel Bertens, Jilles Vreeken, Arno Siebes

We study how to obtain concise descriptions of discrete multivariate sequential data.

Paper
Add Code

Universal Dependency Analysis

no code implementations • 28 Oct 2015 • Hoang-Vu Nguyen, Jilles Vreeken

For practical use, such a measure should be universal in the sense that it captures correlation in subspaces of any dimensionality and allows to meaningfully compare correlation scores across different subspaces, regardless how many dimensions they have and what specific statistical properties their dimensions possess.

Paper
Add Code

Canonical Divergence Analysis

no code implementations • 28 Oct 2015 • Hoang-Vu Nguyen, Jilles Vreeken

We aim to analyze the relation between two random vectors that may potentially have both different number of attributes as well as realizations, and which may even not have a joint distribution.

Paper
Add Code

Linear-time Detection of Non-linear Changes in Massively High Dimensional Time Series

no code implementations • 28 Oct 2015 • Hoang-Vu Nguyen, Jilles Vreeken

Change detection in multivariate time series has applications in many domains, including health care and network monitoring.

Change Detection Time Series +1

Paper
Add Code

Flexibly Mining Better Subgroups

no code implementations • 28 Oct 2015 • Hoang-Vu Nguyen, Jilles Vreeken

For nominal attributes, this is relatively straightforward, as we can consider individual attribute values as binary features.

Attribute Subgroup Discovery

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.