no code implementations • 18 Jan 2024 • Lincen Yang, Matthijs van Leeuwen
To address these shortcomings, we propose TURS, for Truly Unordered Rule Sets.
1 code implementation • 2 Jul 2023 • Zhong Li, Jiayang Shi, Matthijs van Leeuwen
Event logs are widely used to record the status of high-tech systems, making log anomaly detection important for monitoring those systems.
1 code implementation • 22 Feb 2023 • Zhong Li, Matthijs van Leeuwen
Traditional anomaly detection methods aim to identify objects that deviate from most other objects by treating all features equally.
no code implementations • 13 Oct 2022 • Zhong Li, Yuxuan Zhu, Matthijs van Leeuwen
In the past two decades, most research on anomaly detection has focused on improving the accuracy of the detection, while largely ignoring the explainability of the corresponding methods and thus leaving the explanation of outcomes to practitioners.
no code implementations • 19 Aug 2022 • Zhong Li, Matthijs van Leeuwen
Event logs are widely used for anomaly detection and prediction in complex systems.
1 code implementation • 17 Jun 2022 • Lincen Yang, Matthijs van Leeuwen
Still, existing methods have several shortcomings: 1) most recent methods require a binary feature matrix as input, while learning rules directly from numeric variables is understudied; 2) existing methods impose orders among rules, either explicitly or implicitly, which harms interpretability; and 3) currently no method exists for learning probabilistic rule sets for multi-class target variables (there is only one for probabilistic rule lists).
2 code implementations • 25 Mar 2021 • Hugo Manuel Proença, Peter Grünwald, Thomas Bäck, Matthijs van Leeuwen
This novel model class allows us to formalise the problem of optimal robust subgroup discovery using the Minimum Description Length (MDL) principle, where we resort to optimal Normalised Maximum Likelihood and Bayesian encodings for nominal and numeric targets, respectively.
1 code implementation • 13 Jan 2021 • Alexander Marx, Lincen Yang, Matthijs van Leeuwen
Further, we show that CMI can be consistently estimated for discrete-continuous mixture variables by learning an adaptive histogram model.
Causal Discovery Information Theory Information Theory Applications
3 code implementations • 16 Jun 2020 • Hugo M. Proença, Peter Grünwald, Thomas Bäck, Matthijs van Leeuwen
We propose a dispersion-aware problem formulation for subgroup set discovery that is based on the minimum description length (MDL) principle and subgroup lists.
1 code implementation • 2 Jun 2020 • Lincen Yang, Mitra Baratchi, Matthijs van Leeuwen
As the flexibility of our model class comes at the cost of a vast search space, we introduce a heuristic algorithm, named PALM, which Partitions each dimension ALternately and then Merges neighboring regions, all using the MDL principle.
no code implementations • 21 Nov 2019 • Micky Faas, Matthijs van Leeuwen
We introduce geometric pattern mining, the problem of finding recurring local structure in discrete, geometric matrices.
3 code implementations • 1 May 2019 • Hugo M. Proença, Matthijs van Leeuwen
Interpretable classifiers have recently witnessed an increase in attention from the data mining community because they are inherently easier to understand and explain than their more complex counterparts.
no code implementations • 7 Feb 2017 • Vladimir Dzyuba, Matthijs van Leeuwen
We compare the performance of the proposed algorithm to the state-of-the-art in interactive pattern mining by emulating the interests of a user.
1 code implementation • 1 Nov 2016 • Bas van Stein, Matthijs van Leeuwen, Thomas Bäck
In highly complex and high-dimensional data, however, existing methods are likely to overlook important outliers because they do not explicitly take into account that the data is often a mixture distribution of multiple components.
1 code implementation • 28 Oct 2016 • Vladimir Dzyuba, Matthijs van Leeuwen, Luc De Raedt
Several sampling algorithms have been proposed, but each of them has its limitations when it comes to 1) flexibility in terms of quality measures and constraints that can be used, and/or 2) guarantees with respect to sampling accuracy.
no code implementations • 17 Oct 2016 • Sander van Rijn, Hao Wang, Matthijs van Leeuwen, Thomas Bäck
Various variants of the well known Covariance Matrix Adaptation Evolution Strategy (CMA-ES) have been proposed recently, which improve the empirical performance of the original algorithm by structural modifications.