no code implementations • 24 Oct 2023 • Lauri Seppäläinen, Anton Björklund, Vitus Besel, Kai Puolamäki
Explainable artificial intelligence is used to investigate the decision processes of black box machine learning models and complex simulators.
1 code implementation • 12 Jan 2022 • Anton Björklund, Jarmo Mäkelä, Kai Puolamäki
Existing methods for explaining black box learning models often focus on building local explanations of model behaviour for a particular data item.
1 code implementation • 1 Jul 2021 • Laila Melkas, Rafael Savvides, Suyog Chandramouli, Jarmo Mäkelä, Tuomo Nieminen, Ivan Mammarella, Kai Puolamäki
We present a workflow that is required to take this knowledge into account and to apply CSD algorithms in Earth system sciences.
no code implementations • 16 Jun 2020 • Sami Hanhijärvi, Markus Ojala, Niko Vuokko, Kai Puolamäki, Nikolaj Tatti, Heikki Mannila
At each step in the data mining process, the randomization produces random samples from the set of data matrices satisfying the already discovered patterns or models.
no code implementations • 13 Dec 2019 • Francesco Concas, Julien Mineraud, Eemil Lagerspetz, Samu Varjonen, Xiaoli Liu, Kai Puolamäki, Petteri Nurmi, Sasu Tarkoma
Periodic re-calibration can improve the accuracy of low-cost sensors, particularly with machine-learning-based calibration, which has shown great promise due to its capability to calibrate sensors in-field.
no code implementations • 9 Oct 2019 • Henri Tiittanen, Emilia Oikarinen, Andreas Henelius, Kai Puolamäki
Regression analysis is a standard supervised machine learning method used to model an outcome variable in terms of a set of predictor variables.
1 code implementation • 7 May 2019 • Kai Puolamäki, Emilia Oikarinen, Andreas Henelius
This paper proposes a principled framework for interactive visual exploration of relations in data, through views most informative given the user's current knowledge and objectives.
1 code implementation • 20 May 2018 • Kai Puolamäki, Emilia Oikarinen, Buse Atli, Andreas Henelius
An explorative data analysis system should be aware of what the user already knows and what the user wants to know of the data: otherwise the system cannot provide the user with the most informative and useful views of the data.
no code implementations • 9 Apr 2018 • Andreas Henelius, Emilia Oikarinen, Kai Puolamäki
This framework allows the user to incorporate existing knowledge into the exploration process, focus on exploring a subset of the data, and compare different complex hypotheses concerning relations in the data.
1 code implementation • 23 Oct 2017 • Kai Puolamäki, Emilia Oikarinen, Bo Kang, Jefrey Lijffijt, Tijl De Bie
We conclude that the information theoretic approach to exploratory data analysis where patterns observed by a user are formalized as constraints provides a principled, intuitive, and efficient basis for constructing an EDA system.
no code implementations • 12 Oct 2017 • Jefrey Lijffijt, Bo Kang, Wouter Duivesteijn, Kai Puolamäki, Emilia Oikarinen, Tijl De Bie
The subgroup descriptions are in terms of a succinct set of arbitrarily-typed other attributes.
no code implementations • 24 Jul 2017 • Andreas Henelius, Kai Puolamäki, Antti Ukkonen
In this work we present the novel ASTRID method for investigating which attribute interactions classifiers exploit when making predictions.
no code implementations • 20 Jan 2017 • Jussi Korpela, Emilia Oikarinen, Kai Puolamäki, Antti Ukkonen
In this paper we define confidence intervals for multivariate data that extend the one-dimensional definition in a natural way.
1 code implementation • 27 Dec 2016 • Andreas Henelius, Kai Puolamäki, Henrik Boström, Panagiotis Papapetrou
In this study, we propose a technique for quantifying the instability of a clustering solution and for finding robust clusters, termed core clusters, which correspond to clusters where the co-occurrence probability of each data item within a cluster is at least $1 - \alpha$.
1 code implementation • 22 Dec 2016 • Andreas Henelius, Antti Ukkonen, Kai Puolamäki
In many data exploration tasks it is meaningful to identify groups of attribute interactions that are specific to a variable of interest.