Explaining Groups of Points in Low-Dimensional Representations

ICML 2020 · Gregory Plumb, Jonathan Terhorst, Sriram Sankararaman, Ameet Talwalkar

A common workflow in data exploration is to learn a low-dimensional representation of the data, identify groups of points in that representation, and examine the differences between the groups to determine what they represent. We treat this workflow as an interpretable machine learning problem by leveraging the model that learned the low-dimensional representation to help identify the key differences between the groups. To solve this problem, we introduce a new type of explanation, a Global Counterfactual Explanation (GCE), and our algorithm, Transitive Global Translations (TGT), for computing GCEs. TGT identifies the differences between each pair of groups using compressed sensing but constrains those pairwise differences to be consistent among all of the groups. Empirically, we demonstrate that TGT is able to identify explanations that accurately explain the model while being relatively sparse, and that these explanations match real patterns in the data.
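The consistency constraint described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes, for illustration only, that each group gets one offset vector relative to a reference group (group 0, offset zero) and that the translation from group i to group j is the difference of their offsets, which makes pairwise translations compose consistently by construction. The function names, the example offsets, and the top-k truncation used to show sparsity are all hypothetical.

```python
import numpy as np

def pairwise_translation(deltas, i, j):
    """Translation from group i to group j, built from per-group offsets.

    Assumption: deltas[g] is the offset of group g relative to a reference
    group (group 0), so deltas[0] is all zeros.
    """
    return deltas[j] - deltas[i]

def truncate_to_k(delta, k):
    """Keep the k largest-magnitude entries of delta and zero the rest."""
    out = np.zeros_like(delta)
    if k > 0:
        keep = np.argsort(np.abs(delta))[-k:]
        out[keep] = delta[keep]
    return out

# Hypothetical offsets for 3 groups over 4 input features.
deltas = np.array([
    [0.0, 0.0, 0.0, 0.0],   # reference group
    [2.0, 0.0, -0.1, 0.0],
    [0.0, -3.0, 0.0, 0.5],
])

d01 = pairwise_translation(deltas, 0, 1)
d12 = pairwise_translation(deltas, 1, 2)
d02 = pairwise_translation(deltas, 0, 2)

# Consistency: going 0 -> 1 -> 2 equals going 0 -> 2 directly.
consistent = np.allclose(d01 + d12, d02)

# A sparser version of one offset that touches a single feature.
sparse_offset = truncate_to_k(deltas[2], k=1)
```

Under this parameterization only one vector per group is stored, rather than one per pair of groups, and the reverse translation is simply the negation of the forward one.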

Code

GDPlumb/ELDR (official)

elfrink1/FACT

damiaanr/fact-ai

Tasks

counterfactual

Counterfactual Explanation

Interpretable Machine Learning

Reproducibility Reports

Jan 31 2021

[Re] Explaining Groups of Points in Low-Dimensional Representations

RC 2020 · Rajeev Verma, Jim Wagemans, Paras Dahal, Auke Elfrink

We reproduced the original results using the authors' code and obtained mostly similar results. TGT generally outperforms DBM, especially when explanations use few features. TGT attributes cluster differences to a consistent set of features across different sparsity levels, and its explanations match real patterns in the data. When we extended the types of functions used for explanations, performance did not improve significantly, suggesting that translations make for adequate explanations. However, the scaling extension shows promise at recovering the original signal on the modified synthetic data.
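For context on the comparison above, here is a minimal sketch of a mean-difference baseline. It assumes DBM denotes a difference-between-the-means explanation (the abbreviation is not expanded in the summary above) and that "few features" is enforced by keeping only the k largest-magnitude entries. The function name and the toy data are hypothetical.

```python
import numpy as np

def mean_difference_explanation(group_a, group_b, k):
    """Explain group_a -> group_b as the difference of feature means,
    keeping only the k features with the largest absolute difference.

    Assumption: this is an illustrative reading of a DBM-style baseline,
    not the code used in the paper or the report.
    """
    delta = group_b.mean(axis=0) - group_a.mean(axis=0)
    sparse = np.zeros_like(delta)
    if k > 0:
        keep = np.argsort(np.abs(delta))[-k:]
        sparse[keep] = delta[keep]
    return sparse

# Toy data: rows are points, columns are input features.
group_a = np.array([[0.0, 1.0, 2.0], [0.0, 3.0, 2.0]])
group_b = np.array([[5.0, 2.5, 2.0], [5.0, 2.5, 2.2]])

explanation = mean_difference_explanation(group_a, group_b, k=1)
```

With k=1 the sketch keeps only the first feature, whose mean differs most between the two toy groups.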

Jan 31 2021

[Re] Explaining Groups of Points in Low-Dimensional Representations

RC 2020 · Damiaan J W Reijnaers, Daniël B van de Pavert, Giguru Scheuer, Liang Huang

The results presented in the original paper [1] were reproducible, both with the provided code and with our own implementation. Our additional experiments highlight several limitations of the explanatory algorithm: it relies heavily on the shape and variance of the clusters present in the data (and, if applicable, on the method used to label these clusters), and highly non-linear dimensionality reduction algorithms perform worse in terms of explainability.
