Search Results for author: Stephan Wäldchen

Found 6 papers, 3 papers with code

Hardness of Deceptive Certificate Selection

no code implementations7 Jun 2023 Stephan Wäldchen

A prover selects a certificate from the datapoint and sends it to a verifier who decides the class.

Feature Correlation

Interpretability Guarantees with Merlin-Arthur Classifiers

1 code implementation1 Jun 2022 Stephan Wäldchen, Kartikey Sharma, Berkant Turan, Max Zimmer, Sebastian Pokutta

We propose an interactive multi-agent classifier that provides provable interpretability guarantees even for complex agents such as neural networks.

Feature Correlation

A Complete Characterisation of ReLU-Invariant Distributions

no code implementations13 Dec 2021 Jan Macdonald, Stephan Wäldchen

We prove that no invariant parametrised family of distributions can exist unless at least one of the following three restrictions holds: First, the network layers have a width of one, which is unreasonable for practical neural networks.

Explainable artificial intelligence Explainable Artificial Intelligence (XAI) +1

A Rate-Distortion Framework for Explaining Neural Network Decisions

2 code implementations27 May 2019 Jan Macdonald, Stephan Wäldchen, Sascha Hauch, Gitta Kutyniok

We formalise the widespread idea of interpreting neural network decisions as an explicit optimisation problem in a rate-distortion framework.

General Classification Image Classification

Unmasking Clever Hans Predictors and Assessing What Machines Really Learn

1 code implementation26 Feb 2019 Sebastian Lapuschkin, Stephan Wäldchen, Alexander Binder, Grégoire Montavon, Wojciech Samek, Klaus-Robert Müller

Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly "intelligent" behavior.

Cannot find the paper you are looking for? You can Submit a new open access paper.