Search Results for author: Roland S. Zimmermann

Found 13 papers, 6 papers with code

Scale Alone Does not Improve Mechanistic Interpretability in Vision Models

no code implementations NeurIPS 2023 Roland S. Zimmermann, Thomas Klein, Wieland Brendel

We use a psychophysical paradigm to quantify one form of mechanistic interpretability for a diverse suite of nine models and find no scaling effect for interpretability - neither for model nor dataset size.

Don't trust your eyes: on the (un)reliability of feature visualizations

1 code implementation7 Jun 2023 Robert Geirhos, Roland S. Zimmermann, Blair Bilodeau, Wieland Brendel, Been Kim

Today, visualization methods form the foundation of our knowledge about the internal workings of neural networks, as a type of mechanistic interpretability.

Sensitivity of Slot-Based Object-Centric Models to their Number of Slots

no code implementations30 May 2023 Roland S. Zimmermann, Sjoerd van Steenkiste, Mehdi S. M. Sajjadi, Thomas Kipf, Klaus Greff

Self-supervised methods for learning object-centric representations have recently been applied successfully to various datasets.

Provably Learning Object-Centric Representations

no code implementations23 May 2023 Jack Brady, Roland S. Zimmermann, Yash Sharma, Bernhard Schölkopf, Julius von Kügelgen, Wieland Brendel

Under this generative process, we prove that the ground-truth object representations can be identified by an invertible and compositional inference model, even in the presence of dependencies between objects.

Object Representation Learning

Increasing Confidence in Adversarial Robustness Evaluations

no code implementations28 Jun 2022 Roland S. Zimmermann, Wieland Brendel, Florian Tramer, Nicholas Carlini

Hundreds of defenses have been proposed to make deep neural networks robust against minimal (adversarial) input perturbations.

Adversarial Robustness

Score-Based Generative Classifiers

no code implementations1 Oct 2021 Roland S. Zimmermann, Lukas Schott, Yang song, Benjamin A. Dunn, David A. Klindt

In this work, we investigate score-based generative models as classifiers for natural images.

Classification

How Well do Feature Visualizations Support Causal Understanding of CNN Activations?

1 code implementation NeurIPS 2021 Roland S. Zimmermann, Judy Borowski, Robert Geirhos, Matthias Bethge, Thomas S. A. Wallis, Wieland Brendel

A precise understanding of why units in an artificial network respond to certain stimuli would constitute a big step towards explainable artificial intelligence.

Explainable artificial intelligence

A simple way to make neural networks robust against diverse image corruptions

3 code implementations ECCV 2020 Evgenia Rusak, Lukas Schott, Roland S. Zimmermann, Julian Bitterwolf, Oliver Bringmann, Matthias Bethge, Wieland Brendel

The human visual system is remarkably robust against a wide range of naturally occurring variations and corruptions like rain or snow.

Comment on "Adv-BNN: Improved Adversarial Defense through Robust Bayesian Neural Network"

no code implementations1 Jul 2019 Roland S. Zimmermann

A recent paper by Liu et al. combines the topics of adversarial training and Bayesian Neural Networks (BNN) and suggests that adversarially trained BNNs are more robust against adversarial attacks than their non-Bayesian counterparts.

Adversarial Attack Adversarial Defense

Simion Zoo: A Workbench for Distributed Experimentation with Reinforcement Learning for Continuous Control Tasks

no code implementations16 Apr 2019 Borja Fernandez-Gauna, Manuel Graña, Roland S. Zimmermann

We present Simion Zoo, a Reinforcement Learning (RL) workbench that provides a complete set of tools to design, run, and analyze the results, both statistically and visually, of RL control applications.

Continuous Control reinforcement-learning +1

Faster Training of Mask R-CNN by Focusing on Instance Boundaries

2 code implementations19 Sep 2018 Roland S. Zimmermann, Julien N. Siems

We present an auxiliary task to Mask R-CNN, an instance segmentation network, which leads to faster training of the mask head.

Edge Detection Instance Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.