Intriguing properties of generative classifiers

28 Sep 2023 · Priyank Jaini, Kevin Clark, Robert Geirhos

What is the best paradigm to recognize objects -- discriminative inference (fast but potentially prone to shortcut learning) or using a generative model (slow but potentially more robust)? We build on recent advances in generative modeling that turn text-to-image models into classifiers. This allows us to study their behavior and to compare them against discriminative models and human psychophysical data. We report four intriguing emergent properties of generative classifiers: they show a record-breaking human-like shape bias (99% for Imagen), near human-level out-of-distribution accuracy, state-of-the-art alignment with human classification errors, and they understand certain perceptual illusions. Our results indicate that while the current dominant paradigm for modeling human object recognition is discriminative inference, zero-shot generative models approximate human object recognition data surprisingly well.
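As a rough illustration of what "classifying with a generative model" means, the sketch below applies Bayes' rule: score each candidate label y by log p(x | y) + log p(y) and pick the highest. This is a toy under stated assumptions, not the procedure used in the paper. The diagonal Gaussian is a stand-in for a class-conditional generative model, and the names `gaussian_log_likelihood`, `generative_classify`, and `toy_models` are hypothetical. How a text-to-image model scores an image for each class is not described on this page.

```python
import math


def gaussian_log_likelihood(x, mean, var):
    """Log density of x under a diagonal Gaussian.

    Stand-in (assumption) for a class-conditional generative model p(x | y).
    """
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def generative_classify(x, class_models, log_priors=None):
    """Return the label maximising log p(x | y) + log p(y), plus all scores."""
    labels = list(class_models)
    if log_priors is None:
        # Uniform prior over labels when none is supplied.
        log_priors = {y: -math.log(len(labels)) for y in labels}
    scores = {
        y: gaussian_log_likelihood(x, *class_models[y]) + log_priors[y]
        for y in labels
    }
    return max(scores, key=scores.get), scores


# Hypothetical two-class toy problem: (mean, variance) per class.
toy_models = {
    "cat": ([0.0, 0.0], [1.0, 1.0]),
    "dog": ([3.0, 3.0], [1.0, 1.0]),
}

label, scores = generative_classify([0.5, -0.2], toy_models)
print(label)
```

A discriminative classifier would instead model p(y | x) directly. The generative route has to evaluate the model once per candidate label, which is consistent with the abstract's description of it as slower.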


Results from the Paper


Task | Dataset | Model | Metric Name | Metric Value (%) | Global Rank
Object Recognition | shape bias | Imagen | shape bias | 98.7 | #1
Object Recognition | shape bias | Parti | shape bias | 91.7 | #3
Object Recognition | shape bias | Stable Diffusion | shape bias | 92.7 | #2
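The shape bias values above are percentages. A minimal sketch of how such a score can be computed follows, assuming the usual cue-conflict definition: each test image combines the shape of one category with the texture of another, and shape bias is the share of shape decisions among all decisions that match either the shape or the texture category. The function name `shape_bias` and the toy trials are hypothetical. The exact evaluation protocol behind the listed numbers is not given on this page.

```python
def shape_bias(trials):
    """Percentage of shape decisions among shape-or-texture decisions.

    Each trial is a tuple (predicted_label, shape_label, texture_label).
    Trials whose prediction matches neither cue are ignored, as are
    trials where shape and texture agree (no cue conflict).
    """
    shape_hits = 0
    texture_hits = 0
    for predicted, shape_label, texture_label in trials:
        if shape_label == texture_label:
            continue  # not a cue-conflict image
        if predicted == shape_label:
            shape_hits += 1
        elif predicted == texture_label:
            texture_hits += 1
    total = shape_hits + texture_hits
    if total == 0:
        raise ValueError("no shape or texture decisions to score")
    return 100.0 * shape_hits / total


# Hypothetical toy data: (prediction, shape category, texture category).
toy_trials = [
    ("cat", "cat", "elephant"),       # shape decision
    ("elephant", "cat", "elephant"),  # texture decision
    ("cat", "cat", "clock"),          # shape decision
    ("dog", "cat", "clock"),          # matches neither cue, ignored
    ("car", "car", "bottle"),         # shape decision
]

print(shape_bias(toy_trials))
```

Under this definition, a score of 100 means every counted decision followed shape, and 50 means shape and texture decisions were equally frequent.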
