1 code implementation • 26 May 2022 • Bruno Sauvalle, Arnaud de La Fortelle
We introduce a new architecture for unsupervised object-centric representation learning and multi-object detection and segmentation, which uses a translation-equivariant attention mechanism to predict the coordinates of the objects present in the scene and to associate a feature vector to each object.
Ranked #1 on Unsupervised Object Segmentation on ClevrTex
1 code implementation • 15 Dec 2021 • Bruno Sauvalle, Arnaud de La Fortelle
The main novelty of the proposed model is that the autoencoder is also trained to predict the background noise, which allows to compute for each frame a pixel-dependent threshold to perform the foreground segmentation.