Search Results for author: Rishabh Kabra

Found 11 papers, 5 papers with code

Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models

no code implementations13 Jun 2024 Ziyi Wu, Yulia Rubanova, Rishabh Kabra, Drew A. Hudson, Igor Gilitschenski, Yusuf Aytar, Sjoerd van Steenkiste, Kelsey R. Allen, Thomas Kipf

By fine-tuning a pre-trained text-to-image diffusion model with this information, our approach enables fine-grained 3D pose and placement control of individual objects in a scene.

Object

Leveraging VLM-Based Pipelines to Annotate 3D Objects

no code implementations29 Nov 2023 Rishabh Kabra, Loic Matthey, Alexander Lerchner, Niloy J. Mitra

With these supervised and unsupervised evaluations, we show how a VLM-based pipeline can be leveraged to produce reliable annotations for 764K objects from the Objaverse dataset.

In-Context Learning Language Modelling +2

SIMONe: View-Invariant, Temporally-Abstracted Object Representations via Unsupervised Video Decomposition

1 code implementation NeurIPS 2021 Rishabh Kabra, Daniel Zoran, Goker Erdogan, Loic Matthey, Antonia Creswell, Matthew Botvinick, Alexander Lerchner, Christopher P. Burgess

Leveraging the shared structure that exists across different scenes, our model learns to infer two sets of latent representations from RGB video input alone: a set of "object" latents, corresponding to the time-invariant, object-level contents of the scene, as well as a set of "frame" latents, corresponding to global time-varying elements such as viewpoint.

Instance Segmentation Object +1

Unsupervised Object-Based Transition Models for 3D Partially Observable Environments

no code implementations NeurIPS 2021 Antonia Creswell, Rishabh Kabra, Chris Burgess, Murray Shanahan

We present a slot-wise, object-based transition model that decomposes a scene into objects, aligns them (with respect to a slot-wise object memory) to maintain a consistent order across time, and predicts how those objects evolve over successive frames.

Object

PARTS: Unsupervised Segmentation With Slots, Attention and Independence Maximization

no code implementations ICCV 2021 Daniel Zoran, Rishabh Kabra, Alexander Lerchner, Danilo J. Rezende

We present a model that is able to segment visual scenes from complex 3D environments into distinct objects, learn disentangled representations of individual objects, and form consistent and coherent predictions of future frames, in a fully unsupervised manner.

Representation Learning Scene Segmentation

Multi-Object Representation Learning with Iterative Variational Inference

6 code implementations1 Mar 2019 Klaus Greff, Raphaël Lopez Kaufman, Rishabh Kabra, Nick Watters, Chris Burgess, Daniel Zoran, Loic Matthey, Matthew Botvinick, Alexander Lerchner

Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities.

Object Representation Learning +3

Cannot find the paper you are looking for? You can Submit a new open access paper.