Search Results for author: Siva Karthik Mustikovela

Found 10 papers, 2 papers with code

Making Large Multimodal Models Understand Arbitrary Visual Prompts

no code implementations • 1 Dec 2023 • Mu Cai, Haotian Liu, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Dennis Park, Yong Jae Lee

Furthermore, we present ViP-Bench, a comprehensive benchmark to assess the capability of models in understanding visual prompts across multiple dimensions, enabling future research in this domain.

Visual Commonsense Reasoning Visual Prompting

Paper
Add Code

NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects

1 code implementation • 24 Aug 2023 • Dakshit Agrawal, Jiajie Xu, Siva Karthik Mustikovela, Ioannis Gkioulekas, Ashish Shrivastava, Yuning Chai

We propose a novel-view augmentation (NOVA) strategy to train NeRFs for photo-realistic 3D composition of dynamic objects in a static scene.

Optical Flow Estimation

Paper
Code

Self-Supervised Object Detection via Generative Image Synthesis

no code implementations • ICCV 2021 • Siva Karthik Mustikovela, Shalini De Mello, Aayush Prakash, Umar Iqbal, Sifei Liu, Thu Nguyen-Phuoc, Carsten Rother, Jan Kautz

We present SSOD, the first end-to-end analysis-by synthesis framework with controllable GANs for the task of self-supervised object detection.

Image Generation Object +2

Paper
Add Code

Intrinsic Autoencoders for Joint Neural Rendering and Intrinsic Image Decomposition

no code implementations • 29 Jun 2020 • Hassan Abu Alhaija, Siva Karthik Mustikovela, Justus Thies, Varun Jampani, Matthias Nießner, Andreas Geiger, Carsten Rother

Neural rendering techniques promise efficient photo-realistic image synthesis while at the same time providing rich control over scene parameters by learning the physical image formation process.

Image-to-Image Translation Intrinsic Image Decomposition +1

Paper
Add Code

Self-Supervised Viewpoint Learning From Image Collections

2 code implementations • CVPR 2020 • Siva Karthik Mustikovela, Varun Jampani, Shalini De Mello, Sifei Liu, Umar Iqbal, Carsten Rother, Jan Kautz

Training deep neural networks to estimate the viewpoint of objects requires large labeled training datasets.

Object Viewpoint Estimation

215

Paper
Code

Geometric Image Synthesis

no code implementations • 12 Sep 2018 • Hassan Abu Alhaija, Siva Karthik Mustikovela, Andreas Geiger, Carsten Rother

The task of generating natural images from 3D scenes has been a long standing goal in computer graphics.

Image Generation Instance Segmentation +1

Paper
Add Code

iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects

no code implementations • 5 Dec 2017 • Omid Hosseini Jafari, Siva Karthik Mustikovela, Karl Pertsch, Eric Brachmann, Carsten Rother

We address the task of 6D pose estimation of known rigid objects from single input images in scenarios where the objects are partly occluded.

6D Pose Estimation 6D Pose Estimation using RGB +3

Paper
Add Code

Bounding Boxes, Segmentations and Object Coordinates: How Important Is Recognition for 3D Scene Flow Estimation in Autonomous Driving Scenarios?

no code implementations • ICCV 2017 • Aseem Behl, Omid Hosseini Jafari, Siva Karthik Mustikovela, Hassan Abu Alhaija, Carsten Rother, Andreas Geiger

Existing methods for 3D scene flow estimation often fail in the presence of large displacement or local ambiguities, e. g., at texture-less or reflective surfaces.

Autonomous Driving Instance Segmentation +3

Paper
Add Code

Augmented Reality Meets Computer Vision : Efficient Data Generation for Urban Driving Scenes

no code implementations • 4 Aug 2017 • Hassan Abu Alhaija, Siva Karthik Mustikovela, Lars Mescheder, Andreas Geiger, Carsten Rother

Further, we demonstrate the utility of our approach on training standard deep models for semantic instance segmentation and object detection of cars in outdoor driving scenes.

Instance Segmentation Object +3

Paper
Add Code

Can Ground Truth Label Propagation from Video help Semantic Segmentation?

no code implementations • 3 Oct 2016 • Siva Karthik Mustikovela, Michael Ying Yang, Carsten Rother

For state-of-the-art semantic segmentation task, training convolutional neural networks (CNNs) requires dense pixelwise ground truth (GT) labeling, which is expensive and involves extensive human effort.

Semantic Segmentation Video Segmentation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.