Search Results for author: Siva Karthik Mustikovela

Found 10 papers, 2 papers with code

Making Large Multimodal Models Understand Arbitrary Visual Prompts

no code implementations1 Dec 2023 Mu Cai, Haotian Liu, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Dennis Park, Yong Jae Lee

Furthermore, we present ViP-Bench, a comprehensive benchmark to assess the capability of models in understanding visual prompts across multiple dimensions, enabling future research in this domain.

Visual Commonsense Reasoning Visual Prompting

NOVA: NOvel View Augmentation for Neural Composition of Dynamic Objects

1 code implementation24 Aug 2023 Dakshit Agrawal, Jiajie Xu, Siva Karthik Mustikovela, Ioannis Gkioulekas, Ashish Shrivastava, Yuning Chai

We propose a novel-view augmentation (NOVA) strategy to train NeRFs for photo-realistic 3D composition of dynamic objects in a static scene.

Optical Flow Estimation

Intrinsic Autoencoders for Joint Neural Rendering and Intrinsic Image Decomposition

no code implementations29 Jun 2020 Hassan Abu Alhaija, Siva Karthik Mustikovela, Justus Thies, Varun Jampani, Matthias Nießner, Andreas Geiger, Carsten Rother

Neural rendering techniques promise efficient photo-realistic image synthesis while at the same time providing rich control over scene parameters by learning the physical image formation process.

Image-to-Image Translation Intrinsic Image Decomposition +1

Geometric Image Synthesis

no code implementations12 Sep 2018 Hassan Abu Alhaija, Siva Karthik Mustikovela, Andreas Geiger, Carsten Rother

The task of generating natural images from 3D scenes has been a long standing goal in computer graphics.

Image Generation Instance Segmentation +1

iPose: Instance-Aware 6D Pose Estimation of Partly Occluded Objects

no code implementations5 Dec 2017 Omid Hosseini Jafari, Siva Karthik Mustikovela, Karl Pertsch, Eric Brachmann, Carsten Rother

We address the task of 6D pose estimation of known rigid objects from single input images in scenarios where the objects are partly occluded.

6D Pose Estimation 6D Pose Estimation using RGB +3

Augmented Reality Meets Computer Vision : Efficient Data Generation for Urban Driving Scenes

no code implementations4 Aug 2017 Hassan Abu Alhaija, Siva Karthik Mustikovela, Lars Mescheder, Andreas Geiger, Carsten Rother

Further, we demonstrate the utility of our approach on training standard deep models for semantic instance segmentation and object detection of cars in outdoor driving scenes.

Instance Segmentation Object +3

Can Ground Truth Label Propagation from Video help Semantic Segmentation?

no code implementations3 Oct 2016 Siva Karthik Mustikovela, Michael Ying Yang, Carsten Rother

For state-of-the-art semantic segmentation task, training convolutional neural networks (CNNs) requires dense pixelwise ground truth (GT) labeling, which is expensive and involves extensive human effort.

Semantic Segmentation Video Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.