Search Results for author: Suryansh Kumar

Found 32 papers, 4 papers with code

ICGNet: A Unified Approach for Instance-Centric Grasping

no code implementations18 Jan 2024 René Zurbrügg, Yifan Liu, Francis Engelmann, Suryansh Kumar, Marco Hutter, Vaishakh Patil, Fisher Yu

Executing a successful grasp in a cluttered environment requires multiple levels of scene understanding: First, the robot needs to analyze the geometric properties of individual objects to find feasible grasps.

Object Object Reconstruction +1

Learning Robust Multi-Scale Representation for Neural Radiance Fields from Unposed Images

no code implementations8 Nov 2023 Nishant Jain, Suryansh Kumar, Luc van Gool

The key ideas presented in this paper are (i) Recovering accurate camera parameters via a robust pipeline from unposed day-to-day images is equally crucial in neural novel view synthesis problem; (ii) It is rather more practical to model object's content at different resolutions since dramatic camera motion is highly likely in day-to-day unposed images.

Depth Estimation Depth Prediction +2

How To Not Train Your Dragon: Training-free Embodied Object Goal Navigation with Semantic Frontiers

no code implementations26 May 2023 Junting Chen, Guohao Li, Suryansh Kumar, Bernard Ghanem, Fisher Yu

Our method propagates semantics on the scene graphs based on language priors and scene statistics to introduce semantic knowledge to the geometric frontiers.

Imitation Learning Navigate +2

Neural Implicit Dense Semantic SLAM

no code implementations27 Apr 2023 Yasaman Haghighi, Suryansh Kumar, Jean-Philippe Thiran, Luc van Gool

Visual Simultaneous Localization and Mapping (vSLAM) is a widely used technique in robotics and computer vision that enables a robot to create a map of an unfamiliar environment using a camera sensor while simultaneously tracking its position over time.

Scene Understanding Semantic Segmentation +1

Quantum Annealing for Single Image Super-Resolution

no code implementations18 Apr 2023 Han Yao Choong, Suryansh Kumar, Luc van Gool

As a result, in this work, we take the privilege to perform an early exploration of applying a quantum computing algorithm to this important image enhancement problem, i. e., SISR.

Combinatorial Optimization Image Enhancement +1

Single Image Depth Prediction Made Better: A Multivariate Gaussian Take

no code implementations CVPR 2023 Ce Liu, Suryansh Kumar, Shuhang Gu, Radu Timofte, Luc van Gool

Accordingly, we introduce an approach that performs continuous modeling of per-pixel depth, where we can predict and reason about the per-pixel depth and its distribution.

Depth Estimation Depth Prediction

Enhanced Stable View Synthesis

no code implementations CVPR 2023 Nishant Jain, Suryansh Kumar, Luc van Gool

Extensive evaluation of our approach on the popular benchmark dataset, such as Tanks and Temples, shows substantial improvement in view synthesis results compared to the prior art.

3D Reconstruction Novel View Synthesis

VA-DepthNet: A Variational Approach to Single Image Depth Prediction

2 code implementations13 Feb 2023 Ce Liu, Suryansh Kumar, Shuhang Gu, Radu Timofte, Luc van Gool

While state-of-the-art deep neural network methods for SIDP learn the scene depth from images in a supervised setting, they often overlook the invaluable invariances and priors in the rigid scene space, such as the regularity of the scene.

Depth Prediction Monocular Depth Estimation

Uncertainty-Driven Dense Two-View Structure from Motion

no code implementations1 Feb 2023 Weirong Chen, Suryansh Kumar, Fisher Yu

This work introduces an effective and practical solution to the dense two-view structure from motion (SfM) problem.

Depth Estimation Optical Flow Estimation +2

CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion

no code implementations2 Dec 2022 Tobias Fischer, Yung-Hsu Yang, Suryansh Kumar, Min Sun, Fisher Yu

To track the 3D locations and trajectories of the other traffic participants at any given time, modern autonomous vehicles are equipped with multiple cameras that cover the vehicle's full surroundings.

3D Object Tracking Autonomous Vehicles +2

Multi-View Photometric Stereo Revisited

no code implementations14 Oct 2022 Berk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc van Gool

The proposed approach in this paper exploits the benefit of uncertainty modeling in a deep neural network for a reliable fusion of photometric stereo (PS) and multi-view stereo (MVS) network predictions.

3D Shape Representation

Robustifying the Multi-Scale Representation of Neural Radiance Fields

no code implementations9 Oct 2022 Nishant Jain, Suryansh Kumar, Luc van Gool

Although recently proposed Mip-NeRF could handle multi-scale imaging problems with NeRF, it cannot handle camera pose estimation error.

Pose Estimation

Organic Priors in Non-Rigid Structure from Motion

no code implementations13 Jul 2022 Suryansh Kumar, Luc van Gool

Besides that, the paper provides insights into the NRSfM factorization -- both in terms of shape and motion -- and is the first approach to show the benefit of single rotation averaging for NRSfM.

Learning Online Multi-Sensor Depth Fusion

1 code implementation7 Apr 2022 Erik Sandström, Martin R. Oswald, Suryansh Kumar, Silvan Weder, Fisher Yu, Cristian Sminchisescu, Luc van Gool

Multi-sensor depth fusion is able to substantially improve the robustness and accuracy of 3D reconstruction methods, but existing techniques are not robust enough to handle sensors which operate with diverse value ranges as well as noise and outlier statistics.

3D Reconstruction Mixed Reality +1

Uncertainty-Aware Deep Multi-View Photometric Stereo

no code implementations CVPR 2022 Berk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc van Gool

At each pixel, our approach either selects or discards deep-PS and deep-MVS network prediction depending on the prediction uncertainty measure.

Surface Reconstruction

Neural Radiance Fields Approach to Deep Multi-View Photometric Stereo

no code implementations11 Oct 2021 Berk Kaya, Suryansh Kumar, Francesco Sarno, Vittorio Ferrari, Luc van Gool

Our method performs neural rendering of multi-view images while utilizing surface normals estimated by a deep photometric stereo network.

3D Reconstruction Neural Rendering

Neural Architecture Search for Efficient Uncalibrated Deep Photometric Stereo

no code implementations11 Oct 2021 Francesco Sarno, Suryansh Kumar, Berk Kaya, Zhiwu Huang, Vittorio Ferrari, Luc van Gool

We then perform a continuous relaxation of this search space and present a gradient-based optimization strategy to find an efficient light calibration and normal estimation network.

Neural Architecture Search

Generative Flows with Invertible Attentions

no code implementations CVPR 2022 Rhea Sanjay Sukthanker, Zhiwu Huang, Suryansh Kumar, Radu Timofte, Luc van Gool

The key idea is to exploit a masked scheme of these two attentions to learn long-range data dependencies in the context of generative flows.

Image Generation

Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution

no code implementations17 Jan 2021 Yan Wu, Zhiwu Huang, Suryansh Kumar, Rhea Sanjay Sukthanker, Radu Timofte, Luc van Gool

Modern solutions to the single image super-resolution (SISR) problem using deep neural networks aim not only at better performance accuracy but also at a lighter and computationally efficient model.

Image Super-Resolution Neural Architecture Search

Neural Architecture Search of SPD Manifold Networks

1 code implementation27 Oct 2020 Rhea Sanjay Sukthanker, Zhiwu Huang, Suryansh Kumar, Erik Goron Endsjo, Yan Wu, Luc van Gool

To address this problem, we first introduce a geometrically rich and diverse SPD neural architecture search space for an efficient SPD cell design.

Emotion Recognition Neural Architecture Search

Dense Non-Rigid Structure from Motion: A Manifold Viewpoint

no code implementations15 Jun 2020 Suryansh Kumar, Luc van Gool, Carlos E. P. de Oliveira, Anoop Cherian, Yuchao Dai, Hongdong Li

Assuming that a deforming shape is composed of a union of local linear subspace and, span a global low-rank space over multiple frames enables us to efficiently model complex non-rigid deformations.

Clustering

Superpixel Soup: Monocular Dense 3D Reconstruction of a Complex Dynamic Scene

no code implementations19 Nov 2019 Suryansh Kumar, Yuchao Dai, Hongdong Li

We assume that a dynamic scene can be approximated by numerous piecewise planar surfaces, where each planar surface enjoys its own rigid motion, and the global change in the scene between two frames is as-rigid-as-possible (ARAP).

3D Reconstruction

Non-Rigid Structure from Motion: Prior-Free Factorization Method Revisited

no code implementations27 Feb 2019 Suryansh Kumar

The benefit of this work lies in its simplicity of implementation, strong theoretical justification to the motion and structure estimation, and its invincible originality.

Dense Depth Estimation of a Complex Dynamic Scene without Explicit 3D Motion Estimation

no code implementations11 Feb 2019 Suryansh Kumar, Ram Srivatsav Ghorakavi, Yuchao Dai, Hongdong Li

Given per-pixel optical flow correspondences between two consecutive frames and, the sparse depth prior for the reference frame, we show that, we can effectively recover the dense depth map for the successive frames without solving for 3D motion parameters.

Depth Estimation Motion Estimation +1

Jumping Manifolds: Geometry Aware Dense Non-Rigid Structure from Motion

no code implementations CVPR 2019 Suryansh Kumar

These Grassmann points in the lower-dimension then act as a representative for the selection of high-dimensional Grassmann samples to perform each local reconstruction.

Clustering

Scalable Dense Non-rigid Structure-from-Motion: A Grassmannian Perspective

no code implementations CVPR 2018 Suryansh Kumar, Anoop Cherian, Yuchao Dai, Hongdong Li

To address these issues, in this paper, we propose a new approach for dense NRSfM by modeling the problem on a Grassmann manifold.

Spatial-Temporal Union of Subspaces for Multi-body Non-rigid Structure-from-Motion

no code implementations14 May 2017 Suryansh Kumar, Yuchao Dai, Hongdong Li

This spatio-temporal representation not only provides competitive 3D reconstruction but also outputs robust segmentation of multiple non-rigid objects.

3D Reconstruction

Multi-body Non-rigid Structure-from-Motion

no code implementations15 Jul 2016 Suryansh Kumar, Yuchao Dai, Hongdong Li

Recent progress have extended SFM to the areas of {multi-body SFM} (where there are {multiple rigid} relative motions in the scene), as well as {non-rigid SFM} (where there is a single non-rigid, deformable object or scene).

3D Reconstruction Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.