Search Results for author: Fabian Manhardt

Found 38 papers, 19 papers with code

Mixed Diffusion for 3D Indoor Scene Synthesis

1 code implementation31 May 2024 Siyi Hu, Diego Martin Arroyo, Stephanie Debats, Fabian Manhardt, Luca Carlone, Federico Tombari

Realistic conditional 3D scene synthesis significantly enhances and accelerates the creation of virtual environments, which can also provide extensive training data for computer vision and robotics research among other applications.

Denoising Indoor Scene Synthesis

OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views

no code implementations4 Apr 2024 Francis Engelmann, Fabian Manhardt, Michael Niemeyer, Keisuke Tateno, Marc Pollefeys, Federico Tombari

Our OpenNeRF further leverages NeRF's ability to render novel views and extract open-set VLM features from areas that are not well observed in the initial posed images.

Image Segmentation Point Cloud Segmentation +2

3D scene generation from scene graphs and self-attention

1 code implementation2 Apr 2024 Pietro Bonazzi, Mengqi Wang, Diego Martin Arroyo, Fabian Manhardt, Nico Messikomer, Federico Tombari, Davide Scaramuzza

Synthesizing realistic and diverse indoor 3D scene layouts in a controllable fashion opens up applications in simulated navigation and virtual reality.

Scene Generation

KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation

1 code implementation CVPR 2024 Ruida Zhang, Chenyangguang Zhang, Yan Di, Fabian Manhardt, Xingyu Liu, Federico Tombari, Xiangyang Ji

In this paper, we present KP-RED, a unified KeyPoint-driven REtrieval and Deformation framework that takes object scans as input and jointly retrieves and deforms the most geometrically similar CAD models from a pre-processed database to tightly match the target.

3D Shape Retrieval Retrieval

Denoising Diffusion via Image-Based Rendering

no code implementations5 Feb 2024 Titas Anciukevičius, Fabian Manhardt, Federico Tombari, Paul Henderson

In this work, we introduce the first diffusion model able to perform fast, detailed reconstruction and generation of real-world 3D scenes.

3D Reconstruction Denoising +1

SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation

1 code implementation CVPR 2024 Yamei Chen, Yan Di, Guangyao Zhai, Fabian Manhardt, Chenyangguang Zhang, Ruida Zhang, Federico Tombari, Nassir Navab, Benjamin Busam

Leveraging the advantage of DINOv2 in providing SE(3)-consistent semantic features, we hierarchically extract two types of SE(3)-invariant geometric features to further encapsulate local-to-global object-specific information.

Object Pose Estimation

SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs

no code implementations21 Sep 2023 Guangyao Zhai, Xiaoni Cai, Dianye Huang, Yan Di, Fabian Manhardt, Federico Tombari, Nassir Navab, Benjamin Busam

In this paper, we present SG-Bot, a novel rearrangement framework that utilizes a coarse-to-fine scheme with a scene graph as the scene representation.

CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction

no code implementations15 Aug 2023 Yan Di, Chenyangguang Zhang, Pengyuan Wang, Guangyao Zhai, Ruida Zhang, Fabian Manhardt, Benjamin Busam, Xiangyang Ji, Federico Tombari

However, such strategies fail to consistently align the denoised point cloud with the given image, leading to unstable conditioning and inferior performance.

3D Reconstruction

U-RED: Unsupervised 3D Shape Retrieval and Deformation for Partial Point Clouds

1 code implementation ICCV 2023 Yan Di, Chenyangguang Zhang, Ruida Zhang, Fabian Manhardt, Yongzhi Su, Jason Rambach, Didier Stricker, Xiangyang Ji, Federico Tombari

In this paper, we propose U-RED, an Unsupervised shape REtrieval and Deformation pipeline that takes an arbitrary object observation as input, typically captured by RGB images or scans, and jointly retrieves and deforms the geometrically similar CAD models from a pre-established database to tightly match the target.

3D Shape Retrieval Retrieval

View-to-Label: Multi-View Consistency for Self-Supervised 3D Object Detection

no code implementations29 May 2023 Issa Mouawad, Nikolas Brasch, Fabian Manhardt, Federico Tombari, Francesca Odone

For autonomous vehicles, driving safely is highly dependent on the capability to correctly perceive the environment in 3D space, hence the task of 3D object detection represents a fundamental aspect of perception.

3D Object Detection Autonomous Vehicles +1

TextMesh: Generation of Realistic 3D Meshes From Text Prompts

1 code implementation24 Apr 2023 Christina Tsalicoglou, Fabian Manhardt, Alessio Tonioni, Michael Niemeyer, Federico Tombari

In addition, we propose a novel way to finetune the mesh texture, removing the effect of high saturation and improving the details of the output 3D mesh.

NeRFMeshing: Distilling Neural Radiance Fields into Geometrically-Accurate 3D Meshes

no code implementations16 Mar 2023 Marie-Julie Rakotosaona, Fabian Manhardt, Diego Martin Arroyo, Michael Niemeyer, Abhijit Kundu, Federico Tombari

Obtaining 3D meshes from neural radiance fields still remains an open challenge since NeRFs are optimized for view synthesis, not enforcing an accurate underlying geometry on the radiance field.

Novel View Synthesis Surface Reconstruction

TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose Estimation

no code implementations CVPR 2023 Hanzhi Chen, Fabian Manhardt, Nassir Navab, Benjamin Busam

In this paper, we introduce neural texture learning for 6D object pose estimation from synthetic data and a few unlabelled real images.

6D Pose Estimation using RGB

SPARF: Neural Radiance Fields from Sparse and Noisy Poses

1 code implementation CVPR 2023 Prune Truong, Marie-Julie Rakotosaona, Fabian Manhardt, Federico Tombari

Neural Radiance Field (NeRF) has recently emerged as a powerful representation to synthesize photorealistic novel views.

Novel View Synthesis

OPA-3D: Occlusion-Aware Pixel-Wise Aggregation for Monocular 3D Object Detection

no code implementations2 Nov 2022 Yongzhi Su, Yan Di, Fabian Manhardt, Guangyao Zhai, Jason Rambach, Benjamin Busam, Didier Stricker, Federico Tombari

Despite monocular 3D object detection having recently made a significant leap forward thanks to the use of pre-trained depth estimators for pseudo-LiDAR recovery, such two-stage methods typically suffer from overfitting and are incapable of explicitly encapsulating the geometric relation between depth and object bounding box.

Monocular 3D Object Detection Object +1

SSP-Pose: Symmetry-Aware Shape Prior Deformation for Direct Category-Level Object Pose Estimation

no code implementations13 Aug 2022 Ruida Zhang, Yan Di, Fabian Manhardt, Federico Tombari, Xiangyang Ji

In this paper, to handle these shortcomings, we propose an end-to-end trainable network SSP-Pose for category-level pose estimation, which integrates shape priors into a direct pose regression network.

Pose Estimation regression

RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation

1 code implementation30 Jul 2022 Ruida Zhang, Yan Di, Zhiqiang Lou, Fabian Manhardt, Federico Tombari, Xiangyang Ji

Category-level object pose estimation aims to predict the 6D pose as well as the 3D metric size of arbitrary objects from a known set of categories.

Object Pose Estimation

GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting

3 code implementations CVPR 2022 Yan Di, Ruida Zhang, Zhiqiang Lou, Fabian Manhardt, Xiangyang Ji, Nassir Navab, Federico Tombari

While 6D object pose estimation has recently made a huge leap forward, most methods can still only handle a single or a handful of different objects, which limits their applications.

 Ranked #1 on 6D Pose Estimation on LineMOD (Mean ADD-S metric)

6D Pose Estimation 6D Pose Estimation using RGB +3

Object-aware Monocular Depth Prediction with Instance Convolutions

1 code implementation2 Dec 2021 Enis Simsar, Evin Pınar Örnek, Fabian Manhardt, Helisa Dhamo, Nassir Navab, Federico Tombari

With the advent of deep learning, estimating depth from a single RGB image has recently received a lot of attention, being capable of empowering many different applications ranging from path planning for robotics to computational cinematography.

Depth Estimation Depth Prediction +2

Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs

1 code implementation ICCV 2021 Helisa Dhamo, Fabian Manhardt, Nassir Navab, Federico Tombari

Scene graphs are representations of a scene, composed of objects (nodes) and inter-object relationships (edges), proven to be particularly suited for this task, as they allow for semantic control on the generated content.

Object

SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

2 code implementations ICCV 2021 Yan Di, Fabian Manhardt, Gu Wang, Xiangyang Ji, Nassir Navab, Federico Tombari

Directly regressing all 6 degrees-of-freedom (6DoF) for the object pose (e. g. the 3D rotation and translation) in a cluttered environment from a single RGB image is a challenging problem.

6D Pose Estimation 6D Pose Estimation using RGB +1

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation

1 code implementation CVPR 2021 Gu Wang, Fabian Manhardt, Federico Tombari, Xiangyang Ji

In this work, we perform an in-depth investigation on both direct and indirect methods, and propose a simple yet effective Geometry-guided Direct Regression Network (GDR-Net) to learn the 6D pose in an end-to-end manner from dense correspondence-based intermediate geometric representations.

6D Pose Estimation 6D Pose Estimation using RGB +1

Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data

no code implementations ICCV 2019 Fabian Manhardt, Diego Martin Arroyo, Christian Rupprecht, Benjamin Busam, Tolga Birdal, Nassir Navab, Federico Tombari

For each object instance we predict multiple pose and class outcomes to estimate the specific pose distribution generated by symmetries and repetitive textures.

3D Object Detection Object +3

Cannot find the paper you are looking for? You can Submit a new open access paper.