Search Results for author: Francis Engelmann

Found 34 papers, 14 papers with code

SLAG: Scalable Language-Augmented Gaussian Splatting

no code implementations12 May 2025 Laszlo Szilagyi, Francis Engelmann, Jeannette Bohg

To address this, we introduce SLAG, a multi-GPU framework for language-augmented Gaussian splatting that enhances the speed and scalability of embedding large scenes.

Language Modeling Language Modelling

Robust Human Registration with Body Part Segmentation on Noisy Point Clouds

no code implementations4 Apr 2025 Kai Lascheit, Daniel Barath, Marc Pollefeys, Leonidas Guibas, Francis Engelmann

Additionally, we demonstrate that the fitted human mesh can refine body part labels, leading to improved segmentation.

Pose Estimation Segmentation

ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding

1 code implementation17 Oct 2024 Guangda Ji, Silvan Weder, Francis Engelmann, Marc Pollefeys, Hermann Blum

Further, we push forward the state-of-the-art performance on ScanNet and ScanNet200 dataset with prevalent 3D semantic segmentation models, demonstrating the efficacy of our generated dataset.

Ranked #3 on Semantic Segmentation on ScanNet (using extra training data)

3D Semantic Segmentation Image Generation +1

Search3D: Hierarchical Open-Vocabulary 3D Segmentation

no code implementations27 Sep 2024 Ayca Takmaz, Alexandros Delitzas, Robert W. Sumner, Francis Engelmann, Johanna Wald, Federico Tombari

Existing methods for open-vocabulary 3D instance segmentation primarily focus on identifying object-level instances but struggle with finer-grained scene entities such as object parts, or regions described by generic attributes.

3D Instance Segmentation 3D Part Segmentation +3

P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising

no code implementations29 Aug 2024 Mathias Vogel, Keisuke Tateno, Marc Pollefeys, Federico Tombari, Marie-Julie Rakotosaona, Francis Engelmann

In this work, we tackle the task of point cloud denoising through a novel framework that adapts Diffusion Schr\"odinger bridges to points clouds.

Denoising

Improving 2D Feature Representations by 3D-Aware Fine-Tuning

no code implementations29 Jul 2024 Yuanwen Yue, Anurag Das, Francis Engelmann, Siyu Tang, Jan Eric Lenssen

In this work, we show that fine-tuning on 3D-aware data improves the quality of emerging semantic features.

Depth Estimation Semantic Segmentation

OpenDAS: Open-Vocabulary Domain Adaptation for 2D and 3D Segmentation

no code implementations30 May 2024 Gonca Yilmaz, Songyou Peng, Marc Pollefeys, Francis Engelmann, Hermann Blum

However, this flexibility comes with a trade-off: fully-supervised closed-set methods still outperform OVS methods on base classes, that is on classes on which they have been explicitly trained.

3D Instance Segmentation 3D Open-Vocabulary Instance Segmentation +8

Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds

no code implementations18 Apr 2024 Oliver Lemke, Zuria Bauer, René Zurbrügg, Marc Pollefeys, Francis Engelmann, Hermann Blum

This allows for accurate detection directly in 3D scenes, object- and environment-aware grasp prediction, as well as robust and repeatable robotic manipulation.

3D Instance Segmentation Pose Estimation +3

OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views

no code implementations4 Apr 2024 Francis Engelmann, Fabian Manhardt, Michael Niemeyer, Keisuke Tateno, Marc Pollefeys, Federico Tombari

Our OpenNeRF further leverages NeRF's ability to render novel views and extract open-set VLM features from areas that are not well observed in the initial posed images.

Image Segmentation NeRF +3

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

no code implementations30 Mar 2024 Yang Miao, Francis Engelmann, Olga Vysotska, Federico Tombari, Marc Pollefeys, Dániel Béla Baráth

We introduce a novel problem, i. e., the localization of an input image within a multi-modal reference map represented by a database of 3D scene graphs.

Visual Localization

ICGNet: A Unified Approach for Instance-Centric Grasping

1 code implementation18 Jan 2024 René Zurbrügg, Yifan Liu, Francis Engelmann, Suryansh Kumar, Marco Hutter, Vaishakh Patil, Fisher Yu

Executing a successful grasp in a cluttered environment requires multiple levels of scene understanding: First, the robot needs to analyze the geometric properties of individual objects to find feasible grasps.

Object Object Reconstruction +1

ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction

no code implementations29 Nov 2023 Silvan Weder, Francis Engelmann, Johannes L. Schönberger, Akihito Seki, Marc Pollefeys, Martin R. Oswald

Using these main contributions, our method can enable scenarios with real-time constraints and can scale to arbitrary scene sizes by processing and updating the scene only in a local region defined by the new measurement.

3D Semantic Segmentation Mixed Reality

AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation

1 code implementation1 Jun 2023 Yuanwen Yue, Sabarinath Mahadevan, Jonas Schult, Francis Engelmann, Bastian Leibe, Konrad Schindler, Theodora Kontogianni

In an iterative process, the model assigns each data point to an object (or the background), while the user corrects errors in the resulting segmentation and feeds them back into the model.

Binary Classification Interactive Segmentation +2

Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries

1 code implementation CVPR 2023 Yuanwen Yue, Theodora Kontogianni, Konrad Schindler, Francis Engelmann

Instead, we formulate floorplan reconstruction as a single-stage structured prediction task: find a variable-size set of polygons, which in turn are variable-length sequences of ordered vertices.

Structured Prediction Vocal Bursts Valence Prediction

Mask3D: Mask Transformer for 3D Semantic Instance Segmentation

1 code implementation6 Oct 2022 Jonas Schult, Francis Engelmann, Alexander Hermans, Or Litany, Siyu Tang, Bastian Leibe

Modern 3D semantic instance segmentation approaches predominantly rely on specialized voting mechanisms followed by carefully designed geometric clustering techniques.

3D Instance Segmentation 3D Semantic Instance Segmentation +2

4D-StOP: Panoptic Segmentation of 4D LiDAR using Spatio-temporal Object Proposal Generation and Aggregation

1 code implementation29 Sep 2022 Lars Kreuzberg, Idil Esen Zulfikar, Sabarinath Mahadevan, Francis Engelmann, Bastian Leibe

Our voting-based tracklet generation method followed by geometric feature-based aggregation generates significantly improved panoptic LiDAR segmentation quality when compared to modeling the entire 4D volume using Gaussian probability distributions.

4D Panoptic Segmentation Object Proposal Generation +1

Mix3D: Out-of-Context Data Augmentation for 3D Scenes

4 code implementations5 Oct 2021 Alexey Nekrasov, Jonas Schult, Or Litany, Bastian Leibe, Francis Engelmann

Since scene context helps reasoning about object semantics, current works focus on models with large capacity and receptive fields that can fully capture the global context of an input 3D scene.

3D Semantic Segmentation

3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance Segmentation

1 code implementation CVPR 2020 Francis Engelmann, Martin Bokeloh, Alireza Fathi, Bastian Leibe, Matthias Niessner

We show that grouping proposals improves over NMS and outperforms previous state-of-the-art methods on the tasks of 3D object detection and semantic instance segmentation on the ScanNetV2 benchmark and the S3DIS dataset.

3D Object Detection 3D Semantic Instance Segmentation +3

SAMP: Shape and Motion Priors for 4D Vehicle Reconstruction

1 code implementation2 May 2020 Francis Engelmann, Jörg Stückler, Bastian Leibe

In this paper, we propose to use 3D shape and motion priors to regularize the estimation of the trajectory and the shape of vehicles in sequences of stereo images.

Pose Estimation

3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation

1 code implementation30 Mar 2020 Francis Engelmann, Martin Bokeloh, Alireza Fathi, Bastian Leibe, Matthias Nießner

We show that grouping proposals improves over NMS and outperforms previous state-of-the-art methods on the tasks of 3D object detection and semantic instance segmentation on the ScanNetV2 benchmark and the S3DIS dataset.

3D Instance Segmentation 3D Object Detection +3

Dilated Point Convolutions: On the Receptive Field Size of Point Convolutions on 3D Point Clouds

1 code implementation28 Jul 2019 Francis Engelmann, Theodora Kontogianni, Bastian Leibe

In a thorough ablation study, we show that the receptive field size is directly related to the performance of 3D point cloud processing tasks, including semantic segmentation and object classification.

3D Semantic Segmentation

Know What Your Neighbors Do: 3D Semantic Segmentation of Point Clouds

no code implementations2 Oct 2018 Francis Engelmann, Theodora Kontogianni, Jonas Schult, Bastian Leibe

In this paper, we present a deep learning architecture which addresses the problem of 3D semantic segmentation of unstructured point clouds.

3D Semantic Segmentation Segmentation

Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds

1 code implementation5 Feb 2018 Francis Engelmann, Theodora Kontogianni, Alexander Hermans, Bastian Leibe

The recently proposed PointNet architecture presents an interesting step ahead in that it can operate on unstructured point clouds, achieving encouraging segmentation results.

3D Semantic Segmentation Segmentation

Keyframe-Based Visual-Inertial Online SLAM with Relocalization

no code implementations7 Feb 2017 Anton Kasyanov, Francis Engelmann, Jörg Stückler, Bastian Leibe

Our visual-inertial SLAM system is based on a real-time capable visual-inertial odometry method that provides locally consistent trajectory and map estimates.

Pose Tracking Simultaneous Localization and Mapping

Cannot find the paper you are looking for? You can Submit a new open access paper.