Search Results for author: Marc Pollefeys

Found 240 papers, 96 papers with code

Handcrafted Outlier Detection Revisited

1 code implementation ECCV 2020 Luca Cavalli, Viktor Larsson, Martin Ralf Oswald, Torsten Sattler, Marc Pollefeys

As a result, outlier detection is a fundamental problem in computer vision and a wide range of approaches, from simple checks based on descriptor similarity to geometric verification, have been proposed over the last decades.

Outlier Detection Visual Localization

Calibration-free Structure-from-Motion with Calibrated Radial Trifocal Tensors

no code implementations ECCV 2020 Viktor Larsson, Nicolas Zobernig, Kasim Taskin, Marc Pollefeys

In this paper we consider the problem of Structure-from-Motion from images with unknown intrinsic calibration.

EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision

no code implementations3 Sep 2024 Yiming Zhao, Taein Kwon, Paul Streli, Marc Pollefeys, Christian Holz

Estimating touch contact and pressure in egocentric vision is a central task for downstream applications in Augmented Reality, Virtual Reality, as well as many robotic applications, because it provides precise physical insights into hand-object interaction and object manipulation.

Benchmarking Pose Estimation

Space3D-Bench: Spatial 3D Question Answering Benchmark

no code implementations29 Aug 2024 Emilia Szymanska, Mihai Dusmanu, Jan-Willem Buurlage, Mahdi Rad, Marc Pollefeys

Answering questions about the spatial properties of the environment poses challenges for existing language and vision foundation models due to a lack of understanding of the 3D world notably in terms of relationships between objects.

Question Answering

P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising

no code implementations29 Aug 2024 Mathias Vogel, Keisuke Tateno, Marc Pollefeys, Federico Tombari, Marie-Julie Rakotosaona, Francis Engelmann

In this work, we tackle the task of point cloud denoising through a novel framework that adapts Diffusion Schr\"odinger bridges to points clouds.

Denoising

Global Structure-from-Motion Revisited

1 code implementation29 Jul 2024 Linfei Pan, Dániel Baráth, Marc Pollefeys, Johannes L. Schönberger

Recovering 3D structure and camera motion from images has been a long-standing focus of computer vision research and is known as Structure-from-Motion (SfM).

16k

Learning Where to Look: Self-supervised Viewpoint Selection for Active Localization using Geometrical Information

1 code implementation22 Jul 2024 Luca Di Giammarino, Boyang Sun, Giorgio Grisetti, Marc Pollefeys, Hermann Blum, Daniel Barath

Our contributions involve using a data-driven approach with a simple architecture designed for real-time operation, a self-supervised data training method, and the capability to consistently integrate our map into a planning framework tailored for real-world robotics applications.

MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References

1 code implementation18 Jul 2024 Lukas Bösiger, Mihai Dusmanu, Marc Pollefeys, Zuria Bauer

Rendering realistic images from 3D reconstruction is an essential task of many Computer Vision and Robotics pipelines, notably for mixed-reality applications as well as training autonomous agents in simulated environments.

3D Reconstruction Mixed Reality +1

Learning to Make Keypoints Sub-Pixel Accurate

1 code implementation16 Jul 2024 Shinjeong Kim, Marc Pollefeys, Daniel Barath

This work addresses the challenge of sub-pixel accuracy in detecting 2D local features, a cornerstone problem in computer vision.

WildGaussians: 3D Gaussian Splatting in the Wild

1 code implementation11 Jul 2024 Jonas Kulhanek, Songyou Peng, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler

While the field of 3D scene reconstruction is dominated by NeRFs due to their photorealistic quality, 3D Gaussian Splatting (3DGS) has recently emerged, offering similar quality with real-time rendering speeds.

3D Scene Reconstruction

GLACE: Global Local Accelerated Coordinate Encoding

1 code implementation CVPR 2024 Fangjinhua Wang, Xudong Jiang, Silvano Galliani, Christoph Vogel, Marc Pollefeys

We propose GLACE, which integrates pre-trained global and local encodings and enables SCR to scale to large scenes with only a single small-sized network.

Camera Pose Estimation Pose Estimation +2

Dynamic 3D Gaussian Fields for Urban Areas

no code implementations5 Jun 2024 Tobias Fischer, Jonas Kulhanek, Samuel Rota Bulò, Lorenzo Porzi, Marc Pollefeys, Peter Kontschieder

We present an efficient neural 3D scene representation for novel-view synthesis (NVS) in large-scale, dynamic urban areas.

Mixed Reality Novel View Synthesis

OpenDAS: Domain Adaptation for Open-Vocabulary Segmentation

no code implementations30 May 2024 Gonca Yilmaz, Songyou Peng, Francis Engelmann, Marc Pollefeys, Hermann Blum

We, therefore, introduce a new task domain adaptation for open-vocabulary segmentation, enhancing VLMs with domain-specific priors while preserving their open-vocabulary nature.

3D Open-Vocabulary Instance Segmentation Domain Adaptation +2

NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild

1 code implementation CVPR 2024 Weining Ren, Zihan Zhu, Boyang Sun, Jiaqi Chen, Marc Pollefeys, Songyou Peng

Neural Radiance Fields (NeRFs) have shown remarkable success in synthesizing photorealistic views from multi-view images of static scenes, but face challenges in dynamic, real-world environments with distractors like moving objects, shadows, and lighting changes.

3D Neural Edge Reconstruction

no code implementations CVPR 2024 Lei LI, Songyou Peng, Zehao Yu, Shaohui Liu, Rémi Pautrat, Xiaochuan Yin, Marc Pollefeys

Real-world objects and environments are predominantly composed of edge features, including straight lines and curves.

Surface Reconstruction

NeRF in Robotics: A Survey

no code implementations2 May 2024 Guangming Wang, Lei Pan, Songyou Peng, Shaohui Liu, Chenfeng Xu, Yanzi Miao, Wei Zhan, Masayoshi Tomizuka, Marc Pollefeys, Hesheng Wang

Meticulous 3D environment representations have been a longstanding goal in computer vision and robotics fields.

Retrieval Robust to Object Motion Blur

1 code implementation27 Apr 2024 Rong Zou, Marc Pollefeys, Denys Rozumnyi

We propose a method for object retrieval in images that are affected by motion blur.

Object Retrieval

Efficient Solution of Point-Line Absolute Pose

1 code implementation CVPR 2024 Petr Hruby, Timothy Duff, Marc Pollefeys

We revisit certain problems of pose estimation based on 3D--2D correspondences between features which may be points or lines.

Pose Estimation

"Where am I?" Scene Retrieval with Language

no code implementations22 Apr 2024 Jiaqi Chen, Daniel Barath, Iro Armeni, Marc Pollefeys, Hermann Blum

As such, we need methods that interface between natural language and map representations of the environment.

Retrieval

Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds

no code implementations18 Apr 2024 Oliver Lemke, Zuria Bauer, René Zurbrügg, Marc Pollefeys, Francis Engelmann, Hermann Blum

This allows for accurate detection directly in 3D scenes, object- and environment-aware grasp prediction, as well as robust and repeatable robotic manipulation.

3D Instance Segmentation Pose Estimation +3

OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views

no code implementations4 Apr 2024 Francis Engelmann, Fabian Manhardt, Michael Niemeyer, Keisuke Tateno, Marc Pollefeys, Federico Tombari

Our OpenNeRF further leverages NeRF's ability to render novel views and extract open-set VLM features from areas that are not well observed in the initial posed images.

Image Segmentation Point Cloud Segmentation +2

SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

no code implementations30 Mar 2024 Yang Miao, Francis Engelmann, Olga Vysotska, Federico Tombari, Marc Pollefeys, Dániel Béla Baráth

We introduce a novel problem, i. e., the localization of an input image within a multi-modal reference map represented by a database of 3D scene graphs.

Visual Localization

F$^3$Loc: Fusion and Filtering for Floorplan Localization

no code implementations5 Mar 2024 Changan Chen, Rui Wang, Christoph Vogel, Marc Pollefeys

In this paper we propose an efficient data-driven solution to self-localization within a floorplan.

Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion

no code implementations CVPR 2024 Zuoyue Li, Zhenqiang Li, Zhaopeng Cui, Marc Pollefeys, Martin R. Oswald

Directly generating scenes from satellite imagery offers exciting possibilities for integration into applications like games and map services.

3D Generation Neural Rendering +2

EgoGen: An Egocentric Synthetic Data Generator

no code implementations CVPR 2024 Gen Li, Kaifeng Zhao, Siwei Zhang, Xiaozhong Lyu, Mihai Dusmanu, Yan Zhang, Marc Pollefeys, Siyu Tang

To address this challenge, we introduce EgoGen, a new synthetic data generator that can produce accurate and rich ground-truth training data for egocentric perception tasks.

Human Mesh Recovery Motion Synthesis

F3Loc: Fusion and Filtering for Floorplan Localization

no code implementations CVPR 2024 Changan Chen, Rui Wang, Christoph Vogel, Marc Pollefeys

In this paper we propose an efficient data-driven solution to self-localization within a floorplan.

Multiway Point Cloud Mosaicking with Diffusion and Global Optimization

1 code implementation CVPR 2024 Shengze Jin, Iro Armeni, Marc Pollefeys, Daniel Barath

We introduce a novel framework for multiway point cloud mosaicking (named Wednesday) designed to co-align sets of partially overlapping point clouds -- typically obtained from 3D scanners or moving RGB-D cameras -- into a unified coordinate system.

Denoising

MuRF: Multi-Baseline Radiance Fields

1 code implementation CVPR 2024 Haofei Xu, Anpei Chen, Yuedong Chen, Christos Sakaridis, Yulun Zhang, Marc Pollefeys, Andreas Geiger, Fisher Yu

We present Multi-Baseline Radiance Fields (MuRF), a general feed-forward approach to solving sparse view synthesis under multiple different baseline settings (small and large baselines, and different number of input views).

Zero-shot Generalization

ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction

no code implementations29 Nov 2023 Silvan Weder, Francis Engelmann, Johannes L. Schönberger, Akihito Seki, Marc Pollefeys, Martin R. Oswald

Using these main contributions, our method can enable scenarios with real-time constraints and can scale to arbitrary scene sizes by processing and updating the scene only in a local region defined by the new measurement.

3D Semantic Segmentation Mixed Reality

Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation

no code implementations29 Nov 2023 Yu Zheng, Guangming Wang, Jiuming Liu, Marc Pollefeys, Hesheng Wang

Through the hash-based representation, we propose the Spherical Frustum sparse Convolution (SFC) and Frustum Fast Point Sampling (F2PS) to convolve and sample the points stored in spherical frustums respectively.

Position Segmentation +1

Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change

no code implementations15 Nov 2023 Tao Sun, Yan Hao, Shengyu Huang, Silvio Savarese, Konrad Schindler, Marc Pollefeys, Iro Armeni

To this end, we introduce the Nothing Stands Still (NSS) benchmark, which focuses on the spatiotemporal registration of 3D scenes undergoing large spatial and temporal change, ultimately creating one coherent spatiotemporal map.

Point Cloud Registration

Leveraging Neural Radiance Fields for Uncertainty-Aware Visual Localization

no code implementations10 Oct 2023 Le Chen, Weirong Chen, Rui Wang, Marc Pollefeys

As a promising fashion for visual localization, scene coordinate regression (SCR) has seen tremendous progress in the past decade.

regression Visual Localization

Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation

no code implementations8 Oct 2023 Dominik Hollidt, Clinton Wang, Polina Golland, Marc Pollefeys

We present a novel approach to perform 3D semantic segmentation solely from 2D supervision by leveraging Neural Radiance Fields (NeRFs).

3D Semantic Segmentation Segmentation

Active Visual Localization for Multi-Agent Collaboration: A Data-Driven Approach

no code implementations4 Oct 2023 Matthew Hanlon, Boyang Sun, Marc Pollefeys, Hermann Blum

However, localizing e. g. a ground robot in the map of a drone or head-mounted MR headset presents unique challenges due to viewpoint changes.

Visual Localization

Q-REG: End-to-End Trainable Point Cloud Registration with Surface Curvature

no code implementations27 Sep 2023 Shengze Jin, Daniel Barath, Marc Pollefeys, Iro Armeni

Point cloud registration has seen recent success with several learning-based methods that focus on correspondence matching and, as such, optimize only for this objective.

Point Cloud Registration Pose Estimation

Handbook on Leveraging Lines for Two-View Relative Pose Estimation

no code implementations27 Sep 2023 Petr Hruby, Shaohui Liu, Rémi Pautrat, Marc Pollefeys, Daniel Barath

We propose an approach for estimating the relative pose between calibrated image pairs by jointly exploiting points, lines, and their coincidences in a hybrid manner.

Pose Estimation

Volumetric Semantically Consistent 3D Panoptic Mapping

1 code implementation26 Sep 2023 Yang Miao, Iro Armeni, Marc Pollefeys, Daniel Barath

We introduce an online 2D-to-3D semantic instance mapping algorithm aimed at generating comprehensive, accurate, and efficient semantic 3D maps suitable for autonomous agents in unstructured environments.

CaSAR: Contact-aware Skeletal Action Recognition

no code implementations17 Sep 2023 Junan Lin, Zhichao Sun, Enjie Cao, Taein Kwon, Mahdi Rad, Marc Pollefeys

Skeletal Action recognition from an egocentric view is important for applications such as interfaces in AR/VR glasses and human-robot interaction, where the device has limited resources.

Action Recognition

Learning Disentangled Avatars with Hybrid 3D Representations

no code implementations12 Sep 2023 Yao Feng, Weiyang Liu, Timo Bolkart, Jinlong Yang, Marc Pollefeys, Michael J. Black

Towards this end, both explicit and implicit 3D representations are heavily studied for a holistic modeling and capture of the whole human (e. g., body, clothing, face and hair), but neither representation is an optimal choice in terms of representation efficacy since different parts of the human avatar have different modeling desiderata.

Disentanglement

ResFields: Residual Neural Fields for Spatiotemporal Signals

1 code implementation6 Sep 2023 Marko Mihajlovic, Sergey Prokudin, Marc Pollefeys, Siyu Tang

Neural fields, a category of neural networks trained to represent high-frequency signals, have gained significant attention in recent years due to their impressive performance in modeling complex 3D data, such as signed distance (SDFs) or radiance fields (NeRFs), via a single multi-layer perceptron (MLP).

4D reconstruction Neural Rendering

Vanishing Point Estimation in Uncalibrated Images with Prior Gravity Direction

1 code implementation ICCV 2023 Rémi Pautrat, Shaohui Liu, Petr Hruby, Marc Pollefeys, Daniel Barath

We tackle the problem of estimating a Manhattan frame, i. e. three orthogonal vanishing points, and the unknown focal length of the camera, leveraging a prior vertical direction.

Quantification of Predictive Uncertainty via Inference-Time Sampling

no code implementations3 Aug 2023 Katarína Tóthová, Ľubor Ladický, Daniel Thul, Marc Pollefeys, Ender Konukoglu

Predictive variability due to data ambiguities has typically been addressed via construction of dedicated models with built-in probabilistic capabilities that are trained to predict uncertainty estimates as variables of interest.

AffineGlue: Joint Matching and Robust Estimation

no code implementations28 Jul 2023 Daniel Barath, Dmytro Mishkin, Luca Cavalli, Paul-Edouard Sarlin, Petr Hruby, Marc Pollefeys

Moreover, we derive a new minimal solver for homography estimation, requiring only a single affine correspondence (AC) and a gravity prior.

Homography Estimation

Consensus-Adaptive RANSAC

1 code implementation26 Jul 2023 Luca Cavalli, Daniel Barath, Marc Pollefeys, Viktor Larsson

The proposed attention mechanism and one-step transformer provide an adaptive behavior that enhances the performance of RANSAC, making it a more effective tool for robust estimation.

Lazy Visual Localization via Motion Averaging

no code implementations19 Jul 2023 Siyan Dong, Shaohui Liu, Hengkai Guo, Baoquan Chen, Marc Pollefeys

Visual (re)localization is critical for various applications in computer vision and robotics.

Visual Localization

Next-generation Surgical Navigation: Marker-less Multi-view 6DoF Pose Estimation of Surgical Instruments

no code implementations5 May 2023 Jonas Hein, Nicola Cavalcanti, Daniel Suter, Lukas Zingg, Fabio Carrillo, Lilian Calvet, Mazda Farshad, Marc Pollefeys, Nassir Navab, Philipp Fürnstahl

Third, we evaluate three state-of-the-art single-view and multi-view methods for the task of 6DoF pose estimation of surgical instruments and analyze the influence of camera configurations, training data, and occlusions on the pose accuracy and generalization ability.

Anatomy Pose Estimation

Learning-based Relational Object Matching Across Views

no code implementations3 May 2023 Cathrin Elich, Iro Armeni, Martin R. Oswald, Marc Pollefeys, Joerg Stueckler

Our approach compares favorably to previous state-of-the-art object-level matching approaches and achieves improved performance over a pure keypoint-based approach for large view-point changes.

Graph Neural Network Image Retrieval +3

SGAligner : 3D Scene Alignment with Scene Graphs

1 code implementation28 Apr 2023 Sayan Deb Sarkar, Ondrej Miksik, Marc Pollefeys, Daniel Barath, Iro Armeni

We propose SGAligner, the first method for aligning pairs of 3D scene graphs that is robust to in-the-wild scenarios (ie, unknown overlap -- if any -- and changes in the environment).

3D Scene Graph Alignment Contrastive Learning +2

Tracking by 3D Model Estimation of Unknown Objects in Videos

no code implementations ICCV 2023 Denys Rozumnyi, Jiri Matas, Marc Pollefeys, Vittorio Ferrari, Martin R. Oswald

We argue that this representation is limited and instead propose to guide and improve 2D tracking with an explicit object representation, namely the textured 3D shape and 6DoF pose in each video frame.

Object Segmentation +1

Human from Blur: Human Pose Tracking from Blurry Images

no code implementations ICCV 2023 Yiming Zhao, Denys Rozumnyi, Jie Song, Otmar Hilliges, Marc Pollefeys, Martin R. Oswald

The key idea is to tackle the inverse problem of image deblurring by modeling the forward problem with a 3D human model, a texture map, and a sequence of poses to describe human motion.

Deblurring Image Deblurring +2

3D Line Mapping Revisited

1 code implementation CVPR 2023 Shaohui Liu, Yifan Yu, Rémi Pautrat, Marc Pollefeys, Viktor Larsson

In contrast to sparse keypoints, a handful of line segments can concisely encode the high-level scene layout, as they often delineate the main structural elements.

Visual Localization

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

1 code implementation ICCV 2023 Jiuming Liu, Guangming Wang, Zhe Liu, Chaokang Jiang, Marc Pollefeys, Hesheng Wang

Specifically, a projection-aware hierarchical transformer is proposed to capture long-range dependencies and filter outliers by extracting point features globally.

Point Cloud Registration

NICER-SLAM: Neural Implicit Scene Encoding for RGB SLAM

no code implementations7 Feb 2023 Zihan Zhu, Songyou Peng, Viktor Larsson, Zhaopeng Cui, Martin R. Oswald, Andreas Geiger, Marc Pollefeys

Neural implicit representations have recently become popular in simultaneous localization and mapping (SLAM), especially in dense visual SLAM.

3D Scene Reconstruction Novel View Synthesis +2

Privacy Preserving Localization via Coordinate Permutations

no code implementations ICCV 2023 Linfei Pan, Johannes L. Schönberger, Viktor Larsson, Marc Pollefeys

Recent methods on privacy-preserving image-based localization use a random line parameterization to protect the privacy of query images and database maps.

Image-Based Localization Pose Estimation +1

SGAligner: 3D Scene Alignment with Scene Graphs

no code implementations ICCV 2023 Sayan Deb Sarkar, Ondrej Miksik, Marc Pollefeys, Daniel Barath, Iro Armeni

We propose SGAligner, the first method for aligning pairs of 3D scene graphs that is robust to in-the-wild scenarios (i. e., unknown overlap - if any - and changes in the environment).

Contrastive Learning Knowledge Graphs

Guiding Local Feature Matching with Surface Curvature

no code implementations ICCV 2023 Shuzhe Wang, Juho Kannala, Marc Pollefeys, Daniel Barath

We propose a new method, named curvature similarity extractor (CSE), for improving local feature matching across images.

Depth Estimation Depth Prediction

DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients

1 code implementation CVPR 2023 Rémi Pautrat, Daniel Barath, Viktor Larsson, Martin R. Oswald, Marc Pollefeys

Their learned counterparts are more repeatable and can handle challenging images, but at the cost of a lower accuracy and a bias towards wireframe lines.

Line Detection Line Segment Detection

VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction

1 code implementation CVPR 2023 Yufan Ren, Fangjinhua Wang, Tong Zhang, Marc Pollefeys, Sabine Süsstrunk

The success of the Neural Radiance Fields (NeRF) in novel view synthesis has inspired researchers to propose neural implicit scene reconstruction.

Novel View Synthesis

LaMAR: Benchmarking Localization and Mapping for Augmented Reality

1 code implementation19 Oct 2022 Paul-Edouard Sarlin, Mihai Dusmanu, Johannes L. Schönberger, Pablo Speciale, Lukas Gruber, Viktor Larsson, Ondrej Miksik, Marc Pollefeys

To close this gap, we introduce LaMAR, a new benchmark with a comprehensive capture and GT pipeline that co-registers realistic trajectories and sensor streams captured by heterogeneous AR devices in large, unconstrained scenes.

Benchmarking Diversity

NeuralMeshing: Differentiable Meshing of Implicit Neural Representations

no code implementations5 Oct 2022 Mathias Vetsch, Sandro Lombardi, Marc Pollefeys, Martin R. Oswald

The generation of triangle meshes from point clouds, i. e. meshing, is a core task in computer graphics and computer vision.

Capturing and Animation of Body and Clothing from Monocular Video

1 code implementation4 Oct 2022 Yao Feng, Jinlong Yang, Marc Pollefeys, Michael J. Black, Timo Bolkart

Building on this insight, we propose SCARF (Segmented Clothed Avatar Radiance Field), a hybrid model combining a mesh-based body with a neural radiance field.

Virtual Try-on

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

1 code implementation ICCV 2023 Weicai Ye, Shuo Chen, Chong Bao, Hujun Bao, Marc Pollefeys, Zhaopeng Cui, Guofeng Zhang

Existing inverse rendering combined with neural rendering methods can only perform editable novel view synthesis on object-specific scenes, while we present intrinsic neural radiance fields, dubbed IntrinsicNeRF, which introduce intrinsic decomposition into the NeRF-based neural rendering method and can extend its application to room-scale scenes.

Clustering Inverse Rendering +2

Learning-Based Dimensionality Reduction for Computing Compact and Effective Local Feature Descriptors

1 code implementation27 Sep 2022 Hao Dong, Xieyuanli Chen, Mihai Dusmanu, Viktor Larsson, Marc Pollefeys, Cyrill Stachniss

A distinctive representation of image patches in form of features is a key component of many computer vision and robotics tasks, such as image matching, image retrieval, and visual localization.

Dimensionality Reduction Image Retrieval +2

Visual Localization via Few-Shot Scene Region Classification

1 code implementation14 Aug 2022 Siyan Dong, Shuzhe Wang, Yixin Zhuang, Juho Kannala, Marc Pollefeys, Baoquan Chen

Visual (re)localization addresses the problem of estimating the 6-DoF (Degree of Freedom) camera pose of a query image captured in a known scene, which is a key building block of many computer vision and robotics applications.

General Classification Memorization +2

CompNVS: Novel View Synthesis with Scene Completion

no code implementations23 Jul 2022 Zuoyue Li, Tianxing Fan, Zhenqiang Li, Zhaopeng Cui, Yoichi Sato, Marc Pollefeys, Martin R. Oswald

We introduce a scalable framework for novel view synthesis from RGB-D images with largely incomplete scene coverage.

Novel View Synthesis Scene Understanding

NeFSAC: Neurally Filtered Minimal Samples

1 code implementation16 Jul 2022 Luca Cavalli, Marc Pollefeys, Daniel Barath

We tested NeFSAC on more than 100k image pairs from three publicly available real-world datasets and found that it leads to one order of magnitude speed-up, while often finding more accurate results than USAC alone.

Autonomous Driving Pose Estimation

Context-Aware Sequence Alignment using 4D Skeletal Augmentation

1 code implementation CVPR 2022 Taein Kwon, Bugra Tekin, Siyu Tang, Marc Pollefeys

Temporal alignment of fine-grained human actions in videos is important for numerous applications in computer vision, robotics, and mixed reality.

Hand Pose Estimation Mixed Reality +1

Spatial Computing and Intuitive Interaction: Bringing Mixed Reality and Robotics Together

no code implementations3 Feb 2022 Jeffrey Delmerico, Roi Poranne, Federica Bogo, Helen Oleynikova, Eric Vollenweider, Stelian Coros, Juan Nieto, Marc Pollefeys

Spatial computing -- the ability of devices to be aware of their surroundings and to represent this digitally -- offers novel capabilities in human-robot interaction.

Mixed Reality

Learning To Find Good Models in RANSAC

no code implementations CVPR 2022 Daniel Barath, Luca Cavalli, Marc Pollefeys

We propose the Model Quality Network, MQ-Net in short, for predicting the quality, e. g. the pose error of essential matrices, of models generated inside RANSAC.

Pose Estimation

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

1 code implementation CVPR 2022 Zihan Zhu, Songyou Peng, Viktor Larsson, Weiwei Xu, Hujun Bao, Zhaopeng Cui, Martin R. Oswald, Marc Pollefeys

Neural implicit representations have recently shown encouraging results in various domains, including promising progress in simultaneous localization and mapping (SLAM).

Simultaneous Localization and Mapping

EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices

1 code implementation14 Dec 2021 Siwei Zhang, Qianli Ma, Yan Zhang, Zhiyin Qian, Taein Kwon, Marc Pollefeys, Federica Bogo, Siyu Tang

Key to reasoning about interactions is to understand the body pose and motion of the interaction partner from the egocentric view.

Motion Estimation

LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human Bodies

no code implementations30 Nov 2021 Sandro Lombardi, Bangbang Yang, Tianxing Fan, Hujun Bao, Guofeng Zhang, Marc Pollefeys, Zhaopeng Cui

In this work, we propose a novel neural implicit representation for the human body, which is fully differentiable and optimizable with disentangled shape and pose latent spaces.

3D Reconstruction motion retargeting +1

Learning to Align Sequential Actions in the Wild

no code implementations CVPR 2022 Weizhe Liu, Bugra Tekin, Huseyin Coskun, Vibhav Vineet, Pascal Fua, Marc Pollefeys

To this end, we propose an approach to enforce temporal priors on the optimal transport matrix, which leverages temporal consistency, while allowing for variations in the order of actions.

Representation Learning

Non-local Recurrent Regularization Networks for Multi-view Stereo

no code implementations13 Oct 2021 Qingshan Xu, Martin R. Oswald, Wenbing Tao, Marc Pollefeys, Zhaopeng Cui

However, existing recurrent methods only model the local dependencies in the depth domain, which greatly limits the capability of capturing the global scene context along the depth dimension.

Depth Estimation

Reconstructing and grounding narrated instructional videos in 3D

no code implementations9 Sep 2021 Dimitri Zhukov, Ignacio Rocco, Ivan Laptev, Josef Sivic, Johannes L. Schönberger, Bugra Tekin, Marc Pollefeys

Contrary to the standard scenario of instance-level 3D reconstruction, where identical objects or scenes are present in all views, objects in different instructional videos may have large appearance variations given varying conditions and versions of the same product.

3D Reconstruction

Learning Motion Priors for 4D Human Body Capture in 3D Scenes

1 code implementation ICCV 2021 Siwei Zhang, Yan Zhang, Federica Bogo, Marc Pollefeys, Siyu Tang

To prove the effectiveness of the proposed motion priors, we combine them into a novel pipeline for 4D human body capture in 3D scenes.

Friction

Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects

1 code implementation NeurIPS 2021 Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Marc Pollefeys

We address the novel task of jointly reconstructing the 3D shape, texture, and motion of an object from a single motion-blurred image.

Deblurring Object +2

Shape As Points: A Differentiable Poisson Solver

2 code implementations NeurIPS 2021 Songyou Peng, Chiyu "Max" Jiang, Yiyi Liao, Michael Niemeyer, Marc Pollefeys, Andreas Geiger

However, the implicit nature of neural implicit representations results in slow inference time and requires careful initialization.

3D Reconstruction Surface Reconstruction

Towards Efficient Graph Convolutional Networks for Point Cloud Handling

no code implementations ICCV 2021 Yawei Li, He Chen, Zhaopeng Cui, Radu Timofte, Marc Pollefeys, Gregory Chirikjian, Luc van Gool

In this paper, we aim at improving the computational efficiency of graph convolutional networks (GCNs) for learning on point clouds.

Computational Efficiency

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose

2 code implementations CVPR 2021 Paul-Edouard Sarlin, Ajaykumar Unagar, Måns Larsson, Hugo Germain, Carl Toft, Viktor Larsson, Marc Pollefeys, Vincent Lepetit, Lars Hammarstrand, Fredrik Kahl, Torsten Sattler

In this paper, we go Back to the Feature: we argue that deep networks should focus on learning robust and invariant visual features, while the geometric estimation should be left to principled algorithms.

3D geometry Camera Localization +3

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

1 code implementation CVPR 2021 Cheng Zhang, Zhaopeng Cui, yinda zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu

We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine the 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features.

 Ranked #1 on Monocular 3D Object Detection on SUN RGB-D (using extra training data)

3D Shape Reconstruction Graph Neural Network +5

Localizing Unsynchronized Sensors with Unknown Sources

1 code implementation6 Feb 2021 Dalia El Badawy, Viktor Larsson, Marc Pollefeys, Ivan Dokmanić

We look at the general case where neither the emission times of the sources nor the reference time frames of the receivers are known.

Orthographic-Perspective Epipolar Geometry

no code implementations ICCV 2021 Viktor Larsson, Marc Pollefeys, Magnus Oskarsson

In this paper we consider the epipolar geometry between orthographic and perspective cameras.

Camera Calibration

The Card Shuffling Hypotheses: Building a Time and Memory Efficient Graph Convolutional Network

no code implementations1 Jan 2021 Yawei Li, He Chen, Zhaopeng Cui, Radu Timofte, Marc Pollefeys, Gregory Chirikjian, Luc van Gool

State-of-the-art GCNs adopt $K$-nearest neighbor (KNN) searches for local feature aggregation and feature extraction operations from layer to layer.

3D Classification Point Cloud Classification +2

DeepSurfels: Learning Online Appearance Fusion

1 code implementation CVPR 2021 Marko Mihajlovic, Silvan Weder, Marc Pollefeys, Martin R. Oswald

We present DeepSurfels, a novel hybrid scene representation for geometry and appearance information.

CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth

no code implementations18 Dec 2020 Xingxing Zuo, Nathaniel Merrill, Wei Li, Yong liu, Marc Pollefeys, Guoquan Huang

In this work, we present a lightweight, tightly-coupled deep depth network and visual-inertial odometry (VIO) system, which can provide accurate state estimates and dense depth maps of the immediate surroundings.

Depth Estimation Depth Prediction +1

FMODetect: Robust Detection of Fast Moving Objects

1 code implementation ICCV 2021 Denys Rozumnyi, Jiri Matas, Filip Sroubek, Marc Pollefeys, Martin R. Oswald

Compared to other methods, such as deblatting, the inference is of several orders of magnitude faster and allows applications such as real-time fast moving object detection and retrieval in large video collections.

Deblurring Image Matting +3

Sat2Vid: Street-view Panoramic Video Synthesis from a Single Satellite Image

no code implementations ICCV 2021 Zuoyue Li, Zhenqiang Li, Zhaopeng Cui, Rongjun Qin, Marc Pollefeys, Martin R. Oswald

For geometrical and temporal consistency, our approach explicitly creates a 3D point cloud representation of the scene and maintains dense 3D-2D correspondences across frames that reflect the geometric scene configuration inferred from the satellite view.

Image Generation

DeepVideoMVS: Multi-View Stereo on Video with Recurrent Spatio-Temporal Fusion

1 code implementation CVPR 2021 Arda Düzçeker, Silvano Galliani, Christoph Vogel, Pablo Speciale, Mihai Dusmanu, Marc Pollefeys

We propose an online multi-view depth prediction approach on posed video streams, where the scene geometry information computed in the previous time steps is propagated to the current time step in an efficient and geometrically plausible way.

Decoder Depth Estimation +1

Cross-Descriptor Visual Localization and Mapping

1 code implementation ICCV 2021 Mihai Dusmanu, Ondrej Miksik, Johannes L. Schönberger, Marc Pollefeys

Visual localization and mapping is the key technology underlying the majority of mixed reality and robotics systems.

Mixed Reality Visual Localization

DeFMO: Deblurring and Shape Recovery of Fast Moving Objects

5 code implementations CVPR 2021 Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Jiri Matas, Marc Pollefeys

We propose a method that, given a single image with its estimated background, outputs the object's appearance and position in a series of sub-frames as if captured by a high-speed camera (i. e. temporal super-resolution).

Deblurring Object Tracking +1

NeuralFusion: Online Depth Fusion in Latent Space

1 code implementation CVPR 2021 Silvan Weder, Johannes L. Schönberger, Marc Pollefeys, Martin R. Oswald

We present a novel online depth map fusion approach that learns depth map aggregation in a latent feature space.

Freetures: Localization in Signed Distance Function Maps

no code implementations19 Oct 2020 Alexander Millane, Helen Oleynikova, Christian Lanegger, Jeff Delmerico, Juan Nieto, Roland Siegwart, Marc Pollefeys, Cesar Cadena

Localization of a robotic system within a previously mapped environment is important for reducing estimation drift and for reusing previously built maps.

Robotics

Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors

no code implementations8 Oct 2020 Cathrin Elich, Martin R. Oswald, Marc Pollefeys, Joerg Stueckler

Our approach learns to decompose images of synthetic scenes with multiple objects on a planar surface into its constituent scene objects and to infer their 3D properties from a single view.

Decision Making Scene Understanding +1

Semi-Supervised Learning of Multi-Object 3D Scene Representations

no code implementations28 Sep 2020 Cathrin Elich, Martin R. Oswald, Marc Pollefeys, Joerg Stueckler

By differentiable rendering, we train our model to decompose scenes self-supervised from RGB-D images.

Decision Making Object +1

Self-Supervised Learning of Non-Rigid Residual Flow and Ego-Motion

no code implementations22 Sep 2020 Ivan Tishchenko, Sandro Lombardi, Martin R. Oswald, Marc Pollefeys

Most of the current scene flow methods choose to model scene flow as a per point translation vector without differentiating between static and dynamic components of 3D motion.

Self-Supervised Learning Translation

HoloLens 2 Research Mode as a Tool for Computer Vision Research

2 code implementations25 Aug 2020 Dorin Ungureanu, Federica Bogo, Silvano Galliani, Pooja Sama, Xin Duan, Casey Meekhof, Jan Stühmer, Thomas J. Cashman, Bugra Tekin, Johannes L. Schönberger, Pawel Olszta, Marc Pollefeys

Mixed reality headsets, such as the Microsoft HoloLens 2, are powerful sensing devices with integrated compute capabilities, which makes it an ideal platform for computer vision research.

Mixed Reality

LIC-Fusion 2.0: LiDAR-Inertial-Camera Odometry with Sliding-Window Plane-Feature Tracking

no code implementations17 Aug 2020 Xingxing Zuo, Yulin Yang, Patrick Geneva, Jiajun Lv, Yong liu, Guoquan Huang, Marc Pollefeys

Only the tracked planar points belonging to the same plane will be used for plane initialization, which makes the plane extraction efficient and robust.

Robotics

KAPLAN: A 3D Point Descriptor for Shape Completion

no code implementations31 Jul 2020 Audrey Richard, Ian Cherabier, Martin R. Oswald, Marc Pollefeys, Konrad Schindler

We present a novel 3D shape completion method that operates directly on unstructured point clouds, thus avoiding resource-intensive data structures like voxel grids.

3D Shape Reconstruction

Infrastructure-based Multi-Camera Calibration using Radial Projections

1 code implementation ECCV 2020 Yukai Lin, Viktor Larsson, Marcel Geppert, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler

In particular, our approach is more robust than the naive approach of first estimating intrinsic parameters and pose per camera before refining the extrinsic parameters of the system.

Camera Calibration Self-Driving Cars +1

AdaLAM: Revisiting Handcrafted Outlier Detection

3 code implementations7 Jun 2020 Luca Cavalli, Viktor Larsson, Martin Ralf Oswald, Torsten Sattler, Marc Pollefeys

Local feature matching is a critical component of many computer vision pipelines, including among others Structure-from-Motion, SLAM, and Visual Localization.

Key Point Matching Outlier Detection +1

OmniSLAM: Omnidirectional Localization and Dense Mapping for Wide-baseline Multi-camera Systems

no code implementations18 Mar 2020 Changhee Won, Hochang Seok, Zhaopeng Cui, Marc Pollefeys, Jongwoo Lim

In this paper, we present an omnidirectional localization and dense mapping system for a wide-baseline multiview stereo setup with ultra-wide field-of-view (FOV) fisheye cameras, which has a 360 degrees coverage of stereo observations of the environment.

Depth Estimation Visual Odometry

Multi-View Optimization of Local Feature Geometry

1 code implementation ECCV 2020 Mihai Dusmanu, Johannes L. Schönberger, Marc Pollefeys

In this work, we address the problem of refining the geometry of local image features from multiple views without known scene or camera geometry.

Camera Localization

Convolutional Occupancy Networks

6 code implementations ECCV 2020 Songyou Peng, Michael Niemeyer, Lars Mescheder, Marc Pollefeys, Andreas Geiger

Recently, implicit neural representations have gained popularity for learning-based 3D reconstruction.

3D Reconstruction

Self-Supervised Linear Motion Deblurring

1 code implementation10 Feb 2020 Peidong Liu, Joel Janai, Marc Pollefeys, Torsten Sattler, Andreas Geiger

Motion blurry images challenge many computer vision algorithms, e. g, feature detection, motion estimation, or object recognition.

Deblurring Image Deblurring +4

Aerial Single-View Depth Completion with Image-Guided Uncertainty Estimation

2 code implementations17 Jan 2020 Lucas Teixeira, Martin R. Oswald, Marc Pollefeys, Margarita Chli

In this paper, we propose a depth completion and uncertainty estimation approach that better handles the challenges of aerial platforms, such as large viewpoint and depth variations, and limited computing resources.

Depth Completion Monocular Depth Estimation +1

Learned Multi-View Texture Super-Resolution

no code implementations14 Jan 2020 Audrey Richard, Ian Cherabier, Martin R. Oswald, Vagia Tsiminaki, Marc Pollefeys, Konrad Schindler

We present a super-resolution method capable of creating a high-resolution texture map for a virtual 3D object from a set of lower-resolution images of that object.

Decoder Image Super-Resolution