Visual Localization

151 papers with code • 5 benchmarks • 19 datasets

Visual Localization is the problem of estimating the camera pose of a given image relative to a visual representation of a known scene.

Source: Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization

Libraries

Use these libraries to find Visual Localization models and implementations

Latest papers with no code

PRAM: Place Recognition Anywhere Model for Efficient Visual Localization

no code yet • 11 Apr 2024

Humans localize themselves efficiently in known environments by first recognizing landmarks defined on certain objects and their spatial relationships, and then verifying the location by aligning detailed structures of recognized objects with those in the memory.

3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization

no code yet • 17 Mar 2024

This paper presents a novel system designed for 3D mapping and visual relocalization using 3D Gaussian Splatting.

The NeRFect Match: Exploring NeRF Features for Visual Localization

no code yet • 14 Mar 2024

Significantly, we introduce NeRFMatch, an advanced 2D-3D matching function that capitalizes on the internal knowledge of NeRF learned via view synthesis.

Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency

no code yet • 14 Feb 2024

After expanding the training set, we propose a training approach that leverages the specificities and the underlying geometry of this mix of real and synthetic images.

Semantic Object-level Modeling for Robust Visual Camera Relocalization

no code yet • 10 Feb 2024

Due to the improvement of CNN-based object detection algorithm, the robustness of visual relocalization is greatly enhanced especially in viewpoints where classical methods fail.

UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization

no code yet • 11 Jan 2024

Despite significant progress in global localization of Unmanned Aerial Vehicles (UAVs) in GPS-denied environments, existing methods remain constrained by the availability of datasets.

LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization

no code yet • 27 Dec 2023

We apply this approach to the domains of 2D image and 3D LiDAR points on the task of cross-modal localization.

PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields

no code yet • 17 Dec 2023

In this paper, we propose a novel visual localization framework, \ie, PNeRFLoc, based on a unified point-based representation.

Implicit Learning of Scene Geometry from Poses for Global Localization

no code yet • 4 Dec 2023

In this paper, we propose to utilize these minimal available labels (. i. e, poses) to learn the underlying 3D geometry of the scene and use the geometry to estimate the 6 DoF camera pose.

G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training

no code yet • 3 Dec 2023

G2D achieves superior performance across 6 medical imaging tasks and 25 diseases, particularly in semantic segmentation, which necessitates fine-grained, semantically-grounded image features.