Visual Localization

151 papers with code • 5 benchmarks • 20 datasets

Visual Localization is the problem of estimating the camera pose of a given image relative to a visual representation of a known scene.

Source: Fine-Grained Segmentation Networks: Self-Supervised Segmentation for Improved Long-Term Visual Localization

Libraries

Use these libraries to find Visual Localization models and implementations

Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer

GeWu-Lab/Generalizable-Audio-Visual-Segmentation 13 Sep 2023

Never having seen an object and heard its sound simultaneously, can the model still accurately localize its visual position from the input audio?

7
13 Sep 2023

Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond

qwenlm/qwen-vl 24 Aug 2023

In this work, we introduce the Qwen-VL series, a set of large-scale vision-language models (LVLMs) designed to perceive and understand both texts and images.

3,738
24 Aug 2023

OFVL-MS: Once for Visual Localization across Multiple Indoor Scenes

mooncake199809/ufvl-net ICCV 2023

In this work, we seek to predict camera poses across scenes with a multi-task learning manner, where we view the localization of each scene as a new task.

9
23 Aug 2023

D2S: Representing local descriptors and global scene coordinates for camera relocalization

ais-lab/feat2map 28 Jul 2023

In this study, we propose a direct learning-based approach that utilizes a simple network named D2S to represent local descriptors and their scene coordinates.

43
28 Jul 2023

ResMatch: Residual Attention Learning for Local Feature Matching

acuooooo/resmatch 11 Jul 2023

In order to facilitate the learning of matching and filtering, we inject the similarity of descriptors and relative positions into cross- and self-attention score, respectively.

26
11 Jul 2023

LightGlue: Local Feature Matching at Light Speed

cvg/lightglue ICCV 2023

We introduce LightGlue, a deep neural network that learns to match local features across images.

2,990
23 Jun 2023

Yes, we CANN: Constrained Approximate Nearest Neighbors for local feature-based visual localization

google-research/google-research ICCV 2023

In this paper, we take a step back from this assumption and propose Constrained Approximate Nearest Neighbors (CANN), a joint solution of k-nearest-neighbors across both the geometry and appearance space using only local features.

32,870
15 Jun 2023

Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance

roylin1229/IIB_descriptor 13 May 2023

Existing binary descriptors may not perform well for long-term visual measurement tasks due to their sensitivity to illumination variations.

4
13 May 2023

Eiffel Tower: A Deep-Sea Underwater Dataset for Long-Term Visual Localization

clementinboittiaux/sfm-pipeline 9 May 2023

This paper presents a new deep-sea dataset to benchmark underwater long-term visual localization.

0
09 May 2023

Privacy-Preserving Representations are not Enough -- Recovering Scene Content from Camera Poses

kunalchelani/objectpositioningfromposes 8 May 2023

In this paper, we show that an attacker can learn about details of a scene without any access by simply querying a localization service.

1
08 May 2023