TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Photo geolocation estimation	GWS15k	GeoDecoder	Street level (1 km)	0.7	# 1
Photo geolocation estimation	GWS15k	GeoDecoder	City level (25 km)	1.5	# 3
Photo geolocation estimation	GWS15k	GeoDecoder	Region level (200 km)	8.7	# 3
Photo geolocation estimation	GWS15k	GeoDecoder	Country level (750 km)	26.9	# 3
Photo geolocation estimation	GWS15k	GeoDecoder	Continent level (2500 km)	50.5	# 3
Photo geolocation estimation	Im2GPS3k	GeoDecoder	Street level (1 km)	12.8	# 2
Photo geolocation estimation	Im2GPS3k	GeoDecoder	City level (25 km)	33.5	# 3
Photo geolocation estimation	Im2GPS3k	GeoDecoder	Region level (200 km)	45.9	# 4
Photo geolocation estimation	Im2GPS3k	GeoDecoder	Country level (750 km)	61.0	# 4
Photo geolocation estimation	Im2GPS3k	GeoDecoder	Continent level (2500 km)	76.1	# 5
Photo geolocation estimation	Im2GPS3k	GeoDecoder	Training Images	4.7M	# 5
Photo geolocation estimation	YFCC26k	GeoDecoder	Street level (1 km)	10.1	# 3
Photo geolocation estimation	YFCC26k	GeoDecoder	City level (25 km)	23.9	# 2
Photo geolocation estimation	YFCC26k	GeoDecoder	Region level (200 km)	34.1	# 3
Photo geolocation estimation	YFCC26k	GeoDecoder	Country level (750 km)	49.6	# 3
Photo geolocation estimation	YFCC26k	GeoDecoder	Continent level (2500 km)	69.0	# 3
Photo geolocation estimation	YFCC26k	GeoDecoder	Training Images	4.7M	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/where-we-are-and-what-we-re-looking-at-query/photo-geolocation-estimation-on-gws15k)](https://paperswithcode.com/sota/photo-geolocation-estimation-on-gws15k?p=where-we-are-and-what-we-re-looking-at-query)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/where-we-are-and-what-we-re-looking-at-query/photo-geolocation-estimation-on-im2gps3k)](https://paperswithcode.com/sota/photo-geolocation-estimation-on-im2gps3k?p=where-we-are-and-what-we-re-looking-at-query)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/where-we-are-and-what-we-re-looking-at-query/photo-geolocation-estimation-on-yfcc26k)](https://paperswithcode.com/sota/photo-geolocation-estimation-on-yfcc26k?p=where-we-are-and-what-we-re-looking-at-query)`

Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes

CVPR 2023 · Brandon Clark, Alec Kerrigan, Parth Parag Kulkarni, Vicente Vivanco Cepeda, Mubarak Shah ·

Determining the exact latitude and longitude that a photo was taken is a useful and widely applicable task, yet it remains exceptionally difficult despite the accelerated progress of other computer vision tasks. Most previous approaches have opted to learn a single representation of query images, which are then classified at different levels of geographic granularity. These approaches fail to exploit the different visual cues that give context to different hierarchies, such as the country, state, and city level. To this end, we introduce an end-to-end transformer-based architecture that exploits the relationship between different geographic levels (which we refer to as hierarchies) and the corresponding visual scene information in an image through hierarchical cross-attention. We achieve this by learning a query for each geographic hierarchy and scene type. Furthermore, we learn a separate representation for different environmental scenes, as different scenes in the same location are often defined by completely different visual features. We achieve state of the art street level accuracy on 4 standard geo-localization datasets : Im2GPS, Im2GPS3k, YFCC4k, and YFCC26k, as well as qualitatively demonstrate how our method learns different representations for different visual hierarchies and scenes, which has not been demonstrated in the previous methods. These previous testing datasets mostly consist of iconic landmarks or images taken from social media, which makes them either a memorization task, or biased towards certain places. To address this issue we introduce a much harder testing dataset, Google-World-Streets-15k, comprised of images taken from Google Streetview covering the whole planet and present state of the art results. Our code will be made available in the camera-ready version.

PDF Abstract CVPR 2023 PDF CVPR 2023 Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Image-Based Localization

Memorization

Photo geolocation estimation

Datasets

Places

Results from the Paper

Edit

Ranked #1 on Photo geolocation estimation on GWS15k

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Photo geolocation estimation	GWS15k	GeoDecoder	Street level (1 km)	0.7	# 1	Compare
			City level (25 km)	1.5	# 3	Compare
			Region level (200 km)	8.7	# 3	Compare
			Country level (750 km)	26.9	# 3	Compare
			Continent level (2500 km)	50.5	# 3	Compare
Photo geolocation estimation	Im2GPS3k	GeoDecoder	Street level (1 km)	12.8	# 2	Compare
			City level (25 km)	33.5	# 3	Compare
			Region level (200 km)	45.9	# 4	Compare
			Country level (750 km)	61.0	# 4	Compare
			Continent level (2500 km)	76.1	# 5	Compare
			Training Images	4.7M	# 5	Compare
Photo geolocation estimation	YFCC26k	GeoDecoder	Street level (1 km)	10.1	# 3	Compare
			City level (25 km)	23.9	# 2	Compare
			Region level (200 km)	34.1	# 3	Compare
			Country level (750 km)	49.6	# 3	Compare
			Continent level (2500 km)	69.0	# 3	Compare
			Training Images	4.7M	# 2	Compare

Methods

Add Remove

fail

Edit Social Preview

Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove