Statewide Visual Geolocalization In The Wild
2024 Β· Florian Fervers, Sebastian Bullinger, Christoph Bodensteiner, et al.
Abstract
This work presents a method that is able to predict the geolocation of a street-view photo taken in the wild within a state-sized search region by matching against a database of aerial reference imagery. We partition the search region into geographical cells and train a model to map cells and corresponding photos into a joint embedding space that is used to perform retrieval at test time. The model utilizes aerial images for each cell at multiple levels-of-detail to provide sufficient information about the surrounding scene. We propose a novel layout of the search region with consistent cell resolutions that allows scaling to large geographical regions. Experiments demonstrate that the method successfully localizes 60.6% of all non-panoramic street-view photos uploaded to the crowd-sourcing platform Mapillary in the state of Massachusetts to within 50m of their ground-truth location. Source code is available at https://github.com/fferflo/statewide-visual-geolocalization.
Authors
(none)
Tags
Stats
Code
Related papers
- Just Zoom In: Cross-view Geo-localization Via Autoregressive Zooming (2026)0.00
- VIGOR: Cross-view Image Geo-localization Beyond One-to-one Retrieval (2020)21.49
- Coming Down To Earth: Satellite-to-street View Synthesis For Geo-localization (2021)16.28
- Where Is This? Video Geolocation Based On Neural Network Features (2018)0.00
- Cross-view Image Matching For Geo-localization In Urban Environments (2017)17.16
- From Street To Orbit: Training-free Cross-view Retrieval Via Location Semantics And LLM Guidance (2025)0.00
- Geo-localization Via Ground-to-satellite Cross-view Image Retrieval (2022)12.54
- G3: An Effective And Adaptive Framework For Worldwide Geolocalization Using Large Multi-modality Models (2024)3.58