VIGOR: Cross-view Image Geo-localization Beyond One-to-one Retrieval
2020 · Sijie Zhu, Taojiannan Yang, Chen Chen
Abstract
Cross-view image geo-localization aims to determine the locations of street-view query images by matching with GPS-tagged reference images from aerial view. Recent works have achieved surprisingly high retrieval accuracy on city-scale datasets. However, these results rely on the assumption that there exists a reference image exactly centered at the location of any query image, which is not applicable for practical scenarios. In this paper, we redefine this problem with a more realistic assumption that the query image can be arbitrary in the area of interest and the reference images are captured before the queries emerge. This assumption breaks the one-to-one retrieval setting of existing datasets as the queries and reference images are not perfectly aligned pairs, and there may be multiple reference images covering one query location. To bridge the gap between this realistic setting and existing datasets, we propose a new large-scale benchmark -- VIGOR -- for cross-View Image Geo-local
Related papers
- Cross-view Image Matching For Geo-localization In Urban Environments (2017) · 17.16
- Just Zoom In: Cross-view Geo-localization Via Autoregressive Zooming (2026) · 0.00
- Cross-view Image Geo-localization With Panorama-BEV Co-retrieval Network (2024) · 13.94
- VICI: VLM-instructed Cross-view Image-localisation (2025) · 2.51
- Geo-localization Via Ground-to-satellite Cross-view Image Retrieval (2022) · 12.54
- BEV-CV: Birds-eye-view Transform For Cross-view Geo-localisation (2023) · 5.84
- C-BEV: Contrastive Bird's Eye View Training For Cross-view Image Retrieval And 3-DoF Pose Estimation (2023) · 0.00
- Coming Down To Earth: Satellite-to-street View Synthesis For Geo-localization (2021) · 16.28