VICI: Vlm-instructed Cross-view Image-localisation
2025 Β· Xiaohan Zhang, Tavis Shore, Chen Chen, et al.
Abstract
In this paper, we present a high-performing solution to the UAVM 2025 Challenge, which focuses on matching narrow FOV street-level images to corresponding satellite imagery using the University-1652 dataset. As panoramic Cross-View Geo-Localisation nears peak performance, it becomes increasingly important to explore more practical problem formulations. Real-world scenarios rarely offer panoramic street-level queries; instead, queries typically consist of limited-FOV images captured with unknown camera parameters. Our work prioritises discovering the highest achievable performance under these constraints, pushing the limits of existing architectures. Our method begins by retrieving candidate satellite image embeddings for a given query, followed by a re-ranking stage that selectively enhances retrieval accuracy within the top candidates. This two-stage approach enables more precise matching, even under the significant viewpoint and scale variations inherent in the task. Through experime
Authors
(none)
Tags
Stats
Related papers
- From Street To Orbit: Training-free Cross-view Retrieval Via Location Semantics And LLM Guidance (2025)0.00
- BEV-CV: Birds-eye-view Transform For Cross-view Geo-localisation (2023)5.84
- VIGOR: Cross-view Image Geo-localization Beyond One-to-one Retrieval (2020)21.49
- Cross-view Image Matching For Geo-localization In Urban Environments (2017)17.16
- Geo-localization Via Ground-to-satellite Cross-view Image Retrieval (2022)12.54
- Just Zoom In: Cross-view Geo-localization Via Autoregressive Zooming (2026)0.00
- Cross-view Image Geo-localization With Panorama-bev Co-retrieval Network (2024)13.94
- Localizing And Orienting Street Views Using Overhead Imagery (2016)17.26