CLNet: Cross-view Correspondence Makes A Stronger Geo-localizationer
2025 · Xianwei Cao, Dou Quan, Shuang Wang, et al.
Abstract
Image retrieval-based cross-view geo-localization (IRCVGL) aims to match images captured from significantly different viewpoints, such as satellite and street-level images. Existing methods predominantly rely on learning robust global representations or implicit feature alignment, which often fail to model explicit spatial correspondences crucial for accurate localization. In this work, we propose a novel correspondence-aware feature refinement framework, termed CLNet, that explicitly bridges the semantic and geometric gaps between different views. CLNet decomposes the view alignment process into three learnable and complementary modules: a Neural Correspondence Map (NCM) that spatially aligns cross-view features via latent correspondence fields; a Nonlinear Embedding Converter (NEC) that remaps features across perspectives using an MLP-based transformation; and a Global Feature Recalibration (GFR) module that reweights informative feature channels guided by learned spatial cues. The p
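The three modules named in the abstract can be pictured with a small NumPy sketch. This is only an illustration under loud assumptions, not the authors' implementation: the abstract does not specify layer shapes, losses, or how the modules are wired together, so the soft-attention alignment standing in for NCM, the two-layer ReLU MLP standing in for NEC, the sigmoid channel gate standing in for GFR, and all function names and tensor sizes below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def correspondence_map(query, reference):
    """NCM stand-in (assumption): soft-align reference locations to each query location.

    query: (Nq, C), reference: (Nr, C); rows are spatial locations.
    """
    scores = query @ reference.T / np.sqrt(query.shape[1])  # (Nq, Nr)
    weights = softmax(scores, axis=1)  # each row sums to 1
    return weights @ reference  # (Nq, C)


def embedding_converter(features, w1, w2):
    """NEC stand-in (assumption): two-layer MLP with ReLU, applied per location."""
    return np.maximum(features @ w1, 0.0) @ w2


def recalibrate(features, spatial_cue):
    """GFR stand-in (assumption): sigmoid channel gate from spatially weighted pooling."""
    pooled = (spatial_cue[:, None] * features).sum(axis=0)  # (C,)
    gate = 1.0 / (1.0 + np.exp(-pooled))  # values in (0, 1)
    return features * gate


# Toy inputs: 6 street-view locations and 10 satellite locations, 8 channels each.
street = rng.normal(size=(6, 8))
satellite = rng.normal(size=(10, 8))
w1 = rng.normal(size=(8, 16)) * 0.1
w2 = rng.normal(size=(16, 8)) * 0.1

aligned = correspondence_map(street, satellite)  # (6, 8)
converted = embedding_converter(aligned, w1, w2)  # (6, 8)
cue = softmax(np.linalg.norm(converted, axis=1), axis=0)  # (6,), sums to 1
refined = recalibrate(converted, cue)  # (6, 8)
```

In this toy chain the spatial cue is simply a softmax over per-location feature norms; in a trained model it would be learned, as the abstract indicates.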
Related papers
- Cross-view Geo-localization, Image Retrieval, Multiscale Geometric Modeling, Frequency Domain Enhancement (2026) 0.00
- Just Zoom In: Cross-view Geo-localization Via Autoregressive Zooming (2026) 0.00
- Cross-view Image Matching For Geo-localization In Urban Environments (2017) 17.16
- BEV-CV: Birds-eye-view Transform For Cross-view Geo-localisation (2023) 5.84
- Geo-localization Via Ground-to-satellite Cross-view Image Retrieval (2022) 12.54
- Multiview Image-based Localization (2025) 0.00
- From Street To Orbit: Training-free Cross-view Retrieval Via Location Semantics And LLM Guidance (2025) 0.00
- Cross-view Image Geo-localization With Panorama-bev Co-retrieval Network (2024) 13.94