Cross-view Geo-localization, Image Retrieval, Multiscale Geometric Modeling, Frequency Domain Enhancement
2026 Β· Hongying Zhang, Shuaishuai Ma
Abstract
Cross-view geo-localization (CVGL) aims to establish spatial correspondences between images captured from significantly different viewpoints and constitutes a fundamental technique for visual localization in GNSS-denied environments. Nevertheless, CVGL remains challenging due to severe geometric asymmetry, texture inconsistency across imaging domains, and the progressive degradation of discriminative local information. Existing methods predominantly rely on spatial domain feature alignment, which is inherently sensitive to large scale viewpoint variations and local disturbances. To alleviate these limitations, this paper proposes the Spatial and Frequency Domain Enhancement Network (SFDE), which leverages complementary representations from spatial and frequency domains. SFDE adopts a three branch parallel architecture to model global semantic context, local geometric structure, and statistical stability in the frequency domain, respectively, thereby characterizing consistency across do
Authors
(none)
Tags
Stats
Related papers
- BEV-CV: Birds-eye-view Transform For Cross-view Geo-localisation (2023)5.84
- Clnet: Cross-view Correspondence Makes A Stronger Geo-localizationer (2025)0.00
- Just Zoom In: Cross-view Geo-localization Via Autoregressive Zooming (2026)0.00
- Cross-view Image Matching For Geo-localization In Urban Environments (2017)17.16
- VIGOR: Cross-view Image Geo-localization Beyond One-to-one Retrieval (2020)21.49
- Geo-localization Via Ground-to-satellite Cross-view Image Retrieval (2022)12.54
- C-BEV: Contrastive Bird's Eye View Training For Cross-view Image Retrieval And 3-dof Pose Estimation (2023)0.00
- Cross-view Image Geo-localization With Panorama-bev Co-retrieval Network (2024)13.94