Scicevpr: Stable Cross-image Correlation Enhanced Model For Visual Place Recognition
2025 Β· Shanshan Wan, Yingmei Wei, Lai Kang, et al.
Abstract
Visual Place Recognition (VPR) is a major challenge for robotics and autonomous systems, with the goal of predicting the location of an image based solely on its visual features. State-of-the-art (SOTA) models extract global descriptors using the powerful foundation model DINOv2 as backbone. These models either explore the cross-image correlation or propose a time-consuming two-stage re-ranking strategy to achieve better performance. However, existing works only utilize the final output of DINOv2, and the current cross-image correlation causes unstable retrieval results. To produce both discriminative and constant global descriptors, this paper proposes stable cross-image correlation enhanced model for VPR called SciceVPR. This model explores the full potential of DINOv2 in providing useful feature representations that implicitly encode valuable contextual knowledge. Specifically, SciceVPR first uses a multi-layer feature fusion module to capture increasingly detailed task-relevant cha
Authors
(none)
Tags
Stats
Related papers
- Evaluation Of Visual Place Recognition Methods For Image Pair Retrieval In 3D Vision And Robotics (2026)0.00
- Mutualvpr: A Mutual Learning Framework For Resolving Supervision Inconsistencies Via Adaptive Clustering (2024)0.00
- Towards Test-time Efficient Visual Place Recognition Via Asymmetric Query Processing (2025)0.00
- Embodiedplace: Learning Mixture-of-features With Embodied Constraints For Visual Place Recognition (2025)0.00
- Unipr-3d: Towards Universal Visual Place Recognition With Visual Geometry Grounded Transformer (2025)2.95
- Range And Bird's Eye View Fused Cross-modal Visual Place Recognition (2025)0.00
- Mixvpr: Feature Mixing For Visual Place Recognition (2023)22.68
- Focus On Local: Finding Reliable Discriminative Regions For Visual Place Recognition (2025)10.70