TextInPlace: Indoor Visual Place Recognition In Repetitive Structures With Scene Text Spotting And Verification
2025 · Huaqi Tao, Bingxi Liu, Calvin Chen, et al.
Abstract
Visual Place Recognition (VPR) is a crucial capability for long-term autonomous robots, enabling them to identify previously visited locations using visual information. However, existing methods remain limited in indoor settings due to the highly repetitive structures inherent in such environments. We observe that scene texts frequently appear in indoor spaces and can help distinguish visually similar but different places. This inspires us to propose TextInPlace, a simple yet effective VPR framework that integrates Scene Text Spotting (STS) to mitigate visual perceptual ambiguity in repetitive indoor environments. Specifically, TextInPlace adopts a dual-branch architecture within a local parameter sharing network. The VPR branch employs attention-based aggregation to extract global descriptors for coarse-grained retrieval, while the STS branch utilizes a bridging text spotter to detect and recognize scene texts. Finally, the discriminative texts are filtered to compute text similarity
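The two-stage idea in the abstract (coarse retrieval with global descriptors, then verification with scene text) can be illustrated with a minimal sketch. Everything here is an assumption for illustration only: the function names, the toy descriptors, and especially the text similarity measure (a `difflib` string ratio), since the abstract does not specify how TextInPlace computes or filters text similarity. The filtering of discriminative texts is omitted.

```python
import math
from difflib import SequenceMatcher


def cosine(a, b):
    """Cosine similarity between two global descriptors (plain lists)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def text_similarity(query_texts, cand_texts):
    """Hypothetical score: mean best string-match ratio per query text."""
    if not query_texts or not cand_texts:
        return 0.0
    total = 0.0
    for q in query_texts:
        total += max(
            SequenceMatcher(None, q.lower(), c.lower()).ratio()
            for c in cand_texts
        )
    return total / len(query_texts)


def retrieve_and_rerank(query_desc, query_texts, database, top_k=3):
    """database: list of (place_id, descriptor, spotted_texts) tuples."""
    # Stage 1: coarse retrieval by global-descriptor similarity.
    coarse = sorted(
        database, key=lambda e: cosine(query_desc, e[1]), reverse=True
    )[:top_k]
    # Stage 2: re-rank candidates by text similarity. The sort is stable,
    # so ties keep their coarse-retrieval order.
    reranked = sorted(
        coarse, key=lambda e: text_similarity(query_texts, e[2]), reverse=True
    )
    return [e[0] for e in reranked]


# Toy data (invented): two near-identical corridors that differ only in
# their room signs, plus a visually distinct lobby.
database = [
    ("corridor_a", [1.0, 0.0, 0.10], ["Room 101", "EXIT"]),
    ("corridor_b", [1.0, 0.0, 0.12], ["Room 205", "EXIT"]),
    ("lobby", [0.0, 1.0, 0.00], ["Reception"]),
]
query_desc = [1.0, 0.0, 0.10]
query_texts = ["Room 205"]

print(retrieve_and_rerank(query_desc, query_texts, database, top_k=2))
```

In this toy case the descriptor alone slightly prefers `corridor_a`, while the spotted room number moves `corridor_b` to the top, which is the kind of ambiguity the abstract says scene text is meant to resolve.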
Related papers
- Text2Graph VPR: A Text-to-graph Expert System For Explainable Place Recognition In Changing Environments (2025) · 0.00
- Evaluation Of Visual Place Recognition Methods For Image Pair Retrieval In 3D Vision And Robotics (2026) · 0.00
- EmbodiedPlace: Learning Mixture-of-features With Embodied Constraints For Visual Place Recognition (2025) · 0.00
- StructVPR++: Distill Structural And Semantic Knowledge With Weighting Samples For Visual Place Recognition (2025) · 3.58
- SciceVPR: Stable Cross-image Correlation Enhanced Model For Visual Place Recognition (2025) · 4.06
- Structured Pruning For Efficient Visual Place Recognition (2024) · 2.26
- MixVPR: Feature Mixing For Visual Place Recognition (2023) · 22.68
- StructVPR: Distill Structural Knowledge With Weighting Samples For Visual Place Recognition (2022) · 10.97