Text2graph VPR: A Text-to-graph Expert System For Explainable Place Recognition In Changing Environments
2025 Β· Saeideh Yousefzadeh, Hamidreza Pourreza
Abstract
Visual Place Recognition (VPR) in long-term deployment requires reasoning beyond pixel similarity: systems must make transparent, interpretable decisions that remain robust under lighting, weather and seasonal change. We present Text2Graph VPR, an explainable semantic localization system that converts image sequences into textual scene descriptions, parses those descriptions into structured scene graphs, and reasons over the resulting graphs to identify places. Scene graphs capture objects, attributes and pairwise relations; we aggregate per-frame graphs into a compact place representation and perform retrieval with a dual-similarity mechanism that fuses learned Graph Attention Network (GAT) embeddings and a Shortest-Path (SP) kernel for structural matching. This hybrid design enables both learned semantic matching and topology-aware comparison, and -- critically -- produces human-readable intermediate representations that support diagnostic analysis and improve transparency in the dec
Authors
(none)
Tags
Stats
Related papers
- Textinplace: Indoor Visual Place Recognition In Repetitive Structures With Scene Text Spotting And Verification (2025)4.72
- SAGE: Spatial-visual Adaptive Graph Exploration For Efficient Visual Place Recognition (2025)2.16
- Lavpr: Benchmarking Language And Vision For Place Recognition (2026)2.35
- Structvpr++: Distill Structural And Semantic Knowledge With Weighting Samples For Visual Place Recognition (2025)3.58
- Unipr-3d: Towards Universal Visual Place Recognition With Visual Geometry Grounded Transformer (2025)2.95
- Towards Test-time Efficient Visual Place Recognition Via Asymmetric Query Processing (2025)0.00
- Data-efficient Large Scale Place Recognition With Graded Similarity Supervision (2023)16.32
- Structvpr: Distill Structural Knowledge With Weighting Samples For Visual Place Recognition (2022)10.97