Uniloc: Towards Universal Place Recognition Using Any Single Modality
2024 Β· Yan Xia, Zhendong Li, Yun-Jin Li, et al.
Abstract
To date, most place recognition methods focus on single-modality retrieval. While they perform well in specific environments, cross-modal methods offer greater flexibility by allowing seamless switching between map and query sources. It also promises to reduce computation requirements by having a unified model, and achieving greater sample efficiency by sharing parameters. In this work, we develop a universal solution to place recognition, UniLoc, that works with any single query modality (natural language, image, or point cloud). UniLoc leverages recent advances in large-scale contrastive learning, and learns by matching hierarchically at two levels: instance-level matching and scene-level matching. Specifically, we propose a novel Self-Attention based Pooling (SAP) module to evaluate the importance of instance descriptors when aggregated into a place-level descriptor. Experiments on the KITTI-360 dataset demonstrate the benefits of cross-modality for place recognition, achieving supe
Authors
(none)
Tags
Stats
Related papers
- Modalink: Unifying Modalities For Efficient Image-to-pointcloud Place Recognition (2024)9.02
- Crossloc3d: Aerial-ground Cross-source 3D Place Recognition (2023)9.23
- Are Local Features All You Need For Cross-domain Visual Place Recognition? (2023)13.80
- Megaloc: One Retrieval To Place Them All (2025)9.19
- Logg3d-net: Locally Guided Global Descriptor Learning For 3D Place Recognition (2021)19.02
- Boq: A Place Is Worth A Bag Of Learnable Queries (2024)16.09
- Unipr-3d: Towards Universal Visual Place Recognition With Visual Geometry Grounded Transformer (2025)2.95
- Universal Vision-language Dense Retrieval: Learning A Unified Representation Space For Multi-modal Retrieval (2022)3.45