Attention-based Pyramid Aggregation Network For Visual Place Recognition
2018 · Yingying Zhu, Jiong Wang, Lingxi Xie, et al.
Abstract
Visual place recognition is challenging in urban environments and is usually viewed as a large-scale image retrieval task. Its intrinsic challenges are that confusing objects such as cars and trees frequently occur in complex urban scenes, and that buildings with repetitive structures may cause over-counting and the burstiness problem, degrading the image representations. To address these problems, we present an Attention-based Pyramid Aggregation Network (APANet), which is trained in an end-to-end manner for place recognition. One main component of APANet, spatial pyramid pooling, can effectively encode multi-size buildings containing geo-information. The other, the attention block, is adopted as a region evaluator that suppresses confusing regional features while highlighting discriminative ones. At test time, we further propose a simple yet effective PCA power whitening strategy, which significantly improves the widely used PCA whitenin
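The two components named in the abstract, spatial pyramid pooling and an attention block acting as a region evaluator, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The grid levels `(1, 2, 3)`, the max-pooling per region, the linear scoring vector `w_att`, the softmax over region scores, and the weighted-sum aggregation are all assumptions chosen for illustration, and every function name here is hypothetical.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors along `axis` to (approximately) unit L2 norm."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def pyramid_regions(h, w, levels=(1, 2, 3)):
    """Yield (y0, y1, x0, x1) cells of an n x n grid for each pyramid level.

    Assumes h and w are at least max(levels) so that no cell is empty.
    """
    for n in levels:
        ys = np.linspace(0, h, n + 1).astype(int)
        xs = np.linspace(0, w, n + 1).astype(int)
        for i in range(n):
            for j in range(n):
                yield ys[i], ys[i + 1], xs[j], xs[j + 1]


def attention_pyramid_aggregate(fmap, w_att, levels=(1, 2, 3)):
    """Aggregate a (C, H, W) feature map into one C-dimensional descriptor.

    fmap  : convolutional feature map, shape (C, H, W)
    w_att : hypothetical scoring vector of shape (C,), standing in for a
            learned attention block (assumption, not the paper's design)
    """
    _, h, w = fmap.shape

    # Spatial pyramid pooling: max-pool each grid cell into a regional feature.
    regions = np.stack([
        fmap[:, y0:y1, x0:x1].max(axis=(1, 2))
        for y0, y1, x0, x1 in pyramid_regions(h, w, levels)
    ])                                   # shape (R, C)
    regions = l2_normalize(regions)

    # Attention as a region evaluator: one score per region, softmax-normalized.
    scores = regions @ w_att             # shape (R,)
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()

    # Down-weight low-scoring regions, emphasize high-scoring ones.
    descriptor = (weights[:, None] * regions).sum(axis=0)
    return l2_normalize(descriptor), weights
```

With levels `(1, 2, 3)` the pyramid yields 1 + 4 + 9 = 14 regions, so `weights` has 14 entries that sum to 1. In a trained network the scoring would come from learned parameters rather than a fixed vector.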
Related papers
- MultiRes-NetVLAD: Augmenting Place Recognition Training With Low-resolution Imagery (2022) · 16.01
- Spatio-semantic ConvNet-based Visual Place Recognition (2019) · 10.21
- City-scale Visual Place Recognition With Deep Local Features Based On Multi-scale Ordered VLAD Pooling (2020) · 1.69
- PCAN: 3D Attention Map Learning Using Contextual Information For Point Cloud Based Retrieval (2019) · 17.42
- Efficient 3D Point Cloud Feature Learning For Large-scale Place Recognition (2021) · 14.73
- Towards Implicit Aggregation: Robust Image Representation For Place Recognition In The Transformer Era (2025) · 3.09
- Attention-aware Age-agnostic Visual Place Recognition (2019) · 8.82
- A Hybrid Compact Neural Architecture For Visual Place Recognition (2019) · 12.99