Sca-pvnet: Self-and-cross Attention Based Aggregation Of Point Cloud And Multi-view For 3D Object Retrieval
2023 Β· Dongyun Lin, Yi Cheng, Aiyuan Guo, et al.
Abstract
To address 3D object retrieval, substantial efforts have been made to generate highly discriminative descriptors of 3D objects represented by a single modality, e.g., voxels, point clouds or multi-view images. It is promising to leverage the complementary information from multi-modality representations of 3D objects to further improve retrieval performance. However, multi-modality 3D object retrieval is rarely developed and analyzed on large-scale datasets. In this paper, we propose self-and-cross attention based aggregation of point cloud and multi-view images (SCA-PVNet) for 3D object retrieval. With deep features extracted from point clouds and multi-view images, we design two types of feature aggregation modules, namely the In-Modality Aggregation Module (IMAM) and the Cross-Modality Aggregation Module (CMAM), for effective feature fusion. IMAM leverages a self-attention mechanism to aggregate multi-view features while CMAM exploits a cross-attention mechanism to interact point clo
Authors
(none)
Tags
Stats
Related papers
- Multiple Discrimination And Pairwise CNN For View-based 3D Object Retrieval (2020)14.27
- PCAN: 3D Attention Map Learning Using Contextual Information For Point Cloud Based Retrieval (2019)17.42
- PREMA: Part-based Recurrent Multi-view Aggregation Network For 3D Shape Retrieval (2021)3.58
- SCA3D: Enhancing Cross-modal 3D Retrieval Via 3D Shape And Caption Paired Data Augmentation (2025)4.17
- Enhanced Cross-modal 3D Retrieval Via Tri-modal Reconstruction (2025)0.00
- View N-gram Network For 3D Object Retrieval (2019)13.05
- COM3D: Leveraging Cross-view Correspondence And Cross-modal Mining For 3D Retrieval (2024)3.58
- Pointnetvlad: Deep Point Cloud Based Retrieval For Large-scale Place Recognition (2018)25.45