Cross-modal Subspace Learning For Fine-grained Sketch-based Image Retrieval
2017 · Peng Xu, Qiyue Yin, Yongye Huang, et al.
Abstract
Sketch-based image retrieval (SBIR) is challenging due to the inherent domain gap between sketch and photo. Compared with the pixel-perfect depictions in photos, sketches are highly abstract, iconic renderings of the real world. Therefore, matching sketch and photo directly using low-level visual cues is insufficient, since a common low-level subspace that traverses semantically across the two modalities is non-trivial to establish. Most existing SBIR studies do not directly tackle this cross-modal problem. This naturally motivates us to explore the effectiveness of cross-modal retrieval methods in SBIR, which have been applied successfully in image-text matching. In this paper, we introduce and compare a series of state-of-the-art cross-modal subspace learning methods and benchmark them on two recently released fine-grained SBIR datasets. Through thorough examination of the experimental results, we have demonstrated that the subspace learning can effectively model the sketch-pho
Related papers
- Crossatnet - A Novel Cross-attention Based Framework For Sketch-based Image Retrieval (2021) 11.29
- Relation-aware Meta-learning For Zero-shot Sketch-based Image Retrieval (2024) 0.00
- Towards Unsupervised Sketch-based Image Retrieval (2021) 0.00
- A Zero-shot Framework For Sketch-based Image Retrieval (2018) 16.49
- Cross-modal Hierarchical Modelling For Fine-grained Sketch Based Image Retrieval (2020) 6.77
- Sketch Less For More: On-the-fly Fine-grained Sketch Based Image Retrieval (2020) 15.28
- Stylemeup: Towards Style-agnostic Sketch-based Image Retrieval (2021) 14.69
- Modality-aware Representation Learning For Zero-shot Sketch-based Image Retrieval (2024) 8.60