Crossatnet - A Novel Cross-attention Based Framework For Sketch-based Image Retrieval
2021 Β· Ushasi Chaudhuri, Biplab Banerjee, Avik Bhattacharya, et al.
Abstract
We propose a novel framework for cross-modal zero-shot learning (ZSL) in the context of sketch-based image retrieval (SBIR). Conventionally, the SBIR schema mainly considers simultaneous mappings among the two image views and the semantic side information. Therefore, it is desirable to consider fine-grained classes mainly in the sketch domain using highly discriminative and semantically rich feature space. However, the existing deep generative modeling-based SBIR approaches majorly focus on bridging the gaps between the seen and unseen classes by generating pseudo-unseen-class samples. Besides, violating the ZSL protocol by not utilizing any unseen-class information during training, such techniques do not pay explicit attention to modeling the discriminative nature of the shared space. Also, we note that learning a unified feature space for both the multi-view visual data is a tedious task considering the significant domain difference between sketches and color images. In this respect,
Authors
(none)
Tags
Stats
Related papers
- An Efficient Framework For Zero-shot Sketch-based Image Retrieval (2021)13.65
- Relation-aware Meta-learning For Zero-shot Sketch-based Image Retrieval (2024)0.00
- A Zero-shot Framework For Sketch-based Image Retrieval (2018)16.49
- Cross-modal Subspace Learning For Fine-grained Sketch-based Image Retrieval (2017)13.34
- Semantic Adversarial Network For Zero-shot Sketch-based Image Retrieval (2019)10.74
- CLIP For All Things Zero-shot Sketch-based Image Retrieval, Fine-grained Or Not (2023)15.54
- Adapt And Align To Improve Zero-shot Sketch-based Image Retrieval (2023)0.00
- Stacked Semantic-guided Network For Zero-shot Sketch-based Image Retrieval (2019)0.00