Where To Look And How To Describe: Fashion Image Retrieval With An Attentional Heterogeneous Bilinear Network
2020 · Haibo Su, Peng Wang, Lingqiao Liu, et al.
Abstract
Fashion products typically combine a variety of styles across different clothing parts. To distinguish images of different fashion products, we need to extract both appearance (i.e., "how to describe") and localization (i.e., "where to look") information, as well as their interactions. To this end, we propose a biologically inspired framework for image-based fashion product retrieval, which mimics the hypothesized two-stream visual processing system of the human brain. The proposed attentional heterogeneous bilinear network (AHBN) consists of two branches: a deep CNN branch to extract fine-grained appearance attributes and a fully convolutional branch to extract landmark localization information. A joint channel-wise attention mechanism is then applied to the extracted heterogeneous features to focus on important channels, followed by a compact bilinear pooling layer that models the interaction of the two streams. Our proposed framework achieves satisfactory performance on t
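The fusion step described in the abstract (joint channel-wise attention followed by compact bilinear pooling) can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the form of the attention, so a per-channel sigmoid gate over the concatenated features is assumed, and `weights` is a hypothetical stand-in for learned parameters. Compact bilinear pooling is shown in its Tensor Sketch form (Count Sketch of each stream, then circular convolution). The convolution is computed directly rather than via the FFT that practical implementations normally use, which gives the same result and keeps the example dependency-free. All function names are made up for this sketch.

```python
import math
import random


def channel_attention(appearance, location, weights):
    """Gate every channel of the concatenated features with a sigmoid (assumed form).

    `weights` is a hypothetical learned matrix with one row per output channel.
    """
    joint = appearance + location  # concatenate the two pooled descriptors
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * v for w, v in zip(row, joint))))
        for row in weights
    ]
    gated = [g * v for g, v in zip(gates, joint)]
    return gated[: len(appearance)], gated[len(appearance):]


def make_hash(n, d, rng):
    """Random bucket index and random sign for each of n input channels."""
    idx = [rng.randrange(d) for _ in range(n)]
    sign = [rng.choice((-1.0, 1.0)) for _ in range(n)]
    return idx, sign


def count_sketch(x, idx, sign, d):
    """Project x into d buckets: out[idx[i]] += sign[i] * x[i]."""
    out = [0.0] * d
    for v, h, s in zip(x, idx, sign):
        out[h] += s * v
    return out


def circular_conv(a, b):
    """Direct O(d^2) circular convolution (an FFT would be used in practice)."""
    d = len(a)
    out = [0.0] * d
    for i, av in enumerate(a):
        for j, bv in enumerate(b):
            out[(i + j) % d] += av * bv
    return out


def compact_bilinear(x, y, hash_x, hash_y, d):
    """Tensor Sketch: a d-dimensional sketch of the outer product of x and y."""
    return circular_conv(
        count_sketch(x, hash_x[0], hash_x[1], d),
        count_sketch(y, hash_y[0], hash_y[1], d),
    )


# Toy usage: 3 appearance channels, 2 localization channels, 4-dim output.
rng = random.Random(0)
app, loc = [1.0, 2.0, -1.0], [0.5, -2.0]
w = [[rng.uniform(-1, 1) for _ in range(5)] for _ in range(5)]
app_att, loc_att = channel_attention(app, loc, w)
fused = compact_bilinear(app_att, loc_att, make_hash(3, 4, rng), make_hash(2, 4, rng), 4)
```

The point of the sketch is that `fused` approximates the full outer product of the two streams (here 3 × 2 values, in a real network far more) in a fixed, much smaller number of dimensions, which is what makes bilinear interaction between heterogeneous features affordable.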
Related papers
- Clothing Retrieval With Visual Attention Model (2017)
- MMFL-Net: Multi-scale And Multi-granularity Feature Learning For Cross-domain Fashion Retrieval (2022)
- Search By Image: Deeply Exploring Beneficial Features For Beauty Product Retrieval (2023)
- Methods And Advancement Of Content-based Fashion Image Retrieval: A Review (2023)
- Searching For Apparel Products From Images In The Wild (2019)
- Diversity In Fashion Recommendation Using Semantic Parsing (2019)
- Cross-domain Image Retrieval With Attention Modeling (2017)
- FashionMV: Product-level Composed Image Retrieval With Multi-view Fashion Data (2026)