Fashion IQ: A New Dataset Towards Retrieving Images By Natural Language Feedback
2019 · Hui Wu, Yupeng Gao, Xiaoxiao Guo, et al.
Abstract
Conversational interfaces for the detail-oriented retail fashion domain are more natural, expressive, and user friendly than classical keyword-based search interfaces. In this paper, we introduce the Fashion IQ dataset to support and advance research on interactive fashion image retrieval. Fashion IQ is the first fashion dataset to provide human-generated captions that distinguish similar pairs of garment images together with side-information consisting of real-world product descriptions and derived visual attribute labels for these images. We provide a detailed analysis of the characteristics of the Fashion IQ data, and present a transformer-based user simulator and interactive image retriever that can seamlessly integrate visual attributes with image features, user feedback, and dialog history, leading to improved performance over the state of the art in dialog-based image retrieval. We believe that our dataset will encourage further work on developing more natural and real-world app
Authors
(none)
Tags
Stats
Related papers
- Training And Challenging Models For Text-guided Fashion Image Retrieval (2022)0.00
- Conversational Fashion Image Retrieval Via Multiturn Natural Language Feedback (2021)11.85
- Designovel's System Description For Fashion-iq Challenge 2019 (2019)0.00
- Facap: A Large-scale Fashion Dataset For Fine-grained Composed Image Retrieval (2025)0.00
- Fashionmv: Product-level Composed Image Retrieval With Multi-view Fashion Data (2026)2.98
- Modality-agnostic Attention Fusion For Visual Search With Text Feedback (2020)0.00
- Fashion-rag: Multimodal Fashion Image Editing Via Retrieval-augmented Generation (2025)4.52
- FIRE-CIR: Fine-grained Reasoning For Composed Fashion Image Retrieval (2026)0.00