Lookbench: A Live And Holistic Open Benchmark For Fashion Image Retrieval

Abstract

In this paper, we present LookBench (We use the term "look" to reflect retrieval that mirrors how people shop -- finding the exact item, a close substitute, or a visually consistent alternative.), a live, holistic and challenging benchmark for fashion image retrieval in real e-commerce settings. LookBench includes both recent product images sourced from live websites and AI-generated fashion images, reflecting contemporary trends and use cases. Each test sample is time-stamped and we intend to update the benchmark periodically, enabling contamination-aware evaluation aligned with declared training cutoffs. Grounded in our fine-grained attribute taxonomy, LookBench covers single-item and outfit-level retrieval across. Our experiments reveal that LookBench poses a significant challenge on strong baselines, with many models achieving below \(60%\) Recall@1. Our proprietary model achieves the best performance on LookBench, and we release an open-source counterpart that ranks second, with b

Lookbench: A Live And Holistic Open Benchmark For Fashion Image Retrieval

Abstract

Authors

Tags

Stats

Related papers