Ufinebench: Towards Text-based Person Retrieval With Ultra-fine Granularity
2023 Β· Jialong Zuo, Hanyu Zhou, Ying Nie, et al.
Abstract
Existing text-based person retrieval datasets often have relatively coarse-grained text annotations. This hinders the model to comprehend the fine-grained semantics of query texts in real scenarios. To address this problem, we contribute a new benchmark named \textbf\{UFineBench\} for text-based person retrieval with ultra-fine granularity. Firstly, we construct a new \textbf\{dataset\} named UFine6926. We collect a large number of person images and manually annotate each image with two detailed textual descriptions, averaging 80.8 words each. The average word count is three to four times that of the previous datasets. In addition of standard in-domain evaluation, we also propose a special \textbf\{evaluation paradigm\} more representative of real scenarios. It contains a new evaluation set with cross domains, cross textual granularity and cross textual styles, named UFine3C, and a new evaluation metric for accurately measuring retrieval ability, named mean Similarity Distribution (m
Authors
(none)
Tags
Stats
Related papers
- Rethinking Benchmarks For Cross-modal Image-text Retrieval (2023)13.11
- Sa-person: Text-based Person Retrieval With Scene-aware Re-ranking (2025)0.00
- Text-based Person Search With Limited Data (2021)15.69
- Automatic Synthetic Data And Fine-grained Adaptive Feature Alignment For Composed Person Retrieval (2023)3.75
- TVPR: Text-to-video Person Retrieval And A New Benchmark (2023)2.26
- Multi-path Exploration And Feedback Adjustment For Text-to-image Person Retrieval (2024)0.00
- Improving Text-based Person Search Via Part-level Cross-modal Correspondence (2024)0.00
- Decoupled Cross-modal Alignment Network For Text-rgbt Person Retrieval And A High-quality Benchmark (2025)0.00