Text-based Person Search With Limited Data
2021 · Xiao Han, Sen He, Li Zhang, et al.
Abstract
Text-based person search (TBPS) aims at retrieving a target person from an image gallery with a descriptive text query. Solving such a fine-grained cross-modal retrieval task is challenging, and it is further hampered by the lack of large-scale datasets. In this paper, we present a framework with two novel components to handle the problems brought by limited data. First, to fully utilize the existing small-scale benchmarking datasets for more discriminative feature learning, we introduce a cross-modal momentum contrastive learning framework to enrich the training data for a given mini-batch. Second, we propose to transfer knowledge learned from existing coarse-grained large-scale datasets containing image-text pairs from drastically different problem domains to compensate for the lack of TBPS training data. A transfer learning method is designed so that useful information can be transferred despite the large domain gap. Armed with these components, our method achieves new state of t
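As a rough illustration of how momentum contrastive learning can enrich a mini-batch, the following NumPy sketch scores each query against one positive key from the other modality plus a queue of keys kept from earlier batches. It is not the authors' implementation: the abstract does not specify the loss, encoders, queue size, or temperature, so the InfoNCE form, the function names (`cross_modal_info_nce`, `momentum_update`, `enqueue`), and all default values are assumptions made for illustration.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def momentum_update(online_params, momentum_params, m=0.999):
    """Exponential moving average of the online weights (assumed MoCo-style rule)."""
    return [m * pm + (1.0 - m) * po for po, pm in zip(online_params, momentum_params)]


def cross_modal_info_nce(query, positive_key, queue, temperature=0.07):
    """InfoNCE loss with one positive per query and a queue of negatives.

    query:        (B, D) features of one modality (e.g. images), online encoder.
    positive_key: (B, D) matching features of the other modality (e.g. text),
                  assumed to come from a momentum encoder.
    queue:        (K, D) keys stored from earlier mini-batches, used as negatives.
    """
    q = l2_normalize(query)
    k = l2_normalize(positive_key)
    neg = l2_normalize(queue)
    pos_logit = np.sum(q * k, axis=1, keepdims=True)   # (B, 1)
    neg_logits = q @ neg.T                              # (B, K)
    logits = np.concatenate([pos_logit, neg_logits], axis=1) / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob_pos = logits[:, 0] - np.log(np.exp(logits).sum(axis=1))
    return float(-log_prob_pos.mean())


def enqueue(queue, new_keys):
    """FIFO update: append the newest keys and drop the oldest, keeping K fixed."""
    size = queue.shape[0]
    return np.concatenate([queue, new_keys], axis=0)[-size:]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image_feats = rng.normal(size=(4, 8))   # stand-in for image encoder output
    text_keys = rng.normal(size=(4, 8))     # stand-in for momentum text encoder output
    text_queue = rng.normal(size=(16, 8))   # keys from earlier batches
    print(cross_modal_info_nce(image_feats, text_keys, text_queue))
    text_queue = enqueue(text_queue, text_keys)
```

In this sketch the queue is what enlarges the effective set of negatives beyond the current mini-batch, and the slow-moving momentum weights keep the stored keys roughly consistent with newly encoded ones.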
Related papers
- Semi-supervised Text-based Person Search (2024) 3.58
- Contrastive Transformer Learning With Proximity Data Generation For Text-based Person Search (2023) 11.88
- Boosting Weak Positives For Text Based Person Search (2025) 0.00
- TIPCB: A Simple But Effective Part-based Convolutional Baseline For Text-based Person Search (2021) 20.24
- Enhancing Visual Representation For Text-based Person Searching (2024) 1.69
- Improving Text-based Person Search Via Part-level Cross-modal Correspondence (2024) 0.00
- Beat: Bi-directional One-to-many Embedding Alignment For Text-based Person Retrieval (2024) 10.85
- Up-person: Unified Parameter-efficient Transfer Learning For Text-based Person Retrieval (2025) 4.26