DSSL: Deep Surroundings-person Separation Learning For Text-based Person Retrieval
2021 · Aichun Zhu, Zijie Wang, Yifeng Li, et al.
Abstract
Many previous methods for text-based person retrieval are devoted to learning a mapping into a latent common space, with the aim of extracting modality-invariant features from both the visual and textual modalities. Nevertheless, due to the complexity of high-dimensional data, such unconstrained mapping paradigms cannot properly capture discriminative clues about the corresponding person while dropping misaligned information. Intuitively, the information contained in visual data can be divided into person information (PI) and surroundings information (SI), which are mutually exclusive. To this end, we propose a novel Deep Surroundings-person Separation Learning (DSSL) model to effectively extract and match person information, and hence achieve superior retrieval accuracy. A surroundings-person separation and fusion mechanism plays the key role in realizing an accurate and effective surroundings-person separation under a mutual exclusion constraint. In
Related papers
- Deep-person: Learning Discriminative Deep Features For Person Re-identification (2017) 16.90
- Dynamic Uncertainty Learning With Noisy Correspondence For Text-based Person Search (2025) 7.50
- SA-Person: Text-based Person Retrieval With Scene-aware Re-ranking (2025) 0.00
- See Finer, See More: Implicit Modality Alignment For Text-based Person Retrieval (2022) 18.39
- Improving Text-based Person Search Via Part-level Cross-modal Correspondence (2024) 0.00
- Cross-modal Implicit Relation Reasoning And Aligning For Text-to-image Person Retrieval (2023) 18.15
- Decoupled Cross-modal Alignment Network For Text-RGBT Person Retrieval And A High-quality Benchmark (2025) 0.00
- Contrastive Transformer Learning With Proximity Data Generation For Text-based Person Search (2023) 11.88