BIT: Matching-based Bi-directional Interaction Transformation Network For Visible-infrared Person Re-identification
2026 Β· Haoxuan Xu, Guanglin Niu
Abstract
Visible-Infrared Person Re-Identification (VI-ReID) is a challenging retrieval task due to the substantial modality gap between visible and infrared images. While existing methods attempt to bridge this gap by learning modality-invariant features within a shared embedding space, they often overlook the complex and implicit correlations between modalities. This limitation becomes more severe under distribution shifts, where infrared samples are often far fewer than visible ones. To address these challenges, we propose a novel network termed Bi-directional Interaction Transformation (BIT). Instead of relying on rigid feature alignment, BIT adopts a matching-based strategy that explicitly models the interaction between visible and infrared image pairs. Specifically, BIT employs an encoder-decoder architecture where the encoder extracts preliminary feature representations, and the decoder performs bi-directional feature integration and query aware scoring to enhance cross-modality correspo
Authors
(none)
Tags
Stats
Related papers
- Bridging The Gap: Multi-level Cross-modality Joint Alignment For Visible-infrared Person Re-identification (2023)11.29
- Implicit Discriminative Knowledge Learning For Visible-infrared Person Re-identification (2024)16.19
- Bridging The Distribution Gap Of Visible-infrared Person Re-identification With Modality Batch Normalization (2021)8.60
- Video-based Visible-infrared Person Re-identification With Auxiliary Samples (2023)13.49
- Dynamic Dual-attentive Aggregation Learning For Visible-infrared Person Re-identification (2020)19.67
- Unified Batch All Triplet Loss For Visible-infrared Person Re-identification (2021)9.03
- Mutual Information Guided Optimal Transport For Unsupervised Visible-infrared Person Re-identification (2024)0.00
- Mix-modality Person Re-identification: A New And Practical Paradigm (2024)6.34