Multi-granularity Graph Pooling For Video-based Person Re-identification
2022 Β· Honghu Pan, Yongyong Chen, Zhenyu He
Abstract
The video-based person re-identification (ReID) aims to identify the given pedestrian video sequence across multiple non-overlapping cameras. To aggregate the temporal and spatial features of the video samples, the graph neural networks (GNNs) are introduced. However, existing graph-based models, like STGCN, perform the \textit\{mean\}/\textit\{max pooling\} on node features to obtain the graph representation, which neglect the graph topology and node importance. In this paper, we propose the graph pooling network (GPNet) to learn the multi-granularity graph representation for the video retrieval, where the \textit\{graph pooling layer\} is implemented to downsample the graph. We first construct a multi-granular graph, whose node features denote image embedding learned by backbone, and edges are established between the temporal and Euclidean neighborhood nodes. We then implement multiple graph convolutional layers to perform the neighborhood aggregation on the graphs. To do
Authors
(none)
Tags
Stats
Related papers
- Pose-aided Video-based Person Re-identification Via Recurrent Graph Convolutional Network (2022)10.97
- Pgganet: Pose Guided Graph Attention Network For Person Re-identification (2021)0.00
- Graph Convolution For Re-ranking In Person Re-identification (2021)8.35
- Person Re-identification With Deep Similarity-guided Graph Neural Network (2018)17.55
- Graph Convolution Based Efficient Re-ranking For Visual Retrieval (2023)9.92
- Graph Based Temporal Aggregation For Video Retrieval (2020)0.00
- Structured Deep Hashing With Convolutional Neural Networks For Fast Person Re-identification (2017)12.87
- A Pose-sensitive Embedding For Person Re-identification With Expanded Cross Neighborhood Re-ranking (2017)23.25