Learning Segment Similarity And Alignment In Large-scale Content Based Video Retrieval
2023 Β· Chen Jiang, Kaiming Huang, Sifeng He, et al.
Abstract
With the explosive growth of web videos in recent years, large-scale Content-Based Video Retrieval (CBVR) becomes increasingly essential in video filtering, recommendation, and copyright protection. Segment-level CBVR (S-CBVR) locates the start and end time of similar segments in finer granularity, which is beneficial for user browsing efficiency and infringement detection especially in long video scenarios. The challenge of S-CBVR task is how to achieve high temporal alignment accuracy with efficient computation and low storage consumption. In this paper, we propose a Segment Similarity and Alignment Network (SSAN) in dealing with the challenge which is firstly trained end-to-end in S-CBVR. SSAN is based on two newly proposed modules in video retrieval: (1) An efficient Self-supervised Keyframe Extraction (SKE) module to reduce redundant frame features, (2) A robust Similarity Pattern Detection (SPD) module for temporal alignment. In comparison with uniform frame extraction, SKE not o
Authors
(none)
Tags
Stats
Related papers
- Sync From The Sea: Retrieving Alignable Videos From Large-scale Datasets (2024)4.52
- Semantic Video Moments Retrieval At Scale: A New Task And A Baseline (2022)0.00
- VRAG: Region Attention Graphs For Content-based Video Retrieval (2022)0.00
- T2VLAD: Global-local Sequence Alignment For Text-video Retrieval (2021)16.65
- A Lightweight Moment Retrieval System With Global Re-ranking And Robust Adaptive Bidirectional Temporal Search (2025)3.58
- Differentiable Resolution Compression And Alignment For Efficient Video Classification And Retrieval (2023)5.27
- Hanet: Hierarchical Alignment Networks For Video-text Retrieval (2021)0.00
- Video-text Retrieval By Supervised Sparse Multi-grained Learning (2023)8.03