Improving Video Corpus Moment Retrieval With Partial Relevance Enhancement
2024 Β· Danyang Hou, Liang Pang, Huawei Shen, et al.
Abstract
Video Corpus Moment Retrieval (VCMR) is a new video retrieval task aimed at retrieving a relevant moment from a large corpus of untrimmed videos using a text query. The relevance between the video and query is partial, mainly evident in two aspects:~(1)~Scope: The untrimmed video contains many frames, but not all are relevant to the query. Strong relevance is typically observed only within the relevant moment.~(2)~Modality: The relevance of the query varies with different modalities. Action descriptions align more with visual elements, while character conversations are more related to textual information.Existing methods often treat all video contents equally, leading to sub-optimal moment retrieval. We argue that effectively capturing the partial relevance between the query and video is essential for the VCMR task. To this end, we propose a Partial Relevance Enhanced Model~(PREM) to improve VCMR. VCMR involves two sub-tasks: video retrieval and moment localization. To align with their
Authors
(none)
Tags
Stats
Related papers
- Video Corpus Moment Retrieval With Contrastive Learning (2021)14.35
- When One Moment Isn't Enough: Multi-moment Retrieval With Cross-moment Interactions (2025)1.81
- PRVR: Partially Relevant Video Retrieval (2022)2.26
- Semantic Video Moments Retrieval At Scale: A New Task And A Baseline (2022)0.00
- Viseret: A Simple Yet Effective Approach To Moment Retrieval Via Fine-grained Video Segmentation (2021)0.00
- Uneven Event Modeling For Partially Relevant Video Retrieval (2025)1.40
- Frame-wise Cross-modal Matching For Video Moment Retrieval (2020)13.17
- Towards Efficient And Robust Moment Retrieval System: A Unified Framework For Multi-granularity Models And Temporal Reranking (2025)2.26