Semantic Video Moments Retrieval At Scale: A New Task And A Baseline
2022 · Na Li
Abstract
Motivated by the growing need to save search effort by returning relevant video clips instead of whole videos, we propose a new task, Semantic Video Moments Retrieval at scale (SVMR), which aims to find relevant videos and re-localize the relevant clips within them. Rather than being a simple combination of video retrieval and video re-localization, our task is more challenging in several essential respects. In the first stage, SVMR must account for two facts: 1) a positive candidate long video can contain many irrelevant clips that are nonetheless semantically meaningful; 2) a long video can be positive for two entirely different query clips if it contains clips relevant to both queries. The second, re-localization stage also departs from existing video re-localization tasks, which assume that the reference video must contain segments semantically similar to the query clip. Instead, in our scenario, the retrieved
Related papers
- Improving Video Corpus Moment Retrieval With Partial Relevance Enhancement (2024) 7.89
- Frame-wise Cross-modal Matching For Video Moment Retrieval (2020) 13.17
- Towards Balanced Alignment: Modal-enhanced Semantic Modeling For Video Moment Retrieval (2023) 14.33
- Momentseeker: A Task-oriented Benchmark For Long-video Moment Retrieval (2025) 0.00
- On Semantic Similarity In Video Retrieval (2021) 12.81
- Hybrid-learning Video Moment Retrieval Across Multi-domain Labels (2024) 0.00
- Towards Efficient And Robust Moment Retrieval System: A Unified Framework For Multi-granularity Models And Temporal Reranking (2025) 2.26
- Viseret: A Simple Yet Effective Approach To Moment Retrieval Via Fine-grained Video Segmentation (2021) 0.00