LOVO: Efficient Complex Object Query In Large-scale Video Datasets
2025 Β· Yuxin Liu, Yuezhang Peng, Hefeng Zhou, et al.
Abstract
The widespread deployment of cameras has led to an exponential increase in video data, creating vast opportunities for applications such as traffic management and crime surveillance. However, querying specific objects from large-scale video datasets presents challenges, including (1) processing massive and continuously growing data volumes, (2) supporting complex query requirements, and (3) ensuring low-latency execution. Existing video analysis methods struggle with either limited adaptability to unseen object classes or suffer from high query latency. In this paper, we present LOVO, a novel system designed to efficiently handle comp\(\underline\{L\}\)ex \(\underline\{O\}\)bject queries in large-scale \(\underline\{V\}\)ide\(\underline\{O\}\) datasets. Agnostic to user queries, LOVO performs one-time feature extraction using pre-trained visual encoders, generating compact visual embeddings for key frames to build an efficient index. These visual embeddings, along with associated bound
Authors
(none)
Tags
Stats
Related papers
- Lazyvlm: Neuro-symbolic Approach To Video Analytics (2025)0.00
- Lovr: A Benchmark For Long Video Retrieval In Multimodal Contexts (2025)0.00
- Exploiting Local Indexing And Deep Feature Confidence Scores For Fast Image-to-video Search (2018)2.26
- Object-centric Framework For Video Moment Retrieval (2025)0.00
- SALOVA: Segment-augmented Long Video Assistant For Targeted Retrieval And Routing In Long-form Video Analysis (2024)0.00
- Verve: Versatile Retrieval For Videos Via Unified Embeddings (2026)0.00
- The VISIONE Video Search System: Exploiting Off-the-shelf Text Search Engines For Large-scale Video Retrieval (2020)10.74
- RAVU: Retrieval Augmented Video Understanding With Compositional Reasoning Over Graph (2025)0.00