MQRLD: A Multimodal Data Retrieval Platform With Query-aware Feature Representation And Learned Index Based On Data Lake
2024 Β· Ming Sheng, Shuliang Wang, Yong Zhang, et al.
Abstract
Multimodal data has become a crucial element in the realm of big data analytics, driving advancements in data exploration, data mining, and empowering artificial intelligence applications. To support high-quality retrieval for these cutting-edge applications, a robust multimodal data retrieval platform should meet the challenges of transparent data storage, rich hybrid queries, effective feature representation, and high query efficiency. However, among the existing platforms, traditional schema-on-write systems, multi-model databases, vector databases, and data lakes, which are the primary options for multimodal data retrieval, make it difficult to fulfill these challenges simultaneously. Therefore, there is an urgent need to develop a more versatile multimodal data retrieval platform to address these issues. In this paper, we introduce a Multimodal Data Retrieval Platform with Query-aware Feature Representation and Learned Index based on Data Lake (MQRLD). It leverages the transparent
Authors
(none)
Tags
Stats
Related papers
- An Interactive Multi-modal Query Answering System With Retrieval-augmented Large Language Models (2024)5.84
- SLQ: Bridging Modalities Via Shared Latent Queries For Retrieval With Frozen Mllms (2026)0.00
- Mm-embed: Universal Multimodal Retrieval With Multimodal Llms (2024)0.00
- RETLLM: Training And Data-free Mllms For Multimodal Information Retrieval (2026)1.57
- Multimodal Neural Databases (2023)10.74
- IDMR: Towards Instance-driven Precise Visual Correspondence In Multimodal Retrieval (2025)2.29
- GME: Improving Universal Multimodal Retrieval By Multimodal Llms (2024)0.00
- Indexing Multimodal Language Models For Large-scale Image Retrieval (2026)0.00