CHASE: A Native Relational Database For Hybrid Queries On Structured And Unstructured Data
2025 Β· Rui Ma, Kai Zhang, Zhenying He, et al.
Abstract
Querying both structured and unstructured data has become a new paradigm in data analytics and recommendation. With unstructured data, such as text and videos, are converted to high-dimensional vectors and queried with approximate nearest neighbor search (ANNS). State-of-the-art database systems implement vector search as a plugin in the relational query engine, which tries to utilize the ANN index to enhance performance. After investigating a broad range of hybrid queries, we find that such designs may miss potential optimization opportunities and achieve suboptimal performance for certain queries. In this paper, we propose CHASE, a query engine that is natively designed to support efficient hybrid queries on structured and unstructured data. CHASE performs specific designs and optimizations on multiple stages in query processing. First, semantic analysis is performed to categorize queries and optimize query plans dynamically. Second, new physical operators are implemented to avoid re
Authors
(none)
Tags
Stats
Related papers
- HQANN: Efficient And Robust Similarity Search For Hybrid Queries With Structured And Unstructured Constraints (2022)9.76
- Navigable Proximity Graph-driven Native Hybrid Queries With Structured And Unstructured Constraints (2022)0.00
- Frequency-aware Graph Construction And Search For Dynamic Vector Databases (2025)0.00
- ACORN: Performant And Predicate-agnostic Search Over Vector Embeddings And Structured Data (2024)11.76
- DEG: Efficient Hybrid Vector Search Using The Dynamic Edge Navigation Graph (2025)6.34
- DGAI: Decoupled On-disk Graph-based ANN Index For Efficient Updates And Queries (2025)0.00
- All-in-one Graph-based Indexing For Hybrid Search On Gpus (2025)0.00
- A Reference Architecture For Agentic Hybrid Retrieval In Dataset Search (2026)0.00