Rear: Retrieve, Expand And Refine For Effective Multitable Retrieval
2025 Β· Rishita Agarwal, Himanshu Singhal, Peter Baile Chen, et al.
Abstract
Answering natural language queries over relational data often requires retrieving and reasoning over multiple tables, yet most retrievers optimize only for query-table relevance and ignore table table compatibility. We introduce REAR (Retrieve, Expand and Refine), a three-stage, LLM-free framework that separates semantic relevance from structural joinability for efficient, high-fidelity multi-table retrieval. REAR (i) retrieves query-aligned tables, (ii) expands these with structurally joinable tables via fast, precomputed column-embedding comparisons, and (iii) refines them by pruning noisy or weakly related candidates. Empirically, REAR is retriever-agnostic and consistently improves dense/sparse retrievers on complex table QA datasets (BIRD, MMQA, and Spider) by improving both multi-table retrieval quality and downstream SQL execution. Despite being LLM-free, it delivers performance competitive with state-of-the-art LLM-augmented retrieval systems (e.g.,ARM) while achieving much low
Authors
(none)
Tags
Stats
Related papers
- Expandr: Teaching Dense Retrievers Beyond Queries With LLM Guidance (2025)3.25
- Your Dense Retriever Is Secretly An Expeditious Reasoner (2025)0.00
- A Reference Architecture For Agentic Hybrid Retrieval In Dataset Search (2026)0.00
- Mor: Better Handling Diverse Queries With A Mixture Of Sparse, Dense, And Human Retrievers (2025)2.26
- Lightretriever: A Llm-based Text Retrieval Architecture With Extremely Faster Query Inference (2025)0.00
- Rebol: Retrieval Via Bayesian Optimization With Batched LLM Relevance Observations And Query Reformulation (2026)0.00
- MARVEL: Multimodal Adaptive Reasoning-intensive Expand-rerank And Retrieval (2026)0.00
- An Interactive Multi-modal Query Answering System With Retrieval-augmented Large Language Models (2024)5.84