Treehop: Generate And Filter Next Query Embeddings Efficiently For Multi-hop Question Answering
2025 Β· Zhonghao Li, Kunpeng Zhang, Jinghuai Ou, et al.
Abstract
Retrieval-augmented generation (RAG) systems face significant challenges in multi-hop question answering (MHQA), where complex queries require synthesizing information across multiple document chunks. Existing approaches typically rely on iterative LLM-based query rewriting and routing, resulting in high computational costs due to repeated LLM invocations and multi-stage processes. To address these limitations, we propose TreeHop, an embedding-level framework without the need for LLMs in query refinement. TreeHop dynamically updates query embeddings by fusing semantic information from prior queries and retrieved documents, enabling iterative retrieval through embedding-space operations alone. This method replaces the traditional "Retrieve-Rewrite-Vectorize-Retrieve" cycle with a streamlined "Retrieve-Embed-Retrieve" loop, significantly reducing computational overhead. Moreover, a rule-based stop criterion is introduced to further prune redundant retrievals, balancing efficiency and rec
Authors
(none)
Tags
Stats
Related papers
- Grithopper: Decomposition-free Multi-hop Dense Retrieval (2025)0.00
- Multimodal RAG For Unstructured Data:leveraging Modality-aware Knowledge Graphs With Hybrid Retrieval (2025)0.00
- Hetarag: Hybrid Deep Retrieval-augmented Generation Across Heterogeneous Data Stores (2025)3.27
- HASH-RAG: Bridging Deep Hashing With Retriever For Efficient, Fine Retrieval And Augmented Generation (2025)0.00
- Hyem: Query-adaptive Hyperbolic Retrieval For Biomedical Ontologies Via Euclidean Vector Indexing (2026)0.00
- Hyprag: Hyperbolic Dense Retrieval For Retrieval Augmented Generation (2026)0.00
- A Systematic Study Of Retrieval Pipeline Design For Retrieval-augmented Medical Question Answering (2026)0.00
- Resolving The Robustness-precision Trade-off In Financial RAG Through Hybrid Document-routed Retrieval (2026)0.00