HASH-RAG: Bridging Deep Hashing With Retriever For Efficient, Fine Retrieval And Augmented Generation
2025 Β· Jinyu Guo, Xunlei Chen, Qiyang Xia, et al.
Abstract
Retrieval-Augmented Generation (RAG) encounters efficiency challenges when scaling to massive knowledge bases while preserving contextual relevance. We propose Hash-RAG, a framework that integrates deep hashing techniques with systematic optimizations to address these limitations. Our queries directly learn binary hash codes from knowledgebase code, eliminating intermediate feature extraction steps, and significantly reducing storage and computational overhead. Building upon this hash-based efficient retrieval framework, we establish the foundation for fine-grained chunking. Consequently, we design a Prompt-Guided Chunk-to-Context (PGCC) module that leverages retrieved hash-indexed propositions and their original document segments through prompt engineering to enhance the LLM's contextual awareness. Experimental evaluations on NQ, TriviaQA, and HotpotQA datasets demonstrate that our approach achieves a 90% reduction in retrieval time compared to conventional methods while maintaining c
Authors
(none)
Tags
Stats
Related papers
- Hetarag: Hybrid Deep Retrieval-augmented Generation Across Heterogeneous Data Stores (2025)3.27
- Ragdb: A Zero-dependency, Embeddable Architecture For Multimodal Retrieval-augmented Generation On The Edge (2025)0.00
- SRAG: RAG With Structured Data Improves Vector Retrieval (2026)0.00
- Frustratingly Simple Retrieval Improves Challenging, Reasoning-intensive Benchmarks (2025)0.00
- Dota-rag: Dynamic Of Thought Aggregation RAG (2025)0.00
- Hiperrag: High-performance Retrieval Augmented Generation For Scientific Insights (2025)6.34
- RAG Without Forgetting: Continual Query-infused Key Memory (2026)0.00
- Slimrag: Retrieval Without Graphs Via Entity-aware Context Selection (2025)1.91