Hyprag: Hyperbolic Dense Retrieval For Retrieval Augmented Generation
2026 Β· Hiren Madhu, Ngoc Bui, Ali Maatouk, et al.
Abstract
Embedding geometry plays a fundamental role in retrieval quality, yet dense retrievers for retrieval-augmented generation (RAG) remain largely confined to Euclidean space. However, natural language exhibits hierarchical structure from broad topics to specific entities that Euclidean embeddings fail to preserve, causing semantically distant documents to appear spuriously similar and increasing hallucination risk. To address these limitations, we introduce hyperbolic dense retrieval, developing two model variants in the Lorentz model of hyperbolic space: HyTE-FH, a fully hyperbolic transformer, and HyTE-H, a hybrid architecture projecting pre-trained Euclidean embeddings into hyperbolic space. To prevent representational collapse during sequence aggregation, we introduce the Outward Einstein Midpoint, a geometry-aware pooling operator that provably preserves hierarchical structure. On MTEB, HyTE-FH outperforms equivalent Euclidean baselines, while on RAGBench, HyTE-H achieves up to 29% g
Authors
(none)
Tags
Stats
Related papers
- Hyem: Query-adaptive Hyperbolic Retrieval For Biomedical Ontologies Via Euclidean Vector Indexing (2026)0.00
- Hetarag: Hybrid Deep Retrieval-augmented Generation Across Heterogeneous Data Stores (2025)3.27
- Advancing Retrieval-augmented Generation For Structured Enterprise And Internal Data (2025)1.20
- Optimizing Retrieval-augmented Generation: Analysis Of Hyperparameter Impact On Performance And Efficiency (2025)0.00
- Hyperbolic Hierarchical Alignment Reasoning Network For Text-3d Retrieval (2025)1.81
- Multimodal RAG For Unstructured Data:leveraging Modality-aware Knowledge Graphs With Hybrid Retrieval (2025)0.00
- Hierarchical Retrieval: The Geometry And A Pretrain-finetune Recipe (2025)0.99
- Ragdb: A Zero-dependency, Embeddable Architecture For Multimodal Retrieval-augmented Generation On The Edge (2025)0.00