HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights
2025 · Ozan Gokdemir, Carlo Siebenschuh, Alexander Brace, et al.
Abstract
The volume of scientific literature is growing exponentially, leading to underutilized discoveries, duplicated efforts, and limited cross-disciplinary collaboration. Retrieval Augmented Generation (RAG) offers a way to assist scientists by improving the factuality of Large Language Models (LLMs) in processing this influx of information. However, scaling RAG to handle millions of articles introduces significant challenges, including the high computational costs associated with parsing documents and embedding scientific knowledge, as well as the algorithmic complexity of aligning these representations with the nuanced semantics of scientific content. To address these issues, we introduce HiPerRAG, a RAG workflow powered by high performance computing (HPC) to index and retrieve knowledge from more than 3.6 million scientific articles. At its core are Oreo, a high-throughput model for multimodal document parsing, and ColTrast, a query-aware encoder fine-tuning algorithm that enhances retri
Related papers
- Hetarag: Hybrid Deep Retrieval-augmented Generation Across Heterogeneous Data Stores (2025)
- HASH-RAG: Bridging Deep Hashing With Retriever For Efficient, Fine Retrieval And Augmented Generation (2025)
- Ragdb: A Zero-dependency, Embeddable Architecture For Multimodal Retrieval-augmented Generation On The Edge (2025)
- Graph-based Retriever Captures The Long Tail Of Biomedical Knowledge (2024)
- Graph-aware Late Chunking For Retrieval-augmented Generation In Biomedical Literature (2026)
- Advancing Retrieval-augmented Generation For Structured Enterprise And Internal Data (2025)
- Erarag: Efficient And Incremental Retrieval Augmented Generation For Growing Corpora (2025)
- Ragperf: An End-to-end Benchmarking Framework For Retrieval-augmented Generation Systems (2026)