FRAG: Toward Federated Vector Database Management For Collaborative And Secure Retrieval-augmented Generation
2024 Β· Dongfang Zhao
Abstract
This paper introduces \textit\{Federated Retrieval-Augmented Generation (FRAG)\}, a novel database management paradigm tailored for the growing needs of retrieval-augmented generation (RAG) systems, which are increasingly powered by large-language models (LLMs). FRAG enables mutually-distrusted parties to collaboratively perform Approximate \(k\)-Nearest Neighbor (ANN) searches on encrypted query vectors and encrypted data stored in distributed vector databases, all while ensuring that no party can gain any knowledge about the queries or data of others. Achieving this paradigm presents two key challenges: (i) ensuring strong security guarantees, such as Indistinguishability under Chosen-Plaintext Attack (IND-CPA), under practical assumptions (e.g., we avoid overly optimistic assumptions like non-collusion among parties); and (ii) maintaining performance overheads comparable to traditional, non-federated RAG systems. To address these challenges, FRAG employs a single-key homomorphic enc
Authors
(none)
Tags
Stats
Related papers
- Ragdb: A Zero-dependency, Embeddable Architecture For Multimodal Retrieval-augmented Generation On The Edge (2025)0.00
- Hetarag: Hybrid Deep Retrieval-augmented Generation Across Heterogeneous Data Stores (2025)3.27
- SRAG: RAG With Structured Data Improves Vector Retrieval (2026)0.00
- Cracking Vector Search Indexes (2025)3.58
- HASH-RAG: Bridging Deep Hashing With Retriever For Efficient, Fine Retrieval And Augmented Generation (2025)0.00
- Self-aware Vector Embeddings For Retrieval-augmented Generation: A Neuroscience-inspired Framework For Temporal, Confidence-weighted, And Relational Knowledge (2026)0.00
- Advancing Retrieval-augmented Generation For Structured Enterprise And Internal Data (2025)1.20
- Don't Forget Private Retrieval: Distributed Private Similarity Search For Large Language Models (2023)5.24