Optimizing Retrieval-augmented Generation: Analysis Of Hyperparameter Impact On Performance And Efficiency
2025 Β· Adel Ammar, Anis Koubaa, Omer Nacar, et al.
Abstract
Large language models achieve high task performance yet often hallucinate or rely on outdated knowledge. Retrieval-augmented generation (RAG) addresses these gaps by coupling generation with external search. We analyse how hyperparameters influence speed and quality in RAG systems, covering Chroma and Faiss vector stores, chunking policies, cross-encoder re-ranking, and temperature, and we evaluate six metrics: faithfulness, answer correctness, answer relevancy, context precision, context recall, and answer similarity. Chroma processes queries 13% faster, whereas Faiss yields higher retrieval precision, revealing a clear speed-accuracy trade-off. Naive fixed-length chunking with small windows and minimal overlap outperforms semantic segmentation while remaining the quickest option. Re-ranking provides modest gains in retrieval quality yet increases runtime by roughly a factor of 5, so its usefulness depends on latency constraints. These results help practitioners balance computational
Authors
(none)
Tags
Stats
Related papers
- Rag-check: Evaluating Multimodal Retrieval Augmented Generation Performance (2025)0.00
- Re-ranking The Context For Multimodal Retrieval Augmented Generation (2025)0.00
- Hetarag: Hybrid Deep Retrieval-augmented Generation Across Heterogeneous Data Stores (2025)3.27
- Funnelrag: A Coarse-to-fine Progressive Retrieval Paradigm For RAG (2024)3.58
- Ragsmith: A Framework For Finding The Optimal Composition Of Retrieval-augmented Generation Methods Across Datasets (2025)0.00
- Neurosymbolic Retrievers For Retrieval-augmented Generation (2026)0.00
- Optimizing Retrieval For RAG Via Reinforcement Learning (2025)0.00
- Erarag: Efficient And Incremental Retrieval Augmented Generation For Growing Corpora (2025)4.51