Scaling Sparse And Dense Retrieval In Decoder-only Llms
2025 Β· Hansi Zeng, Julian Killingback, Hamed Zamani
Abstract
Scaling large language models (LLMs) has shown great potential for improving retrieval model performance; however, previous studies have mainly focused on dense retrieval trained with contrastive loss (CL), neglecting the scaling behavior of other retrieval paradigms and optimization techniques, such as sparse retrieval and knowledge distillation (KD). In this work, we conduct a systematic comparative study on how different retrieval paradigms (sparse vs. dense) and fine-tuning objectives (CL vs. KD vs. their combination) affect retrieval performance across different model scales. Using MSMARCO passages as the training dataset, decoder-only LLMs (Llama-3 series: 1B, 3B, 8B), and a fixed compute budget, we evaluate various training configurations on both in-domain (MSMARCO, TREC DL) and out-of-domain (BEIR) benchmarks. Our key findings reveal that: (1) Scaling behaviors emerge clearly only with CL, where larger models achieve significant performance gains, whereas KD-trained models show
Authors
(none)
Tags
Stats
Related papers
- Scalingnote: Scaling Up Retrievers With Large Language Models For Real-world Dense Retrieval (2024)0.00
- A Comparative Study Of Specialized Llms As Dense Retrievers (2025)2.26
- CSPLADE: Learned Sparse Retrieval With Causal Language Models (2025)0.00
- SLQ: Bridging Modalities Via Shared Latent Queries For Retrieval With Frozen Mllms (2026)0.00
- Scaling Laws For Dense Retrieval (2024)10.07
- Evaluating The Effectiveness And Scalability Of Llm-based Data Augmentation For Retrieval (2025)0.00
- Making Large Language Models Efficient Dense Retrievers (2025)0.00
- Transforming Llms Into Cross-modal And Cross-lingual Retrieval Systems (2024)4.52