CSPLADE: Learned Sparse Retrieval With Causal Language Models
2025 Β· Zhichao Xu, Aosong Feng, Yijun Tian, et al.
Abstract
In recent years, dense retrieval has been the focus of information retrieval (IR) research. While effective, dense retrieval produces uninterpretable dense vectors, and suffers from the drawback of large index size. Learned sparse retrieval (LSR) has emerged as promising alternative, achieving competitive retrieval performance while also being able to leverage the classical inverted index data structure for efficient retrieval. However, limited works have explored scaling LSR beyond BERT scale. In this work, we identify two challenges in training large language models (LLM) for LSR: (1) training instability during the early stage of contrastive training; (2) suboptimal performance due to pre-trained LLM's unidirectional attention. To address these challenges, we propose two corresponding techniques: (1) a lightweight adaptation training phase to eliminate training instability; (2) two model variants to enable bidirectional information. With these techniques, we are able to train LSR mo
Authors
(none)
Tags
Stats
Related papers
- Scaling Sparse And Dense Retrieval In Decoder-only Llms (2025)6.34
- SLQ: Bridging Modalities Via Shared Latent Queries For Retrieval With Frozen Mllms (2026)0.00
- Learning Retrieval Models With Sparse Autoencoders (2026)0.00
- Sparse And Dense Retrievers Learn Better Together: Joint Sparse-dense Optimization For Text-image Retrieval (2025)0.00
- Mistral-splade: Llms For Better Learned Sparse Retrieval (2024)0.00
- To Case Or Not To Case: An Empirical Study In Learned Sparse Retrieval (2026)1.57
- Scalingnote: Scaling Up Retrievers With Large Language Models For Real-world Dense Retrieval (2024)0.00
- On The Challenges And Opportunities Of Learned Sparse Retrieval For Code (2026)0.00