MST-R: Multi-stage Tuning For Retrieval Systems And Metric Evaluation
2024 Β· Yash Malviya, Karan Dhingra, Maneesh Singh
Abstract
Regulatory documents are rich in nuanced terminology and specialized semantics. FRAG systems: Frozen retrieval-augmented generators utilizing pre-trained (or, frozen) components face consequent challenges with both retriever and answering performance. We present a system that adapts the retriever performance to the target domain using a multi-stage tuning (MST) strategy. Our retrieval approach, called MST-R (a) first fine-tunes encoders used in vector stores using hard negative mining, (b) then uses a hybrid retriever, combining sparse and dense retrievers using reciprocal rank fusion, and then (c) adapts the cross-attention encoder by fine-tuning only the top-k retrieved results. We benchmark the system performance on the dataset released for the RIRAG challenge (as part of the RegNLP workshop at COLING 2025). We achieve significant performance gains obtaining a top rank on the RegNLP challenge leaderboard. We also show that a trivial answering approach games the RePASs metric outscor
Authors
(none)
Tags
Stats
Related papers
- DS@GT At TREC TOT 2025: Bridging Vague Recollection With Fusion Retrieval And Learned Reranking (2026)0.00
- RAG Playground: A Framework For Systematic Evaluation Of Retrieval Strategies And Prompt Engineering In RAG Systems (2024)0.00
- Optimizing Retrieval For RAG Via Reinforcement Learning (2025)0.00
- Frustratingly Simple Retrieval Improves Challenging, Reasoning-intensive Benchmarks (2025)0.00
- REAL-MM-RAG: A Real-world Multi-modal Retrieval Benchmark (2025)4.52
- Mor: Better Handling Diverse Queries With A Mixture Of Sparse, Dense, And Human Retrievers (2025)2.26
- DAT: Dynamic Alpha Tuning For Hybrid Retrieval In Retrieval-augmented Generation (2025)0.00
- Towards Robust Ranker For Text Retrieval (2022)5.84