Revisiting Query Variants: The Advantage Of Retrieval Over Generation Of Query Variants For Effective QPP
2025 Β· Fangzheng Tian, Debasis Ganguly, Craig MacDonald
Abstract
Leveraging query variants (QVs), i.e., queries with potentially similar information needs to the target query, has been shown to improve the effectiveness of query performance prediction (QPP) approaches. Existing QV-based QPP methods generate QVs facilitated by either query expansion or non-contextual embeddings, which may introduce topical drifts and hallucinations. In this paper, we propose a method that retrieves QVs from a training set (e.g., MS MARCO) for a given target query of QPP. To achieve a high recall in retrieving queries with the most similar information needs as the target query from a training set, we extend the directly retrieved QVs (1-hop QVs) by a second retrieval using their denoted relevant documents (which yields 2-hop QVs). Our experiments, conducted on TREC DL'19 and DL'20, show that the QPP methods with QVs retrieved by our method outperform the best-performing existing generated-QV-based QPP approaches by as much as around 20%, on neural ranking models like
Authors
(none)
Tags
Stats
Related papers
- VQPP: Video Query Performance Prediction Benchmark (2026)0.78
- An Analysis Of Variations In The Effectiveness Of Query Performance Prediction (2022)6.34
- On Coherence-based Predictors For Dense Query Performance Prediction (2023)0.00
- Enhancing Document VQA Models Via Retrieval-augmented Generation (2025)0.00
- POQD: Performance-oriented Query Decomposer For Multi-vector Retrieval (2025)2.46
- Generative Query Reformulation Using Ensemble Prompting, Document Fusion, And Relevance Feedback (2024)0.00
- Evaluating The Robustness Of Retrieval Pipelines With Query Variation Generators (2021)13.36
- Fix Before Search: Benchmarking Agentic Query Visual Pre-processing In Multimodal Retrieval-augmented Generation (2026)1.24