Disentangled Modeling Of Domain And Relevance For Adaptable Dense Retrieval
2022 Β· Jingtao Zhan, Qingyao Ai, Yiqun Liu, et al.
Abstract
Recent advance in Dense Retrieval (DR) techniques has significantly improved the effectiveness of first-stage retrieval. Trained with large-scale supervised data, DR models can encode queries and documents into a low-dimensional dense space and conduct effective semantic matching. However, previous studies have shown that the effectiveness of DR models would drop by a large margin when the trained DR models are adopted in a target domain that is different from the domain of the labeled data. One of the possible reasons is that the DR model has never seen the target corpus and thus might be incapable of mitigating the difference between the training and target domains. In practice, unfortunately, training a DR model for each target domain to avoid domain shift is often a difficult task as it requires additional time, storage, and domain-specific data labeling, which are not always available. To address this problem, in this paper, we propose a novel DR framework named Disentangled Dense
Authors
(none)
Tags
Stats
Related papers
- Towards Dynamic Dense Retrieval With Routing Strategy (2026)0.00
- How To Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval (2023)11.39
- Interpreting Dense Retrieval As Mixture Of Topics (2021)0.00
- Learning To Retrieve: How To Train A Dense Retrieval Model Effectively And Efficiently (2020)0.00
- Dense Retrieval Adaptation Using Target Domain Description (2023)7.50
- Domain Adaptation For Dense Retrieval Through Self-supervision By Pseudo-relevance Labeling (2022)0.00
- Are We There Yet? A Decision Framework For Replacing Term Based Retrieval With Dense Retrieval Systems (2022)0.00
- Few-shot Conversational Dense Retrieval (2021)16.68