Adversarial Sampling And Training For Semi-supervised Information Retrieval
2018 · Dae Hoon Park, Yi Chang
Abstract
Ad-hoc retrieval models trained on implicit feedback often suffer from problems such as class imbalance in the data set. Too few clicked documents may hurt the generalization ability of the models, whereas too many non-clicked documents may harm both the effectiveness of the models and the efficiency of training. In addition, recent neural network-based models are vulnerable to adversarial examples because of their linear nature. To address these problems together, we propose an adversarial sampling and training framework for learning ad-hoc retrieval models with implicit feedback. Our key idea is (i) to augment clicked examples by adversarial training for better generalization and (ii) to obtain highly informative non-clicked examples by adversarial sampling and training. Experiments are performed on benchmark data sets for common ad-hoc retrieval tasks such as Web search, item recommendation, and question answering. Experimental results indicate that the proposed approaches significantly outperform strong
Related papers
- Noisy Self-training With Synthetic Queries For Dense Retrieval (2023) · 0.00
- Learning More From Less: Towards Strengthening Weak Supervision For Ad-hoc Retrieval (2019) · 5.84
- Domain Adaptation For Dense Retrieval Through Self-supervision By Pseudo-relevance Labeling (2022) · 0.00
- Optimizing Dense Retrieval Model Training With Hard Negatives (2021) · 16.34
- A Review Of Image Retrieval Techniques: Data Augmentation And Adversarial Learning Approaches (2024) · 0.00
- ESANS: Effective And Semantic-aware Negative Sampling For Large-scale Retrieval Systems (2025) · 2.26
- Pre-training For Ad-hoc Retrieval: Hyperlink Is Also You Need (2021) · 10.35
- Unite: Uncertainty-based Iterative Document Sampling For Domain Adaptation In Information Retrieval (2026) · 0.00