RADAR: Recall Augmentation Through Deferred Asynchronous Retrieval
2025 Β· Amit Jaspal, Qian Dang, Ajantha Ramineni
Abstract
Modern large-scale recommender systems employ multi-stage ranking funnel (Retrieval, Pre-ranking, Ranking) to balance engagement and computational constraints (latency, CPU). However, the initial retrieval stage, often relying on efficient but less precise methods like K-Nearest Neighbors (KNN), struggles to effectively surface the most engaging items from billion-scale catalogs, particularly distinguishing highly relevant and engaging candidates from merely relevant ones. We introduce Recall Augmentation through Deferred Asynchronous Retrieval (RADAR), a novel framework that leverages asynchronous, offline computation to pre-rank a significantly larger candidate set for users using the full complexity ranking model. These top-ranked items are stored and utilized as a high-quality retrieval source during online inference, bypassing online retrieval and pre-ranking stages for these candidates. We demonstrate through offline experiments that RADAR significantly boosts recall (2X Recall@2
Authors
(none)
Tags
Stats
Related papers
- Deep Retrieval: Learning A Retrievable Structure For Large-scale Recommendations (2020)0.00
- Raserec: Retrieval-augmented Sequential Recommendation (2024)2.92
- RAD-DPO: Robust Adaptive Denoising Direct Preference Optimization For Generative Retrieval In E-commerce (2026)0.00
- Optimizing Retrieval For RAG Via Reinforcement Learning (2025)0.00
- Grank: Towards Target-aware And Streamlined Industrial Retrieval With A Generate-rank Framework (2025)0.00
- Hierarchical Structured Neural Network: Efficient Retrieval Scaling For Large Scale Recommendation (2024)0.00
- Domain-adaptive And Scalable Dense Retrieval For Content-based Recommendation (2026)0.00
- Frustratingly Simple Retrieval Improves Challenging, Reasoning-intensive Benchmarks (2025)0.00