Sampling Is All You Need On Modeling Long-term User Behaviors For CTR Prediction
2022 · Yue Cao, Xiaojiang Zhou, Jiaqi Feng, et al.
Abstract
Rich user behavior data has proven to be of great value for Click-Through Rate (CTR) prediction, especially in industrial recommender, search, or advertising systems. However, it is non-trivial for real-world systems to make full use of long-term user behaviors because of the strict requirements on online serving time. Most previous works adopt a retrieval-based strategy, in which a small number of user behaviors are first retrieved for subsequent attention. These retrieval-based methods are sub-optimal: they lose some information, and it is difficult to balance the effectiveness and efficiency of the retrieval algorithm. In this paper, we propose SDIM (Sampling-based Deep Interest Modeling), a simple yet effective sampling-based end-to-end approach for modeling long-term user behaviors. We sample from multiple hash functions to generate hash signatures of the candidate item and each item in the user behavior sequence, and obtain the user interes
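The signature step described in the abstract can be illustrated with a minimal sketch. The abstract is cut off before it explains how the user interest is obtained, so everything past the signature computation is an assumption here: the sketch assumes random-hyperplane (SimHash-style) hash functions and assumes the interest vector is the average of behavior items whose signature collides with the candidate's. The names `make_hashes`, `signature`, and `sampled_interest` are hypothetical and are not taken from the paper.

```python
import random


def make_hashes(dim, num_hashes, width, seed=0):
    """Draw num_hashes groups of `width` random hyperplanes (assumed SimHash-style)."""
    rng = random.Random(seed)
    return [
        [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(width)]
        for _ in range(num_hashes)
    ]


def signature(vec, planes):
    """Hash signature: the sign pattern of vec against one group of hyperplanes."""
    return tuple(sum(p * v for p, v in zip(plane, vec)) >= 0 for plane in planes)


def sampled_interest(candidate, behaviors, hashes):
    """Assumed aggregation: average behavior items that collide with the candidate."""
    total = [0.0] * len(candidate)
    hits = 0
    for planes in hashes:
        cand_sig = signature(candidate, planes)
        for item in behaviors:
            if signature(item, planes) == cand_sig:
                total = [t + x for t, x in zip(total, item)]
                hits += 1
    # With no collisions, fall back to the zero vector.
    return [t / hits for t in total] if hits else total


# Toy usage: 4-dimensional item embeddings, 8 hash functions of width 2.
hashes = make_hashes(dim=4, num_hashes=8, width=2)
candidate = [1.0, 0.0, 0.0, 0.0]
behaviors = [[1.0, 0.0, 0.0, 0.0], [0.9, 0.1, 0.0, 0.0], [-1.0, 0.0, 0.0, 0.0]]
interest = sampled_interest(candidate, behaviors, hashes)
```

In this sketch, behavior items that point in a similar direction to the candidate collide under more hash functions and so contribute more to the result, while no step ever scores the candidate against the full sequence explicitly. Whether this matches the paper's actual formulation should be checked against the full text.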
Related papers
- R2LED: Equipping Retrieval And Refinement In Lifelong User Modeling With Semantic Ids For CTR Prediction (2026)
- SEMINAR: Search Enhanced Multi-modal Interest Network And Approximate Retrieval For Lifelong Sequential Recommendation (2024)
- Everyone's Preference Changes Differently: Weighted Multi-interest Retrieval Model (2022)
- Synergizing Implicit And Explicit User Interests: A Multi-embedding Retrieval Framework At Pinterest (2025)
- Taxonomy-based Negative Sampling In Personalized Semantic Search For E-commerce (2025)
- Learning To Hash For Recommendation: A Survey (2024)
- Crops: Improving Dense Retrieval With Cross-perspective Positive Samples In Short-video Search (2025)
- Influence Guided Sampling For Domain Adaptation Of Text Retrievers (2026)