MDDL: A Framework For Reinforcement Learning-based Position Allocation In Multi-channel Feed
2023 · Xiaowen Shi, Ze Wang, Yuanying Cai, et al.
Abstract
Nowadays, the mainstream approach in position allocation systems is to use a reinforcement learning (RL) model to allocate appropriate locations for items from various channels and then mix them into the feed. Two types of data are used to train the RL model for position allocation: strategy data and random data. Strategy data is collected from the current online model and suffers from an imbalanced distribution of state-action pairs, which results in severe overestimation problems during training. Random data, on the other hand, offers a more uniform distribution of state-action pairs but is challenging to obtain in industrial scenarios, because random exploration could negatively impact platform revenue and user experience. Because the two types of data have different distributions, designing an effective strategy that leverages both to improve RL model training has become a highly challenging problem. In this study, we propo
Related papers
- Dynamic Channel Access Via Meta-reinforcement Learning (2021) 5.84
- Distributional Reinforcement Learning For Multi-dimensional Reward Functions (2021) 0.00
- A Multi-task Approach To Robust Deep Reinforcement Learning For Resource Allocation (2023) 0.00
- ComaDICE: Offline Cooperative Multi-agent Reinforcement Learning With Stationary Distribution Shift Regularization (2024) 0.00
- Offline Reinforcement Learning For Wireless Network Optimization With Mixture Datasets (2023) 9.59
- Multi-agent Reinforcement Learning For Resources Allocation Optimization: A Survey (2025) 0.00
- Bridging Distributionally Robust Learning And Offline RL: An Approach To Mitigate Distribution Shift And Partial Data Coverage (2023) 0.00
- Distributional Reward Estimation For Effective Multi-agent Deep Reinforcement Learning (2022) 0.00