Dual Policy Distillation
2020 Β· Kwei-Herng Lai, Daochen Zha, Yuening Li, et al.
Abstract
Policy distillation, which transfers a teacher policy to a student policy has achieved great success in challenging tasks of deep reinforcement learning. This teacher-student framework requires a well-trained teacher model which is computationally expensive. Moreover, the performance of the student model could be limited by the teacher model if the teacher model is not optimal. In the light of collaborative learning, we study the feasibility of involving joint intellectual efforts from diverse perspectives of student models. In this work, we introduce dual policy distillation(DPD), a student-student framework in which two learners operate on the same environment to explore different perspectives of the environment and extract knowledge from each other to enhance their learning. The key challenge in developing this dual learning framework is to identify the beneficial knowledge from the peer learner for contemporary learning-based reinforcement learning algorithms, since it is unclear w
Authors
(none)
Tags
Stats
Related papers
- Online Policy Distillation With Decision-attention (2024)0.00
- Continual Policy Distillation From Distributed Reinforcement Learning Teachers (2026)0.00
- Continual Deep Reinforcement Learning With Task-agnostic Policy Distillation (2024)0.00
- Fedhpd: Heterogeneous Federated Reinforcement Learning Via Policy Distillation (2025)2.26
- Transfer Heterogeneous Knowledge Among Peer-to-peer Teammates: A Model Distillation Approach (2020)0.00
- KD-MARL: Resource-aware Knowledge Distillation In Multi-agent Reinforcement Learning (2026)0.00
- Periodic Intra-ensemble Knowledge Distillation For Reinforcement Learning (2020)4.52
- A New Framework For Multi-agent Reinforcement Learning -- Centralized Training And Exploration With Decentralized Execution Via Policy Distillation (2019)0.00