Learning To Simulate Self-driven Particles System With Coordinated Policy Optimization
2021 Β· Zhenghao Peng, Quanyi Li, Ka Ming Hui, et al.
Abstract
Self-Driven Particles (SDP) describe a category of multi-agent systems common in everyday life, such as flocking birds and traffic flows. In a SDP system, each agent pursues its own goal and constantly changes its cooperative or competitive behaviors with its nearby agents. Manually designing the controllers for such SDP system is time-consuming, while the resulting emergent behaviors are often not realistic nor generalizable. Thus the realistic simulation of SDP systems remains challenging. Reinforcement learning provides an appealing alternative for automating the development of the controller for SDP. However, previous multi-agent reinforcement learning (MARL) methods define the agents to be teammates or enemies before hand, which fail to capture the essence of SDP where the role of each agent varies to be cooperative or competitive even within one episode. To simulate SDP with MARL, a key challenge is to coordinate agents' behaviors while still maximizing individual objectives. Tak
Authors
(none)
Tags
Stats
Related papers
- Role Play: Learning Adaptive Role-specific Strategies In Multi-agent Interactions (2024)0.00
- Spacer: Self-play Anchoring With Centralized Reference Models (2025)0.00
- Multi-agent Cooperation Through Learning-aware Policy Gradients (2024)0.00
- Large-scale Traffic Signal Control Using A Novel Multi-agent Reinforcement Learning (2019)16.21
- Co2po: Coordinated Constrained Policy Optimization For Multi-agent RL (2026)0.00
- Strategic Coordination For Evolving Multi-agent Systems: A Hierarchical Reinforcement And Collective Learning Approach (2025)0.00
- Evolution Of Societies Via Reinforcement Learning (2024)0.00
- MACS: Deep Reinforcement Learning Based SDN Controller Synchronization Policy Design (2019)9.92