Optimal Decision-making In Mixed-agent Partially Observable Stochastic Environments Via Reinforcement Learning
2019 Β· Roi Ceren
Abstract
Optimal decision making with limited or no information in stochastic environments where multiple agents interact is a challenging topic in the realm of artificial intelligence. Reinforcement learning (RL) is a popular approach for arriving at optimal strategies by predicating stimuli, such as the reward for following a strategy, on experience. RL is heavily explored in the single-agent context, but is a nascent concept in multiagent problems. To this end, I propose several principled model-free and partially model-based reinforcement learning approaches for several multiagent settings. In the realm of normative reinforcement learning, I introduce scalable extensions to Monte Carlo exploring starts for partially observable Markov Decision Processes (POMDP), dubbed MCES-P, where I expand the theory and algorithm to the multiagent setting. I first examine MCES-P with probably approximately correct (PAC) bounds in the context of multiagent setting, showing MCESP+PAC holds in the presence o
Authors
(none)
Tags
Stats
Related papers
- Multi-agent Off-policy Actor-critic Reinforcement Learning For Partially Observable Environments (2024)2.26
- Achieving Fairness In Multi-agent Markov Decision Processes Using Reinforcement Learning (2023)0.00
- Macro-action-based Multi-agent/robot Deep Reinforcement Learning Under Partial Observability (2022)5.84
- Centralized Model And Exploration Policy For Multi-agent RL (2021)0.00
- Sample-efficient Reinforcement Learning Of Partially Observable Markov Games (2022)0.00
- Goal-oriented Inference Of Environment From Redundant Observations (2023)3.58
- Reinforcement Learning Under Partial Observability Guided By Learned Environment Models (2022)6.34
- Actor-critic Policy Optimization In Partially Observable Multiagent Environments (2018)0.00