Performance Dynamics And Termination Errors In Reinforcement Learning: A Unifying Perspective
2019 Β· Nikki Lijing Kuang, Clement H. C. Leung
Abstract
In reinforcement learning, a decision needs to be made at some point as to whether it is worthwhile to carry on with the learning process or to terminate it. In many such situations, stochastic elements are often present which govern the occurrence of rewards, with the sequential occurrences of positive rewards randomly interleaved with negative rewards. For most practical learners, the learning is considered useful if the number of positive rewards always exceeds the negative ones. A situation that often calls for learning termination is when the number of negative rewards exceeds the number of positive rewards. However, while this seems reasonable, the error of premature termination, whereby termination is enacted along with the conclusion of learning failure despite the positive rewards eventually far outnumber the negative ones, can be significant. In this paper, using combinatorial analysis we study the error probability in wrongly terminating a reinforcement learning activity whi
Authors
(none)
Tags
Stats
Related papers
- The Termination Critic (2019)0.00
- Stochastic Reinforcement Learning (2019)5.24
- Tackling Uncertainties In Multi-agent Reinforcement Learning Through Integration Of Agent Termination Dynamics (2025)2.26
- Understanding Individual Decision-making In Multi-agent Reinforcement Learning: A Dynamical Systems Approach (2025)0.00
- Loss Dynamics Of Temporal Difference Reinforcement Learning (2023)0.00
- Regret Analysis In Deterministic Reinforcement Learning (2021)0.00
- Implications Of Human Irrationality For Reinforcement Learning (2020)0.00
- Tiered Reinforcement Learning: Pessimism In The Face Of Uncertainty And Constant Regret (2022)0.00