Reinforcement Learning In Economics And Finance
2020 Β· Arthur Charpentier, Romuald Elie, Carl Remlinger
Abstract
Reinforcement learning algorithms describe how an agent can learn an optimal action policy in a sequential decision process, through repeated experience. In a given environment, the agent policy provides him some running and terminal rewards. As in online learning, the agent learns sequentially. As in multi-armed bandit problems, when an agent picks an action, he can not infer ex-post the rewards induced by other action choices. In reinforcement learning, his actions have consequences: they influence not only rewards, but also future states of the world. The goal of reinforcement learning is to find an optimal policy -- a mapping from the states of the world to the set of actions, in order to maximize cumulative reward, which is a long term strategy. Exploring might be sub-optimal on a short-term horizon but could lead to optimal long-term ones. Many problems of optimal control, popular in economics for more than forty years, can be expressed in the reinforcement learning framework, an
Authors
(none)
Tags
Stats
Related papers
- From Reinforcement Learning To Optimal Control: A Unified Framework For Sequential Decisions (2019)0.00
- Average Reward Adjusted Discounted Reinforcement Learning: Near-blackwell-optimal Policies For Real-world Applications (2020)0.00
- Decentralized Reinforcement Learning: Global Decision-making Via Local Economic Transactions (2020)0.00
- An Agent Design With Goal Reaching Guarantees For Enhancement Of Learning (2024)0.00
- Direct And Indirect Reinforcement Learning (2019)10.74
- Demystifying Reinforcement Learning In Time-varying Systems (2022)0.00
- Reinforcement Learning With Algorithms From Probabilistic Structure Estimation (2021)0.00
- Automated Reinforcement Learning: An Overview (2022)0.00