Adversarial Recovery Of Agent Rewards From Latent Spaces Of The Limit Order Book
2019 Β· Jacobo Roa-Vicens, Yuanbo Wang, Virgile Mison, et al.
Abstract
Inverse reinforcement learning has proved its ability to explain state-action trajectories of expert agents by recovering their underlying reward functions in increasingly challenging environments. Recent advances in adversarial learning have allowed extending inverse RL to applications with non-stationary environment dynamics unknown to the agents, arbitrary structures of reward functions and improved handling of the ambiguities inherent to the ill-posed nature of inverse RL. This is particularly relevant in real time applications on stochastic environments involving risk, like volatile financial markets. Moreover, recent work on simulation of complex environments enable learning algorithms to engage with real market data through simulations of its latent space representations, avoiding a costly exploration of the original environment. In this paper, we explore whether adversarial inverse RL algorithms can be adapted and trained within such latent space simulations from real market da
Authors
(none)
Tags
Stats
Related papers
- Towards Inverse Reinforcement Learning For Limit Order Book Dynamics (2019)0.00
- Learning Robust Rewards With Adversarial Inverse Reinforcement Learning (2017)0.00
- Offline Inverse RL: New Solution Concepts And Provably Efficient Algorithms (2024)0.00
- Inverse Reinforcement Learning From Non-stationary Learning Agents (2024)0.00
- Towards Theoretical Understanding Of Inverse Reinforcement Learning (2023)0.00
- Active Learning For Risk-sensitive Inverse Reinforcement Learning (2019)0.00
- Robust Risk-sensitive Reinforcement Learning Agents For Trading Markets (2021)0.00
- Statistical Analysis Of Inverse Entropy-regularized Reinforcement Learning (2025)0.00