Deep Reinforcement Learning Of Marked Temporal Point Processes
2018 Β· Utkarsh Upadhyay, Abir de, Manuel Gomez-Rodriguez
Abstract
In a wide variety of applications, humans interact with a complex environment by means of asynchronous stochastic discrete events in continuous time. Can we design online interventions that will help humans achieve certain goals in such asynchronous setting? In this paper, we address the above problem from the perspective of deep reinforcement learning of marked temporal point processes, where both the actions taken by an agent and the feedback it receives from the environment are asynchronous stochastic discrete events characterized using marked temporal point processes. In doing so, we define the agent's policy using the intensity and mark distribution of the corresponding process and then derive a flexible policy gradient method, which embeds the agent's actions and the feedback it receives into real-valued vectors using deep recurrent neural networks. Our method does not make any assumptions on the functional form of the intensity and mark distribution of the feedback and it allows
Authors
(none)
Tags
Stats
Related papers
- Learning Temporal Point Processes Via Reinforcement Learning (2018)0.00
- Markov Decision Processes Under External Temporal Processes (2023)0.00
- Reinforcement Learning With Policy Mixture Model For Temporal Point Processes Clustering (2019)0.00
- Learning When To Act: Interval-aware Reinforcement Learning With Predictive Temporal Structure (2026)0.00
- Dynamic Deep-reinforcement-learning Algorithm In Partially Observable Markov Decision Processes (2023)0.00
- Minimum-delay Adaptation In Non-stationary Reinforcement Learning Via Online High-confidence Change-point Detection (2021)0.00
- Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control In Computationally Complex Environments (2019)0.00
- Revisiting State Augmentation Methods For Reinforcement Learning With Stochastic Delays (2021)10.35