A Temporally Correlated Latent Exploration For Reinforcement Learning
2024 · Sumin Oh, Wansoo Kim, Hyunjin Kim
Abstract
Efficient exploration remains one of the longstanding problems of deep reinforcement learning. Instead of depending solely on extrinsic rewards from the environments, existing methods use intrinsic rewards to enhance exploration. However, we demonstrate that these methods are vulnerable to Noisy TV and stochasticity. To tackle this problem, we propose Temporally Correlated Latent Exploration (TeCLE), which is a novel intrinsic reward formulation that employs an action-conditioned latent space and temporal correlation. The action-conditioned latent space estimates the probability distribution of states, thereby avoiding the assignment of excessive intrinsic rewards to unpredictable states and effectively addressing both problems. Whereas previous works inject temporal correlation for action selection, the proposed method injects it for intrinsic reward computation. We find that the injected temporal correlation determines the exploratory behaviors of agents. Various experiments show tha
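To make "temporal correlation" concrete, here is a minimal sketch of noise whose consecutive samples are correlated. The abstract does not say which noise process TeCLE uses or where it enters the latent space, so the AR(1) process and the names `ar1_noise`, `lag1_autocorrelation`, and `rho` are illustrative assumptions, not the authors' formulation.

```python
import random


def ar1_noise(n_steps, rho, seed=0):
    """Sample unit-variance AR(1) noise (an assumed stand-in process).

    rho = 0 gives uncorrelated (white) noise; rho near 1 gives noise
    that changes slowly from one step to the next.
    """
    if n_steps < 1:
        raise ValueError("n_steps must be at least 1")
    if not -1.0 < rho < 1.0:
        raise ValueError("rho must lie strictly between -1 and 1")
    rng = random.Random(seed)
    # Scaling the innovation keeps the stationary variance equal to 1.
    scale = (1.0 - rho * rho) ** 0.5
    x = rng.gauss(0.0, 1.0)
    samples = [x]
    for _ in range(n_steps - 1):
        x = rho * x + scale * rng.gauss(0.0, 1.0)
        samples.append(x)
    return samples


def lag1_autocorrelation(samples):
    """Estimate the correlation between consecutive samples."""
    mean = sum(samples) / len(samples)
    centered = [s - mean for s in samples]
    numerator = sum(a * b for a, b in zip(centered, centered[1:]))
    denominator = sum(c * c for c in centered)
    return numerator / denominator


white = ar1_noise(5000, rho=0.0)
correlated = ar1_noise(5000, rho=0.9)
print(lag1_autocorrelation(white), lag1_autocorrelation(correlated))
```

In this sketch the single parameter `rho` controls how strongly each sample depends on the previous one, which is the kind of knob the abstract describes as shaping exploratory behavior.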
Related papers
- Temporal Representations For Exploration: Learning Complex Exploratory Behavior Without Extrinsic Rewards (2026)
- Self-supervised Exploration Via Temporal Inconsistency In Reinforcement Learning (2022)
- Directed Exploration In Reinforcement Learning From Linear Temporal Logic (2024)
- Random Latent Exploration For Deep Reinforcement Learning (2024)
- Temporal Difference Uncertainties As A Signal For Exploration (2020)
- Coordinated Exploration Via Intrinsic Rewards For Multi-agent Reinforcement Learning (2019)
- Information Content Exploration (2023)
- Beyond Noisy-TVs: Noise-robust Exploration Via Learning Progress Monitoring (2025)