CartPole
Emerging21papers using it
35HF downloads
0HF likes
2024first seen
CartPole is a benchmark used in reinforcement learning that involves balancing a pole on a moving cart, evaluating the performance of algorithms in managing this dynamic control task.
π€ Hugging Faceβ mit
Papers using CartPole (21)
- Improving the Effectiveness of Potential-Based Reward Shaping in
Reinforcement LearningK-score: Kalman Filter As A Principled Alternative To Reward Normalization In Reinforcement LearningResiduals-based Offline Reinforcement LearningReinforcement Learning for Control with Probabilistic Stability Guarantee: A Finite-Sample ApproachBayesian Conservative Policy Optimization (BCPO): A Novel Uncertainty-Calibrated Offline Reinforcement Learning with Credible Lower BoundsOnline Adaptive Reinforcement Learning with Echo State Networks for Non-Stationary DynamicsA Controlled Study of Double DQN and Dueling DQN Under Cross-Environment TransferEnhanced-FQL($\lambda$), an Efficient and Interpretable RL with novel Fuzzy Eligibility Traces and Segmented Experience ReplayLearning Without Critics? Revisiting GRPO in Classical Reinforcement Learning EnvironmentsNFQ2.0: The CartPole Benchmark RevisitedHeterogeneous Federated Reinforcement Learning Using Wasserstein BarycentersSafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning LibraryQuantifying First-order Markov Violations In Noisy Reinforcement Learning: A Causal Discovery ApproachImproving the Data-efficiency of Reinforcement Learning by Warm-starting with LLMBellman operator convergence enhancements in reinforcement learning algorithmsUsing Part-based Representations for Explainable Deep Reinforcement
LearningOptimizing Variational Quantum Circuits Using Metaheuristic Strategies
in Reinforcement LearningQuantifying First-Order Markov Violations in Noisy Reinforcement Learning: A Causal Discovery ApproachLearning to Control Dynamical Agents via Spiking Neural Networks and Metropolis-Hastings SamplingSampling Complexity of TD and PPO in RKHSDiagnosing Non-Markovian Observations in Reinforcement Learning via Prediction-Based Violation Scoring