CartPole-v-1
Emerging9papers using it
17HF downloads
4HF likes
2025first seen
CartPole-v1 - Imitation Learning Datasets This is a dataset created by Imitation Learning Datasets project. It was created by using Stable Baselines weights from a PPO policy from HuggingFace. Description The dataset consists of 1,000 episodes with an average episodic reward of 500. Each entry consists of: obs (list):
π€ Hugging Faceβ mit
Papers using CartPole-v-1 (9)
- Not All Transitions Matter: Evidence from PPODirected-MAML: Meta Reinforcement Learning Algorithm with Task-directed ApproximationARISE: Adaptive Reinforcement Integrated with Swarm ExplorationBeyond ReLU: Chebyshev-DQN for Enhanced Deep Q-NetworksOn-Policy Optimization of ANFIS Policies Using Proximal Policy OptimizationHybrid Quantum-classical Policy Gradient For Adaptive Control Of Cyber-physical Systems: A Comparative Study Of VQC Vs. MLPBASIL: Best-Action Symbolic Interpretable Learning for Evolving Compact RL PoliciesGradient Free Deep Reinforcement Learning With TabPFNHybrid Quantum-Classical Policy Gradient for Adaptive Control of Cyber-Physical Systems: A Comparative Study of VQC vs. MLP