CartPole-v-1

Name: CartPole-v-1
License: mit

Emerging

9papers using it

17HF downloads

4HF likes

2025first seen

CartPole-v1 - Imitation Learning Datasets This is a dataset created by Imitation Learning Datasets project. It was created by using Stable Baselines weights from a PPO policy from HuggingFace. Description The dataset consists of 1,000 episodes with an average episodic reward of 500. Each entry consists of: obs (list):

🤗 Hugging Face⚖ mit

Papers using CartPole-v-1 (9)

Not All Transitions Matter: Evidence from PPO2026

Directed-MAML: Meta Reinforcement Learning Algorithm with Task-directed Approximation2025 · 1 cites

ARISE: Adaptive Reinforcement Integrated with Swarm Exploration2026

Beyond ReLU: Chebyshev-DQN for Enhanced Deep Q-Networks2025

On-Policy Optimization of ANFIS Policies Using Proximal Policy Optimization2025

Hybrid Quantum-classical Policy Gradient For Adaptive Control Of Cyber-physical Systems: A Comparative Study Of VQC Vs. MLP2025

BASIL: Best-Action Symbolic Interpretable Learning for Evolving Compact RL Policies2025

Gradient Free Deep Reinforcement Learning With TabPFN2025

Hybrid Quantum-Classical Policy Gradient for Adaptive Control of Cyber-Physical Systems: A Comparative Study of VQC vs. MLP2025