← all datasets

CartPole-v-1

Emerging
9papers using it
17HF downloads
4HF likes
2025first seen

CartPole-v1 - Imitation Learning Datasets This is a dataset created by Imitation Learning Datasets project. It was created by using Stable Baselines weights from a PPO policy from HuggingFace. Description The dataset consists of 1,000 episodes with an average episodic reward of 500. Each entry consists of: obs (list):

Papers using CartPole-v-1 (9)

CartPole-v-1 β€” datasets β€” reinforcement-learning