← all datasets

LunarLander-v-2

Emerging
3papers using it
37HF downloads
1HF likes
2025first seen

LunarLander-v2 - Imitation Learning Datasets This is a dataset created by Imitation Learning Datasets project. It was created by using Stable Baselines weights from a PPO policy from HuggingFace. Description The dataset consists of 1,000 episodes with an average episodic reward of 500. Each entry consists of: obs (list

Papers using LunarLander-v-2 (3)

LunarLander-v-2 β€” datasets β€” reinforcement-learning