
Simpler

Emerging

7 papers using it

First seen: 2025

The 'Simpler' dataset is a benchmark used to evaluate the spatial reasoning capabilities of Vision-Language-Action models.


Papers using Simpler (7)

DTP: A Simple Yet Effective Distracting Token Pruning Framework For Vision-language Action Models (2026)

DAM-VLA: A Dynamic Action Model-Based Vision-Language-Action Framework for Robot Manipulation (2026)

Beyond Attention Magnitude: Leveraging Inter-layer Rank Consistency for Efficient Vision-Language-Action Models (2026)

MVP-LAM: Learning Action-Centric Latent Action via Cross-Viewpoint Reconstruction (2026)

DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning (2025)

Villa-x: Enhancing Latent Action Modeling In Vision-language-action Models (2025)

VLA-Cache: Efficient Vision-Language-Action Manipulation via Adaptive Token Caching (2025)
