12M ego-centric manipulation videos
Emerging1papers using it
2025first seen
The '12M ego-centric manipulation videos' dataset contains a large collection of videos focused on human manipulation tasks from a first-person perspective, and it is used to evaluate the performance of models in predicting future frames based on initial frames and language instructions.