MLVU

Emerging
13 papers using it
First seen: 2025

MLVU is a dataset that contains 200 videos and 800 generated summaries, used to evaluate video-to-text summarization through multimodal question answering.

Papers using MLVU (13)
