MSVD
Emerging3papers using it
2020first seen
The MSVD (Microsoft Video Description) dataset contains a collection of video clips paired with multiple descriptive sentences, and it is used to evaluate video-text retrieval methods by measuring the ability to match videos with their corresponding textual descriptions.