MSVD

Emerging

3 papers using it

First seen: 2020

The MSVD (Microsoft Video Description) dataset is a collection of video clips, each paired with multiple descriptive sentences. It is used to evaluate video-text retrieval methods by measuring how well they match videos with their corresponding textual descriptions.
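
As an illustration of what "matching videos with their corresponding descriptions" can mean in practice, here is a minimal sketch of a Recall@K computation for text-to-video retrieval. Recall@K is a common retrieval metric, but this page does not state which metrics the listed papers report, so treat the choice as an assumption. The function name `recall_at_k`, the similarity scores, and the caption-to-video mapping are all hypothetical toy values, not MSVD data.

```python
def recall_at_k(similarity, caption_to_video, k):
    """Fraction of captions whose paired video ranks in the top k.

    similarity[i][j] is a hypothetical model score between caption i and video j.
    caption_to_video[i] is the index of the video that caption i describes.
    """
    hits = 0
    for scores, target in zip(similarity, caption_to_video):
        # Rank video indices by descending score for this caption.
        ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
        if target in ranked[:k]:
            hits += 1
    return hits / len(similarity)


# Toy data (assumed): 4 captions over 3 videos.
# Captions 0 and 1 both describe video 0, mirroring the
# multiple-sentences-per-clip structure described above.
similarity = [
    [0.9, 0.1, 0.3],
    [0.2, 0.8, 0.5],
    [0.1, 0.7, 0.4],
    [0.3, 0.2, 0.6],
]
caption_to_video = [0, 0, 1, 2]

r1 = recall_at_k(similarity, caption_to_video, k=1)
r2 = recall_at_k(similarity, caption_to_video, k=2)
print(f"R@1 = {r1:.2f}, R@2 = {r2:.2f}")
```

Because each clip has several captions, the sketch scores every caption as its own query against the full set of videos. Published protocols may differ in how they select or aggregate captions.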

Papers using MSVD (3)

Stacked Convolutional Deep Encoding Network for Video-Text Retrieval (2020) · 1 citation

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval (2021)

T2VIndexer: A Generative Video Indexer for Efficient Text-Video Retrieval (2024)