MSVD-QA

Emerging

4papers using it

2022first seen

The 'MSVD-QA' dataset is a benchmark that contains video clips paired with questions and answers, used to evaluate the performance of models in the task of video question answering (VideoQA).

🔎 Find this dataset

Papers using MSVD-QA (4)

HORNet: Task-Guided Frame Selection for Video Question Answering with Vision-Language Models2026

ResNetVLLM -- Multi-modal Vision LLM for the Video Understanding Task2025

Zero-Shot Video Question Answering via Frozen Bidirectional Language Models2022 · 64 cites

Video Question Answering Using CLIP-Guided Visual-Text Attention2023 · 1 cites