MSVD-QA
Emerging
4 papers using it
First seen: 2022
MSVD-QA is a benchmark dataset of video clips paired with questions and answers. It is used to evaluate models on video question answering (VideoQA).
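As a rough illustration of what "video clips paired with questions and answers" can look like in code, the sketch below models one record and scores predictions by exact-match accuracy. The field names (`video_id`, `question`, `answer`), the sample records, and the choice of exact-match scoring are assumptions made for illustration; they are not taken from the dataset's official files or evaluation protocol.

```python
from dataclasses import dataclass


@dataclass
class VideoQAExample:
    """One hypothetical record: a clip reference plus a question and its answer."""
    video_id: str   # assumed identifier for the video clip
    question: str   # natural-language question about the clip
    answer: str     # reference answer


def exact_match_accuracy(examples, predictions):
    """Fraction of predictions equal to the reference answer,
    ignoring case and surrounding whitespace."""
    if len(examples) != len(predictions):
        raise ValueError("examples and predictions must have the same length")
    if not examples:
        return 0.0
    correct = sum(
        pred.strip().lower() == ex.answer.strip().lower()
        for ex, pred in zip(examples, predictions)
    )
    return correct / len(examples)


# Made-up records, for illustration only.
examples = [
    VideoQAExample("clip_0001", "what is the man doing?", "cooking"),
    VideoQAExample("clip_0002", "who is riding a bike?", "woman"),
    VideoQAExample("clip_0003", "what is the dog chasing?", "ball"),
]
predictions = ["Cooking", "man", "ball"]

accuracy = exact_match_accuracy(examples, predictions)
print(f"accuracy = {accuracy:.3f}")
```

A real evaluation would load the records from the dataset's own annotation files and follow whatever metric the paper under comparison reports.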
Papers using MSVD-QA (4)
- HORNet: Task-Guided Frame Selection for Video Question Answering with Vision-Language Models
- ResNetVLLM -- Multi-modal Vision LLM for the Video Understanding Task
- Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
- Video Question Answering Using CLIP-Guided Visual-Text Attention