Video QA
Emerging5papers using it
2024first seen
The 'Video QA' dataset/benchmark contains video clips paired with questions and answers, and it is used to evaluate the ability of models to understand and reason about visual content in videos.
Papers using Video QA (5)
- Reinforcement Learning Tuning For Videollms: Reward Design And Data EfficiencyHyperTokens: Controlling Token Dynamics for Continual Video-Language UnderstandingReinforcing Structured Chain-of-Thought for Video UnderstandingMMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse AttentionCREMA: Generalizable and Efficient Video-Language Reasoning via
Multimodal Modular Fusion