LongVideoBench

Name: LongVideoBench
License: cc-by-nc-sa-4.0

Emerging

2papers using it

18,062HF downloads

45HF likes

2025first seen

Dataset Card for LongVideoBench Large multimodal models (LMMs) are handling increasingly longer and more complex inputs. However, few public benchmarks are available to assess these advancements. To address this, we introduce LongVideoBench, a question-answering benchmark with video-language interleaved inputs up to an hour long. It comprises 3,763 web-collected videos with subtitles across diverse themes, designed to evaluate LMMs on long-term multimodal understanding. The… See the full description on the dataset page: https://huggingface.co/datasets/longvideobench/LongVideoBench.

🤗 Hugging Face⚖ cc-by-nc-sa-4.0

Papers using LongVideoBench (2)

T*: Re-thinking Temporal Search For Long-form Video Understanding2025 · 5 cites

Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding2026