LongVideoBench
Emerging
2 papers using it
18,062 HF downloads
45 HF likes
First seen: 2025
Dataset Card for LongVideoBench
Large multimodal models (LMMs) are handling increasingly long and complex inputs, yet few public benchmarks are available to assess these advances. To address this, we introduce LongVideoBench, a question-answering benchmark with video-language interleaved inputs up to an hour long. It comprises 3,763 web-collected videos with subtitles across diverse themes, designed to evaluate LMMs on long-term multimodal understanding. The… See the full description on the dataset page: https://huggingface.co/datasets/longvideobench/LongVideoBench.
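To make "video-language interleaved inputs" concrete, here is a minimal sketch of one way sampled frames and subtitle lines could be merged into a single time-ordered sequence. It is an illustration only, not the benchmark's own loader or data format: the `Frame` and `Subtitle` classes, their field names, and the `interleave` function are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class Frame:
    time_s: float   # timestamp of the sampled frame, in seconds
    frame_id: str   # hypothetical identifier standing in for image data


@dataclass
class Subtitle:
    time_s: float   # start time of the subtitle line, in seconds
    text: str


def interleave(
    frames: List[Frame], subtitles: List[Subtitle]
) -> List[Union[Frame, Subtitle]]:
    """Merge frames and subtitles into one sequence ordered by timestamp.

    sorted() is stable, so when timestamps tie, frames come first
    because they are listed first in the combined input.
    """
    return sorted([*frames, *subtitles], key=lambda item: item.time_s)


# Toy data (made up for illustration, not taken from the dataset).
frames = [Frame(0.0, "f0"), Frame(10.0, "f1"), Frame(20.0, "f2")]
subtitles = [Subtitle(4.5, "Welcome back."), Subtitle(10.0, "Let's begin.")]
sequence = interleave(frames, subtitles)
```

In this sketch the model input would alternate between visual and textual items according to where each falls on the video timeline; the real benchmark may sample frames and align subtitles differently.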
Hugging Face • cc-by-nc-sa-4.0