OmniVideoBench
Emerging2papers using it
2,766HF downloads
5HF likes
2026first seen
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs β¨ Overview Recent advances in multimodal large language models (MLLMs) have brought remarkable progress in video understanding.However, most existing benchmarks fail to jointly evaluate both audio and visual reasoning β often focusing on one m
π€ Hugging Faceβ cc-by-nc-nd-4.0