SEED-Bench
Canonical1papers using it
2,063HF downloads
23HF likes
2026first seen
SEED-Bench Card Benchmark details Benchmark type: SEED-Bench is a large-scale benchmark to evaluate Multimodal Large Language Models (MLLMs). It consists of 19K multiple choice questions with accurate human annotations, which covers 12 evaluation dimensions including the comprehension of both the image and video modality. Benchmark date: SEED-Bench was collected in July 2023. Paper or resources for more information: https://github.com/AILab-CVC/SEED-Bench License:β¦ See the full description on the dataset page: https://huggingface.co/datasets/AILab-CVC/SEED-Bench.
π€ Hugging Faceβ cc-by-nc-4.0