← all datasets

V*Bench

Emerging
3papers using it
2025first seen

The 'V* Bench' dataset/benchmark is used to evaluate the performance and efficiency of agentic multimodal large language models (MLLMs) in tasks involving visual perception and reasoning.

Papers using V*Bench (3)

V*Bench β€” datasets β€” llm-papers