V*Bench
Emerging3papers using it
2025first seen
The 'V* Bench' dataset/benchmark is used to evaluate the performance and efficiency of agentic multimodal large language models (MLLMs) in tasks involving visual perception and reasoning.
The 'V* Bench' dataset/benchmark is used to evaluate the performance and efficiency of agentic multimodal large language models (MLLMs) in tasks involving visual perception and reasoning.