← all datasets

VTC-Bench

Emerging
2papers using it
128HF downloads
1HF likes
2025first seen

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining Paper | GitHub Visual Tool Chain-Bench (VTC-Bench) is a comprehensive benchmark designed to evaluate the tool-use proficiency and multi-tool composition capabilities of Multimodal Large Language Models (MLLMs). To emulate authentic c

Papers using VTC-Bench (2)

VTC-Bench β€” datasets β€” ai-agents