
ToolBench

Canonical
12 papers using it
First seen: 2024

ToolBench is a benchmark of approximately 47,000 tools for evaluating how well large language models perform tool retrieval across various query types and probing methods.

Papers using ToolBench (12)
