ToolBench

Name: ToolBench
License: apache-2.0

Emerging

3 papers using it

2,806 HF downloads

1 HF like

First seen: 2024

ToolBench is a large-scale benchmark that evaluates how well large language models use external tools. It is designed for stable evaluation, combining a virtual API server with a systematic evaluation approach.

Papers using ToolBench (3)

UTFix: Change Aware Unit Test Repairing using LLM (2025) · 3 cites

AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls (2024) · 1 cite

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models (2024)