ToolHop

Emerging
2 papers using it
516 HF downloads
23 HF likes
2025 first seen

[ACL 2025] ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
Data for the paper "ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use"
Junjie Ye, jjye23@m.fudan.edu.cn
Jan. 07, 2025
Introduction: Effective evaluation of mu

Papers using ToolHop (2)
